
YOR343W-B Protein

Protein abundance data, domains, shared domains with other proteins, protein sequence retrieval for various strains, sequence-based physico-chemical properties, protein modification sites, and external identifiers for the protein.


Aliases: YOR343C-B
Protein Product: gag-pol fusion protein
Feature Type: transposable element gene
EC Number: 2.7.7.7, 2.7.7.49, 3.1.26.4

AlphaFold Protein Structure

AlphaFold, developed by DeepMind, is an AI program that accurately predicts protein structures from amino acid sequences, enabling visualization of protein conformations. The predicted structures can be accessed through the Protein Data Bank (PDB) and AlphaFold Protein Structure Database.


Model Confidence

Very high | Confident | Low | Very low

Experimental Data

This section contains experimentally derived protein half-life data obtained using stable isotope labeling by amino acids in cell culture (SILAC) coupled with mass spectrometry. It also contains protein abundance data for both untreated and treated cells, obtained from over 20 studies. The abundance data have been normalized and converted to a common unit of molecules per cell.


Protein Half Life

No half-life data available for YOR343W-B.

Protein Abundance

No protein abundance data available for YOR343W-B.

Domains and Classification - S288C

Collection of computationally identified domains and motifs, as determined by InterProScan analysis; includes protein coordinates for the domain, a domain Description, a Source and corresponding accession ID, and the number of S. cerevisiae genes that share the same domain.


Increase the total number of rows showing on this page by using the pull-down located below the table, or use the page scroll at the table's top right to browse through its pages; use the arrows to the right of a column header to sort by that column; filter the table using the "Filter" box at the top of the table.

Gene | Protein Coordinates | Accession ID | Description | Source | No. of Genes with Domain

Domain Locations

Visual representation of the locations of the domains within the protein, as listed in the Domains and Classification table. Each row displays the domain(s) derived from a different Source, with domains color-coded according to this Source.

Scroll over a domain to view its exact coordinates and its Description.

Shared Domains

This diagram displays domains (colored squares) shared between the given protein (yellow circle) and other proteins (gray circles); the domains are color-coded according to their source, as displayed in the Domain Locations table, above.


Click on a gene or domain name to go to its specific page within SGD; drag any of the gene or domain objects around within the visualization for easier viewing; click “Reset” to automatically redraw the diagram.

Sequence

Protein sequence for the given gene in S288C and other strains, when available. Use the pull-down menu under "Strain" to select the sequence for a specific strain. The displayed sequence can be downloaded in FASTA format as a .txt file. Amino acids displayed in blue represent modification sites. More detailed evidence for these modification sites is presented in the Post-translational Modifications table, located just below the protein sequence.


1 MESQQLHQNP HCPHGSAYAS VTSKEVPSNQ DPLAVSASNL PEFDRDSTKV NSQEETTPGT
61 SAVPENHHHV SPQPASVPPP QNGQYQQHGM MTPNKAMASN WAHYQQPSMM TCSHYQTSPA
121 YYQPDPHYPL PQYIPPLSTS SPDPIDSQDQ HSEVPQAKTK VRNNVLPPHP HTSEENFSTW
181 VKFYIRFLKN SNLGDIIPND QGEIKRQMTY EEHAYIYNTF QAFAPFHLLP TWVKQILEIN
241 YSDILTVLCK SVSKMQTNNQ ELKDWIALAN LEYNGSTSAD TFEITVSTII QRLKENNINV
301 SDRLACQLIL KGLSGDFKYL RNQYRTKTNM KLSQLFAEIQ LIYDENKIMN LNKPSQYKQH
361 SEYKNVSRTS PNTTNTKVTT RNYHRTNSSK PRAAKAHNIA TSSKFSRVNN DHINESTVSS
421 QYLSDDNELS LSQQQKESKP TRTIDSNDEL PDHLLIDSGA SQTLVRSAHY LHHATPNSEI
481 NIVDAQKQDI PINAIGNLHF NFQNGTKTSI KALHTPNIAY DLLSLSELTN QNITACFTRN
541 TLERSDGTVL APIVKHGDFY WLSKKYLIPS HISKLTINNV NKSKSVNKYP YPLIHRMLGH
601 ANFRSIQKSL KKNAVTYLKE SDIEWSNAST YQCPDCLIGK STKHRHVKGS RLKYQESYEP
661 FQYLHTDIFG PVHHLPKSAP SYFISFTDEK TRFQWVYPLH DRREESILNV FTSILAFIKN
721 QFNARVLVIQ MDRGSEYTNK TLHKFFTNRG ITACYTTTAD SRAHGVAERL NRTLLNDCRT
781 LLHCSGLPNH LWFSAVEFST IIRNSLVSPK NDKSARQHAG LAGLDITTIL PFGQPVIVNN
841 HNPDSKIHPR GIPGYALHPS RNSYGYIIYL PSLKKTVDTT NYVILQNNQT KLDQFDYDTL
901 TFDDDLNRLT AHNQSFIEQN ETEQSYDQNT ESDHDYQSEI EINSDPLVND FSSQSLNPLQ
961 LDKEPVQKVR APKEVDADIS EYNILPSTIR SRTPHIINKE STEMGGTIES DTTSPRHSST
1021 FTARNQKRPG SPNDMIDLTS QDRVNYGLEN IKTTRLGGTE EPYIQRNSDT NIKYRTTNST
1081 PSIDDRSSNS DSTTPIISIE TKAACDNTPS IDTDPPEYRS SDHATPNIMP DKSSKNVTAD
1141 SILDDLPLPD LTHKSPTDTS DVSKDIPHIH SRQTNSSLGG MDDSNVLTTT KSKKRSLEDN
1201 ETEIEVSRDT WNNKNMRSLE PPRSKKRINL IAAIKGVKSI KPVRTTLRYD EAITYNKDNK
1261 EKDRYVEAYH KEISQLLKMN TWDTNKYYDR NDIDPKKVIN SMFIFNKKRD GTHKARFVAR
1321 GDIQHPDTYD SDMQSNTVHH YALMTSLSIA LDNDYYITQL DISSAYLYAD IKEELYIRPP
1381 PHLGLNDKLL RLRKSLYGLK QSGANWYETI KSYLINCCDM QEVRGWSCVF KNSQVTICLF
1441 VDDMILFSKD LNANKKIITT LKKQYDTKII NLGEGDNEIQ YDILGLEIKY QRSKYMKLGM
1501 EKSLTEKLPK LNVPLNPKGK KLRAPGQPGH YIDQDELEID EDEYKEKVHE MQKLIGLASY
1561 VGYKFRFDLL YYINTLAQHI LFPSRQVLDM TYELIQFMWD TRDKQLIWHK NKPTKPDNKL
1621 VAISDASYGN QPYYKSQIGN IFLLNGKVIG GKSTKASLTC TSTTEAEIHA VSEAIPLLNN
1681 LSHLVQELNK KPIIKGLLTD SRSTISIIKS TNEEKFRNRF FGTKAMRLRD EVSGNNLYVY
1741 YIETKKNIAD VMTKPLPIKT FKLLTNKWIH *

* Blue amino acids indicate modification sites. More information below.
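
The numbered display above can be turned back into a plain sequence for use with other tools. The following is a minimal Python sketch, assuming the layout shown here (a leading position number, space-separated blocks of ten residues, and a trailing `*` marking the stop). The function names `parse_numbered_sequence` and `to_fasta` are illustrative and are not part of SGD.

```python
import textwrap

def parse_numbered_sequence(block: str) -> str:
    """Strip position numbers, spaces, and the stop marker (*) from a display block."""
    residues = []
    for line in block.splitlines():
        for token in line.split():
            token = token.rstrip("*")  # drop a stop marker, attached or standalone
            if not token or token.isdigit():
                continue  # skip empty tokens and leading position numbers
            residues.append(token)
    return "".join(residues)

def to_fasta(name: str, sequence: str, width: int = 60) -> str:
    """Format a sequence as a FASTA record with fixed-width lines."""
    return f">{name}\n" + "\n".join(textwrap.wrap(sequence, width)) + "\n"

# First two lines of the display above, used as sample input.
sample = """1 MESQQLHQNP HCPHGSAYAS VTSKEVPSNQ DPLAVSASNL PEFDRDSTKV NSQEETTPGT
61 SAVPENHHHV SPQPASVPPP QNGQYQQHGM MTPNKAMASN WAHYQQPSMM TCSHYQTSPA"""

seq = parse_numbered_sequence(sample)
print(len(seq))
print(to_fasta("YOR343W-B", seq))
```

Applied to the full display, the same function should return all 1770 residues without the terminal `*`.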

Post-translational Modifications - S288C

Modification sites for the protein in the selected strain, based on the presence of a residue in the specific strain, as inferred from experimental evidence.

9 entries for 9 sites


Site | Modification | Modifier | Reference
S370 | phosphorylated residue | - | MacGilvray ME, et al. (2020) PMID: 32597660
S402 | phosphorylated residue | - | MacGilvray ME, et al. (2020) PMID: 32597660
S1019 | phosphorylated residue | - | MacGilvray ME, et al. (2020) PMID: 32597660
S1031 | phosphorylated residue | - | MacGilvray ME, et al. (2020) PMID: 32597660
S1087 | phosphorylated residue | - | Zhou X, et al. (2021) PMID: 33481703
S1088 | phosphorylated residue | - | Zhou X, et al. (2021) PMID: 33481703
S1090 | phosphorylated residue | - | Zhou X, et al. (2021) PMID: 33481703
T1094 | phosphorylated residue | - | Zhou X, et al. (2021) PMID: 33481703
K1746 | ubiquitinylated lysine | RSP5 | Fang NN, et al. (2014) PMID: 25344756
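
Each site label combines a one-letter residue code with a 1-based position in the protein sequence, so a label can be checked against the sequence directly. A minimal Python sketch follows; the helper `check_site` and its `offset` parameter are hypothetical names introduced for illustration.

```python
import re

def check_site(sequence: str, site: str, offset: int = 1) -> bool:
    """Return True if `site` (e.g. "S370") matches the residue at that position.

    `offset` is the 1-based position of the first residue in `sequence`,
    so a fragment of the full protein can be checked on its own.
    """
    match = re.fullmatch(r"([A-Z])(\d+)", site)
    if match is None:
        raise ValueError(f"unrecognized site label: {site!r}")
    residue, position = match.group(1), int(match.group(2))
    index = position - offset
    return 0 <= index < len(sequence) and sequence[index] == residue

# Residues 361-420 of the S288C sequence shown in the Sequence section.
fragment = "SEYKNVSRTSPNTTNTKVTTRNYHRTNSSKPRAAKAHNIATSSKFSRVNNDHINESTVSS"

print(check_site(fragment, "S370", offset=361))
print(check_site(fragment, "S402", offset=361))
```

With the full 1770-residue sequence, the default `offset=1` applies and every label in the table can be checked in a loop.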

Sequence-Based Physico-chemical Properties - S288C

Calculated protein properties, including amino acid composition, length, coding region calculations, and atomic composition.

Amino Acid Composition

Sort the table by a column using the arrow to the right of its header; download all properties as a .txt file using the "Download Properties" button.

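Amino Acid | Frequency | Percentage
A | 78 | 4.41
C | 16 | 0.90
D | 114 | 6.44
E | 87 | 4.92
F | 49 | 2.77
G | 60 | 3.39
H | 64 | 3.62
I | 127 | 7.18
K | 134 | 7.57
L | 150 | 8.47
M | 30 | 1.69
N | 134 | 7.57
P | 99 | 5.59
Q | 84 | 4.75
R | 75 | 4.24
S | 164 | 9.27
T | 138 | 7.80
V | 70 | 3.95
W | 15 | 0.85
Y | 82 | 4.63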
Y824.63

Physical Details

Length (a.a): 1770
Molecular Weight (Da): 202171.6
Isoelectric Point (pI): 8.35
Formula: C8932H14026N2490O2775S46
Aliphatic Index: 73.16
Instability Index: 45.93

Coding Region Translation Calculations

Codon Bias: 0.05
Codon Adaptation Index: 0.15
Frequency of Optimal Codons: 0.47
Hydropathicity of Protein: -0.71
Aromaticity Score: 0.08
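
The Aromaticity Score can be reproduced from the Amino Acid Composition table, assuming it is the combined relative frequency of Phe, Trp, and Tyr (a common definition; the page itself does not state the formula). A minimal Python sketch under that assumption, with `aromaticity` as an illustrative name:

```python
def aromaticity(counts: dict[str, int]) -> float:
    """Fraction of residues that are aromatic (F, W, or Y)."""
    total = sum(counts.values())
    aromatic = sum(counts.get(aa, 0) for aa in "FWY")
    return aromatic / total

# Residue counts from the Amino Acid Composition table for S288C.
counts = {
    "A": 78, "C": 16, "D": 114, "E": 87, "F": 49, "G": 60, "H": 64,
    "I": 127, "K": 134, "L": 150, "M": 30, "N": 134, "P": 99, "Q": 84,
    "R": 75, "S": 164, "T": 138, "V": 70, "W": 15, "Y": 82,
}

print(round(aromaticity(counts), 2))
```

Here (49 + 15 + 82) / 1770 rounds to the listed 0.08.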

Extinction Coefficients at 280 nm

ALL Cys residues appear as half cystines: 205680.0
NO Cys residues appear as half cystines: 204680.0
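
Both values can be reproduced from the residue counts, assuming the widely used estimate of 5500 per Trp, 1490 per Tyr, and 125 per cystine (a disulfide-bonded pair of Cys), in units of M^-1 cm^-1. The page does not state its formula or units, so this is a sketch under that assumption; `extinction_coefficient_280` is an illustrative name.

```python
def extinction_coefficient_280(n_trp: int, n_tyr: int, n_cys: int, cystines: bool) -> int:
    """Estimate the molar extinction coefficient at 280 nm (assumed M^-1 cm^-1)."""
    coefficient = 5500 * n_trp + 1490 * n_tyr
    if cystines:
        # Each cystine (two disulfide-bonded cysteines) contributes 125.
        coefficient += 125 * (n_cys // 2)
    return coefficient

# Counts from the Amino Acid Composition table: W = 15, Y = 82, C = 16.
print(extinction_coefficient_280(15, 82, 16, cystines=True))   # 205680
print(extinction_coefficient_280(15, 82, 16, cystines=False))  # 204680
```

The difference of 1000 between the two values corresponds to the eight possible cystines formed by 16 Cys residues.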

Atomic Composition


Atom | Frequency | Percentage

Data not found or not available for S288C

External Identifiers

List of external identifiers for the protein from various database sources.

17 entries for 5 sources



External ID | Source
2.7.7.7 | ExPASy
2.7.7.49 | ExPASy
3.1.26.4 | ExPASy
398366311 | GenBank/EMBL/DDBJ
CAA99667.1 | GenBank/EMBL/DDBJ
CAA99670.1 | GenBank/EMBL/DDBJ
Z75252 | GenBank/EMBL/DDBJ
Z75251 | GenBank/EMBL/DDBJ
74627284 | GenBank/EMBL/DDBJ
1420747 | GenBank/EMBL/DDBJ
Showing 1 to 10 of 17 entries

Resources


Homologs

AnalogYeast | BLASTP at NCBI

Protein Databases

AlphaFold Protein Structure | Pfam domains | SUPERFAMILY | UniProtKB

Localization

YeastGFP

Post-translational Modifications

PhosphoGRID