Gene Information |
Locus tag | PICST_65037 |
Symbol | |
ID | 4851858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 3024002 |
End bp | 3026851 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | |
GC content | 44% |
IMG OID | 640393566 |
Product | predicted protein |
Protein accession | XP_001387145 |
Protein GI | 126275808 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5110] 26S proteasome regulatory complex component |
TIGRFAM ID | |
|
|
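The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the CDS is unspliced (the record lists "Is gene spliced" as No), and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


# Values taken from the record for PICST_65037.
length = gene_length_bp(3024002, 3026851)
protein = expected_protein_length_aa(length)
print(length, protein)  # 2850 949
```

Under these assumptions the computed values agree with the Gene Length (2850 bp) and Protein Length (949 aa) fields listed in the record.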
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.882076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCCTG CCAAAGAAAC GGAAACCGTG CCTCCCAAGG AGGTTACTGA AGCTACTAAG AAGAAGGAAC AGAAGGAGGA AGAGTTGTCT GAAGAAGACC AGCGTTTGAA GGATGACTTG GAACTTCTCG TGGAACGTTT GGCAGAGCCA GATCAAGAGC TGGGTTTGTA TGACAAATAC TTGTCGACAT TGAAGACCTA TATTAAGGAT TCCACTACTT CAATGACAGC TGTGCCCAAG CCATTGAAGT TCTTGAGACC TCACTATCCT GCCTTGACTG AATTGTACGA CAAATGGGCT GAAACTCACG GTGTCAAAAC AAATGTTGTC GTAAGTTTGG CCGATATTTT GTCTGTATTA GCTATGACTT ACTCTGACGA CGGAAAGAGA GACTCGTTGA AATACAGATT GTTGTCGTCT GATACTACTA TCGCCGACTG GGGTCACGAA TACATGCGTC ACTTGGCATT AGAAATCGGA GAATCGTACC AGGAGGAGTT GGGTTCGGAC GAATCCATCG AAAAATTGGT CCAATTGGCT ATACAGATTG TTCCTTTCTT CTTGGAACAC AATGCAGAAG CTGACGCCGT AGACTTGCTC TTGGAAATCG AAAGTATTGA CAAGTTACCT AAGTTCGTAG ATGACAGCAC TTACGCCAGA GTATGTCTTT ATATGGTCAG TTGTGTACCT TACTTAGCTC CTCCCGACGA TAAGTCATTC TTGCACACTG CCTATGCCAT CTATTTGGAA CATAGTCAAT TGACACAAGC CTTAACTTTG GCTATCAAGT TAGATGATGA GGACTTGATT AAGAAAGTTT TTGAAGTCAC CGACGATGTC TTGATTCATA AACAGTTGGG ACTCATGCTC GCCCAGCAGA AGAACAGCTT CAAGTACCCT GGTGAGAATC CAGAAGTGCA GGAAACCATC CACAATGTCA AGTTGCACGA ATACTACAGC TATCTTGCTA AGGAGTTGAA TCTCTTAGAT CCAAAGGTGC CTGAGGATAT CTATAAATCA CACTTGGAAA ACGCCAAGTT TGGTTTGGGA ACCTCTGGCT CTATTGACTC TGCCAAGCAG AACTTGGCTG CTGCGTTTGT CAACACCTTC ATCAACTTGG GATATGGTAC CGACAAGTTA ATTCAAACTG AAGAAGACAA CAAGTCGTGG ATCTACAAGA CCAAGGGACT CGGCATGGTT TCGACTACTG CCTCGTTGGG TTCTCTCCAC CAATGGAATA TCAACGAAGG TTTCCAGGTC TTGGACAAGT ACACCTATTC TTCTGAGGAC GAAATTAAAG CTGGTGCCTT ATTGGGTACG GGTATTGTTT CTGCTAATGT TCACGACGAT GTCGAAGCTG CTTTGGCTTT ACTTCAAGAC TATGTCTCTG ACCCAAACAA GCTCTTGCAA TCTTCTGCCA TTAATGGGTT AGGTATTGCA TTTGCCGGTT CCTCCAATGA GGAAGTGTTG AACTTGTTAT TGCCATTGGT TTCTGATTTA GATATTCCTT TTGAAATCTC CTGCTTAGCA GCTTTGGCTT TAGGGCATGT ATTTGTTGGT ACTTGTCACG GCGATATTAC ATCCACCATA TTGCAGACCT TATTGGAAAG AGACTTCATT CAATTGACCA ATAAGTTCAT CAAATTCATG TCTTTAGGTT TGGGTTTGTT ATACATGGGC AAGACCGAAC AAGTCGAGGA CGTATTGGAA ACCATCGATG CCATCGAACA CCCTATCTGT AAGACCTTGA AGGTCTTAGT GAATATCTGT GCTTATGCTG GTACCGGTAA TGTTTTGCAG 
ATCCAGTCCT TATTACAGAT GTGCACTGCT AAGCCTAAAG ATCAAGCCGC TGATGAAAAG AAATTGGAAC AATCAGAAGA AGACCAATTG AAGAGCGAAA ACAAAGAAGA CAAGACAGCT GAAGATGTCG CCGACGTAGA AATGGAAGAA GCCACAGCTG AAAAGAAGGC AAGCGAAAAG GAAGAAAAAG AAGAAAAAGA GGAAGAGGAA GAGGAAGAAA AGGAAGAGGA AGAAGAGTTG TTTCAGGGTA TTGCTGTTTT GGGTTTGGCC TGTATTGCTA TGGGAGATGA AATCGGTCAA GACATGTCAC TTCGTCACTT TGCCCATTTG ATGCATTATG GTAATTCTCT GATCCGTAGA GCTGTGCCAT TGGCAATGGG ATTGGTGTCT ACTTCATATC CACAAATGAA GGTGTTTGAT ACCTTGTCTC GTTACTCGCA CGACCCAGAC TTGGAAGTAG CTCAGAATGC CATCTACTCC ATGGGTTTGG TTGGTGCTGG TACAAACAAC GCCAGATTGG CACAGTTGTT GAGGCAATTG GCTTCTTACT ATATCAAGTC GCCAGACTCA TTGTTCATGG TACGAATTGC ACAGGGAATC TTGCACTTGG GTAAGGGCAC ATTGACGTTG AGTCCTTTCA ATAGTGAGAG AAGCATCTTA TCCAAGGTTT CATTGGCATC GTTGTTGACT ATTTCTGTAG CATTATTAGA TCCAAAGGCC TTCATCTTGA GTGACTCTAC TACCGAGACC TCCCATCAAG TATTGTACTA CTTGATTCCT GGTGTAAAGC CCAGAATGTT GGTTACAGTA GATGAAGATT TAAACCCTAT CAAGGTGAAT GTCAGAGTCG GTCAGGCTGT TGATGTGGTA GGACAGGCAG GTAGGCCCAA GACCATTACT GGATGGGTGA CTCAATCTAC TCCTGTGTTG TTGAACTACG GAGAAAGAGC CGAGTTGGAA AACACTGACG AATGGATCTC ATTGAGTGGG TCGTTGGACG GGGTTGTGAT TTTGAAGAAG AACCCTGACT TCATGGAGGT GGATGCCTAG
|
Protein sequence | MSPAKETETV PPKEVTEATK KKEQKEEELS EEDQRLKDDL ELLVERLAEP DQELGLYDKY LSTLKTYIKD STTSMTAVPK PLKFLRPHYP ALTELYDKWA ETHGVKTNVV VSLADILSVL AMTYSDDGKR DSLKYRLLSS DTTIADWGHE YMRHLALEIG ESYQEELGSD ESIEKLVQLA IQIVPFFLEH NAEADAVDLL LEIESIDKLP KFVDDSTYAR VCLYMVSCVP YLAPPDDKSF LHTAYAIYLE HSQLTQALTL AIKLDDEDLI KKVFEVTDDV LIHKQLGLML AQQKNSFKYP GENPEVQETI HNVKLHEYYS YLAKELNLLD PKVPEDIYKS HLENAKFGLG TSGSIDSAKQ NLAAAFVNTF INLGYGTDKL IQTEEDNKSW IYKTKGLGMV STTASLGSLH QWNINEGFQV LDKYTYSSED EIKAGALLGT GIVSANVHDD VEAALALLQD YVSDPNKLLQ SSAINGLGIA FAGSSNEEVL NLLLPLVSDL DIPFEISCLA ALALGHVFVG TCHGDITSTI LQTLLERDFI QLTNKFIKFM SLGLGLLYMG KTEQVEDVLE TIDAIEHPIC KTLKVLVNIC AYAGTGNVLQ IQSLLQMCTA KPKDQAADEK KLEQSEEDQL KSENKEDKTA EDVADVEMEE ATAEKKASEK EEKEEKEEEE EEEKEEEEEL FQGIAVLGLA CIAMGDEIGQ DMSLRHFAHL MHYGNSLIRR AVPLAMGLVS TSYPQMKVFD TLSRYSHDPD LEVAQNAIYS MGLVGAGTNN ARLAQLLRQL ASYYIKSPDS LFMVRIAQGI LHLGKGTLTL SPFNSERSIL SKVSLASLLT ISVALLDPKA FILSDSTTET SHQVLYYLIP GVKPRMLVTV DEDLNPIKVN VRVGQAVDVV GQAGRPKTIT GWVTQSTPVL LNYGERAELE NTDEWISLSG SLDGVVILKK NPDFMEVDA