Gene Information |
Locus tag | PICST_43429 |
Symbol | |
ID | 4837753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 158762 |
End bp | 161086 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389068 |
Product | predicted protein |
Protein accession | XP_001383316 |
Protein GI | 150864482 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
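The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int, has_stop_codon: bool = True) -> int:
    """Residue count implied by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon (assumed present) does not encode a residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for PICST_43429.
length = gene_length_bp(158762, 161086)
residues = expected_protein_length(length)
print(length, residues)
```

With the values in this record, the computed figures agree with the listed Gene Length (2325 bp) and Protein Length (774 aa).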
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTCC TTTCCAGTGC GTTGAATTCC TTAACAGGGT CTTCTATTCC GTACACTCTT AAAGAAAAAA TCGTGGATCC CACTAGCACT TCCAATTTGG TGAATAGAAA CTCCATCTGG ACAGTGTACA ATGGCTTGAA CCCGAAGAGT GATCAGTCGC CAGTGACCGT ATTTGAGTTC AATCTTAAGG ATCCTGTGAA TATTCAAAGA CGCTGGGAGC CTTTGGCCCG AAATGCATTC AAAAAACTCA AGTTGATCAA ATTTCCTGCT ATTCTCTCTG TGATAGACTT TATCGAAAAC GATTCTTATT TATACATCGT CACTGAACCA GTGATCCCTT TGCTTAATTA CTTGCAAGAT TCGGAATTGC CATTGTCCCA AGATGCGAAA TTGTGCGGCC TTCAATCCAT TGCCCAAGCA TTGCTGTTTA TTAACATGTC CTGTAGTAGT GTGCACGGTA ATATCAACAT TTCCAGTTCT GTTTTTGTCA CAGCACTGGG CGATTGGAAG TTGTTTGGTT TTGAATTGTT AACCAACTTG AAGTCTGACC CAGACCAACC TCTCTATAGA CTTTCAGGGT CGTCACCGGA TTTCAGAAAC GTAGTTCCAG ATGAAGTGAA CTCTGATGGA GTAGAAGCTG TCAGAAGCTT TCCAATTAAA CTTGACTCTT ACAAGTATGG TGCGTTTGCT TACCAAGTTT TGTCCACTAG TGACTTCAGA GATATTAGCT CTCAATTCGA TGCTCGAAAC ATATCTAGCA AGGTCATTCC AAGCAGAATC GCGGGCCCAT TGCGAAAGTT GGTCAACTCA AAGTTGAACT TGAGAAGTAG TATCGATAAG TATGAACAAG AAACAAGTTC TTTCAACAAC ACCAACGCCC TCATAAAGCT TAATAAGCAA TTGGAGGACT TCAAGTTCCA AAATGACGAA CAGAAGATGG AGTTCATCAA GTTTGAATTG TCTGGTTATT TCGGAGAAAC TCATGCAGAG GGGTATTTTC CTTCTGGTTT CTTGAACTAT AAGCTCTTGC CAGAATTAAT TAGCCAATTT AGCGCTTTGT CCAAAGTCAA ACCAACCGTA AACACGTCAC CAGCAGAAAC TCAACAGCGT CAAGAAACTC TTGCTTTGCT CTTAGACTAC ATCTTGAAAT TTGGTTCCAA GCTTTCAGAA ATTGATTTCA ACAAGTCTAT AAAACCAATT ATTCTTGAAA CATTCAATCT TGGAGACAGA TCTATCAGAT TAGTTCTCTT GACCCACTTA CCAATGTATG CCTCCTTCTT GTCCGAATCC GATATCCAAC TGAGAATATT TCTAAACTTG ATCAGCGGTT TCCAAGATAC CAATTTCATG ATTAGAGAGA CTACGTTGAA ATCAATCACT ATTGTTATAG ATAAAATCTC TGTAAAGCAA GTTAACCAAG ACCTATTGAA GGTATTGGCG AAGTCACAGA TGGATCCTAA GCCATCTATA AGAGTCAACA CGTTAATCTT AATCATCAAG ATTTCCAGCA AGATTTACAA GAATTCCAAG AATAATGTAT TGATAACAGC ATTATCAAAG TCTTTGAGAG ACACATTTAC CCCCAGTAAG TTGACTGCAT TGTCTGGTTT TGAGAGTCTA ATCGATGAGT TCTCCTTAGA TGAAATTTGC ACAAAGATCT TGGGCCACCT TGCTATTTCG TTGATGGACA AAACTTCGAG CAAGGTGCGT AAGGAAGCCA AAAGAATCTT TCAATTGTAT TTAGACTCAG TTGAAGCTCA CGCGTCCACC TTGCCAAACA TTGATGCAGA TGATGAGGCT 
GAAGAAGCAG AGTTTTTCAG TAAATATGCT CCGACCATGA CAAATTCAAA CACTACAAGC AATGAAGCTA ATGATAGTTC TAACGGTGGT GGAGCCCTCT CGTTGGGCTG GAGCATGGTC AATAAATTTG TTGGACCATC TGCTGTGCAG GGCCCATTAA ACCATGACTT CAACAATTCC ACGCCCGATT TAACCAGAGA AGCGACACCT ACCGCTGAGA ATCCTTCAAG AATACCATCA AAGAAACAAC AATCTTGGAT GAGTGATGTT GTGGTAGATG ATGGAGACGG TTGGGGAGGC TTTGATGACA TTGACGATAC ACCCAAGACG ATTGTTGAAC CTCTAGCAGC ACCAAAGAAA TCTACCCCCA AACCAAGAGT CATCAAGAAA ACGGAGGCAC CTGTTTCGGG CCGTAAAATT TCTGGCCTTA AGTTGGGCGC CCCAACCAAG AAGCCAATCT CTGCTTTAAA GTTAGATTTG ACAGTTGAAG ATGACGATTC CAAGGCATGG GATGATGATT GGTAG
|
Protein sequence | MNFLSSALNS LTGSSIPYTL KEKIVDPTST SNLVNRNSIW TVYNGLNPKS DQSPVTVFEF NLKDPVNIQR RWEPLARNAF KKLKLIKFPA ILSVIDFIEN DSYLYIVTEP VIPLLNYLQD SELPLSQDAK LCGLQSIAQA LSFINMSCSS VHGNINISSS VFVTASGDWK LFGFELLTNL KSDPDQPLYR LSGSSPDFRN VVPDEVNSDG VEAVRSFPIK LDSYKYGAFA YQVLSTSDFR DISSQFDARN ISSKVIPSRI AGPLRKLVNS KLNLRSSIDK YEQETSSFNN TNALIKLNKQ LEDFKFQNDE QKMEFIKFEL SGYFGETHAE GYFPSGFLNY KLLPELISQF SALSKVKPTV NTSPAETQQR QETLALLLDY ILKFGSKLSE IDFNKSIKPI ILETFNLGDR SIRLVLLTHL PMYASFLSES DIQSRIFLNL ISGFQDTNFM IRETTLKSIT IVIDKISVKQ VNQDLLKVLA KSQMDPKPSI RVNTLILIIK ISSKIYKNSK NNVLITALSK SLRDTFTPSK LTALSGFESL IDEFSLDEIC TKILGHLAIS LMDKTSSKVR KEAKRIFQLY LDSVEAHAST LPNIDADDEA EEAEFFSKYA PTMTNSNTTS NEANDSSNGG GALSLGWSMV NKFVGPSAVQ GPLNHDFNNS TPDLTREATP TAENPSRIPS KKQQSWMSDV VVDDGDGWGG FDDIDDTPKT IVEPLAAPKK STPKPRVIKK TEAPVSGRKI SGLKLGAPTK KPISALKLDL TVEDDDSKAW DDDW
|
| |
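The record lists translation table 12, the alternative yeast nuclear code, which differs from the standard code in reading CTG as serine rather than leucine. The sketch below shows how the gene sequence could be translated under that table. It assumes the input is an in-frame coding sequence containing only A, C, G and T (spaces and line breaks are stripped), and the helper name `translate_table12` is hypothetical.

```python
# Amino acids for all 64 codons in NCBI order (bases T, C, A, G),
# using translation table 12: identical to the standard code except CTG -> S.
BASES = "TCAG"
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE_12 = {
    a + b + c: TABLE_12_AAS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_table12(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE_12[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_table12("ATGAACTTCC TTTCCAGTGC GTTGAATTCC"))
```

Applied to the start of the gene sequence, this yields the first ten residues of the listed protein. The CTG codon in the fragment `TTGCTGTTT` translates to serine, consistent with the `LSF` stretch in the protein sequence.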