Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_72879 |
Symbol | |
ID | 4839783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 640318 |
End bp | 643299 |
Gene Length | 2982 bp |
Protein Length | 967 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391098 |
Product | predicted protein |
Protein accession | XP_001385471 |
Protein GI | 150866013 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.828998 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGGCAC TTGGTCTAAT CCTTCTATTA CTGACGGCAA TGCTTTTCAC GAGCCCGATG ACTGGAGCAG ATTCCCCAGG ATGTCGTCCC GTGTACATGT ATCCATCGTA TGCTCGGATT ACGTCGTTTG ACGAGTCCCA TACCAAATTT GCATCTAAAT ACTCGCTATA TCTCTATAGA GAACAGGGTA AGGATCCAAT TCCTGATGAA AATAATGGAT ACGAGCGTTT GGACGGAATA CCCGTCCTCT TCATTCCAGG TAATGCTGGG AGCTACCGTC AGGCTCGTTC TATAGCAGCA CAGCTGTCAA ATCTCTACTT TGACAAAAAA GTACATAACA GTGGAGCACG TAACTTTGAT TTTTTCACAG CTGACTTCAA CGAGGACTTC ACGGCGTTTC ATGGCCGTAC CATGCTCGAC CAGGCTGAAT TTCTAAACGA AGCCATACAT TTCATACTAG GTTTGTATTC CAATACGGAC AACCCTCCGA AATCGGTTAT AATAGTTGGC CACTCGATGG GTGGTGTAGT AGCCCGAGTC ATGCTCACTT TACCCAACTA TACCGACGGA ACTGTGAACA CCATCATCAC GTTGGCTTCG CCTCACGCAG CCGCTCCGCT AACGTTTGAT GGGGATATCT TAAAAATATA CTCTGCTGTA GATCGGTTCT GGTTCGATGG ATTCCATCCG ACTACGCTGG ATATTTCTCA GACAGCACAT CGTAGATTGC ACGATGTTTC ATTGATTTCC ATAACTGGAG GTTTAACTGA TACAACGTTG CCAGCTGACT ATACTACTTT GAGCTTTTTG GTGCCTCCTT CAAATGGATT CACCGTGTAC AGCACAGGAA TCCCTAATGT TTGGACCAAT GTGGACCACT TGGCCATTGT TTGGTGCCAT CAATTGCGGA CCCAAATAGC CAAAGCCATG CTAGAAATCG CGGACTTCTC CTCTCCTTCA AGAACGTATC TGTTAGAAAG ACGAATGCAG GTGTTTAGAA ATAACTTCTT GACTGGATAT GAAGACTACG CCAGCCAGGA TCTTGTTCCT AACATAGAAC AGCAACAGAA CTTGGCTATG AAATTGGACT TGGCTCAAAT CAAGTCATTC AACATAAACG GCGATAAAAA GCTCAGGATC ACGAAAGATA ATCAAAACAG TAAATGGAAC ATATTCCAAC TAGGCAACGA TGCAAAATTG CAATTGGATG TGTTAAGCTC GCTTGAACCA ACTGAATGGG AATCTTGGTT AGCTGACGAA GAATCCAATA GACCAGTCAT TTTACTATGT TCCAATTTGG GTGAAAACAA TCAGGAGCTT GATTTGACGG ATTTCACTAA TGACCAGACT AACGAATTCG TAGAGTTAAA ATGTATTGAC ACCGCCAGAG ACGTCCATTT AGTACCTCGG TCTATGCTGG ATTCAAAATC ATTAGGCGAG TCTTCGTTTG GCGGAGAAAA GACTCCATTC TATTCGTTGC AATACAACGA CACGATACTA ACTCGCTATG ATCTTGTTGT AGTTGCTCAA AGAAATATAG CAACAGATAG CGACTTTGTT ATAGCAGAAC TTGCTGATCA GAAATCCACT ATTTTCGAGC TTGGAAAACA CATGTCTACT CTTTTCCACA GCAAAGCCGA TCTCTCACTA TCGTCTAACC GCCCATTGTC TGTCAACATC AAATTGCCTG CTGCTTGGAG CAGTTTGCTA TCGTATAAAT TGAAGCTAAA CCTCCCTGAC AAAACAGAAG GAAAGTTTGC CTCCTTCATT CGGCAATGGA TAGACGAACC ATACGAAAGC AAGTGGCATA TCAATGTGGA AAAAAACAAT GTAATCACAT TGAGAATGCA CGGAATCTCT CCATTCGTAC CTTTCAAAGT AAAGGATGAC TACGGGTTGA ATATCCAACT ATGGTCTGAC ACAAACACCA AGGAAGAGAT ACCATTGGAT ATTGAGCTCT CTGTGGATTT GATCGACAGT TTCAGATTGT TTGTCATGAG ATATCGATTA TCGATTGTAG CCACATGTGT TCTGATTTCA TTGTTGGCCA TGCTTGTACA ATTTCAATTG TACTTTAGAA CCGGCAAGTT CCCCAACTTT ATCTTTGTGC TTTCATACCT CAACACCGGC TGGCCACTCG CGATGATCAT ATCTATCCTC ATAGGCCTTA ATCCCATAGT CAAGTTGGGT TGGGTGCAAT ATTTGCTCAA CTTCATTGAT CCCGTGGTAT TACTGGATGC CAATGAAGTC AACCTTTCGC TCAAACAAGA GTTCCGGCTC AACTCGTTCT ACTTGGGTCT AGAAGAGTCC AGTCTCTGGT TCATTGGCAT CATGCTTTAC TTCATTGGAA CCTTCTTGGT CGTAGCCACC TACTATTTAT TATCAGCTAT CAGCGTCTTG GGATACGCTT TGGTGAATCA TTTGCCTAAA ACTACTTCTC CCACAAAAAC TCGCATCGTC GTCACCTTCT TGCTAATCAT GATGATACCA ATTTATATCC CGTACCAGAT CGTATACATT ATCAGTTGTG TTATCCAGGC TATCAACGTT TTGAAATCGG TTAACAACTC TCAGTTGTTC AACTACCAGA TCTCGCTCTT GATTCTCATG TTGTGGATTC TCCCGGTGAA CATCCCCATT GTTATTGTGT TTGTCCACAA CTTGTCGGTC AACTGGAAGA CGCCGTTCTC GTCGCACCAC AACTTGCTTT CCATAGTTCC GGTGTTATTG TTAACAGAGA GGAACGGGCT CTTGTCTCGT TTGCCAAAGA AAAAAGACCA GTTCTTCTTC AAGTGGTATA TGGGGTACTT TATCTTCTAC TGTTTGATCT ACGGCAGTCG GCACACGTAC TGGCTCCATC ATTTGTTCAA CTTGATGAGT TGTTTGGTGT TGGTGCTTAC GTTTGGCGAT GAAGAGAAGA AGGAAGAAGT CAGCATTACA TAGCATCACT AGCATGTACA TAATATAAGC ATTAACATAG AT
|
Protein sequence | MLFTSPMTGA DSPGCRPVYM YPSYARITSF DESHTKFASK YSLYLYREQG KDPIPDENNG YERLDGIPVL FIPGNAGSYR QARSIAAQSS NLYFDKKVHN SGARNFDFFT ADFNEDFTAF HGRTMLDQAE FLNEAIHFIL GLYSNTDNPP KSVIIVGHSM GGVVARVMLT LPNYTDGTVN TIITLASPHA AAPLTFDGDI LKIYSAVDRF WFDGFHPTTS DISQTAHRRL HDVSLISITG GLTDTTLPAD YTTLSFLVPP SNGFTVYSTG IPNVWTNVDH LAIVWCHQLR TQIAKAMLEI ADFSSPSRTY SLERRMQVFR NNFLTGYEDY ASQDLVPNIE QQQNLAMKLD LAQIKSFNIN GDKKLRITKD NQNSKWNIFQ LGNDAKLQLD VLSSLEPTEW ESWLADEESN RPVILLCSNL GENNQELDLT DFTNDQTNEF VELKCIDTAR DVHLVPRSMS DSKSLGESSF GGEKTPFYSL QYNDTILTRY DLVVVAQRNI ATDSDFVIAE LADQKSTIFE LGKHMSTLFH SKADLSLSSN RPLSVNIKLP AAWSSLLSYK LKLNLPDKTE GKFASFIRQW IDEPYESKWH INVEKNNVIT LRMHGISPFV PFKVKDDYGL NIQLWSDTNT KEEIPLDIEL SVDLIDSFRL FVMRYRLSIV ATCVSISLLA MLVQFQLYFR TGKFPNFIFV LSYLNTGWPL AMIISILIGL NPIVKLGWVQ YLLNFIDPVV LSDANEVNLS LKQEFRLNSF YLGLEESSLW FIGIMLYFIG TFLVVATYYL LSAISVLGYA LVNHLPKTTS PTKTRIVVTF LLIMMIPIYI PYQIVYIISC VIQAINVLKS VNNSQLFNYQ ISLLILMLWI LPVNIPIVIV FVHNLSVNWK TPFSSHHNLL SIVPVLLLTE RNGLLSRLPK KKDQFFFKWY MGYFIFYCLI YGSRHTYWLH HLFNLMSCLV LVLTFGDEEK KEEVSIT
|
| |