Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_76300 |
Symbol | |
ID | 4836908 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2300422 |
End bp | 2303560 |
Gene Length | 3139 bp |
Protein Length | 904 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388223 |
Product | predicted protein |
Protein accession | XP_001382669 |
Protein GI | 150864000 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000841792 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGGAGATTAC ATTCTTCTGA TACAACTGGA CCTGTTGAAT CTTGATCTCA TCCATATGCT CTTGATCTGT TCTGATCTGA TTTTGAACTT CTGAGCCAGT CTTTAATCAT CTATTACGTC GTTCTTACTA CAGCATTATT CTCGTTCTCA GTACACCATA ATTGAAATCT TCAGTCTCCA AAACTTCACC AAACCGCACC TCCAGGCTTT CCCAAAATGC ATATCTTCAC TGCCAAACAC CAGAAGCTCA TTCTCCAGGT GTATCCGCCT GGAAAGGCAG TCGACAAAAG AGCCAACCCG TCGGAGCTCT CGTACTTACT CTACTACGCC TCCACCAGAA GAGTCAAGTT GGAGAAGGTA GTCACTTTTC TCGAGGCAAA GACAGCTAGT GATGTCTCCC ACAACCGTTC AGGTAACCTC CAAGTGACTT TGGAGATCAT CTCCTCTCTT ATCGAGAAAT GTGCCGACAA CTTGAACGTG TTTGCTCTGT ACGTGTGTTC GATTCTCACG TCAATTCTTA ACACCAAGGA CTTGTCACTC TGCAAAAGCG TTGTAAACAC CTATGGAGTG TTCTGTGAGA ATCTCGACGG CGGCTTGTTT TCAGGTGACA AGGCTTTTGT AGACTTGTTC ACCAGCTTGT CTGAAGACTT GATCTTGACA GGCTCCAAGA ACCTGCAAGC CCCGGGCCCT AACCAGTTGG AATGGAAGAT GATAGCACTT ATGGCCATTA GACACATTTC GCCATGTTTG AGTCATTCGA CCAAAATTGC CACCAAGTTC ATCAACATCT CAATTCCTTT CTTGCTGAAG ACGATTCACA GCACAAACTC ACAAAGTAAC TTGCTTTCAC GTGTCCGGAG CAACATAAAT GTAGAAGGGG ACGATCGCCG TCTTCAGAGA GTTCTTCTGT CAAAGGTTCC TCAAAATATC CACAAGCAAA TCCAAGAAGA TTTCGACAAC GATAATTTGC TGGAAGATGA CATCACCGAA GAAGCTCTCC GTAGTTTGAA GGCTTTGTTT AACACCACCC TAACAAGTCA GATCTCACTG GCCACTAGAG CTGTTGTCAA ATACAATTAC GAAAGTAAAA CCGACCTGAA ATGGGGAGCC ACTTTCTTAG AGATGTGCAC CACCTGGATC CCAGTCCAGC TTCGATTTGT AAGTTTGTCG ACCTTGATGG CCAGATTGAC TACAATATCT AGCGAGGCCA CCAACAGTAG TCCCAGTTTT GGTTTACAAA CGCAGTACGC TAGTCATATC TTGGCTTTGG TATCTTCTGA TGTCAACATG ATTGGTTTGT CTGTTTCTGA CATCATTCAG CAGATATTGT CGTTGCAATC AAATTTGATC TTGGATCAGC TGCTGTTCTT GAAGCCAGAA CAGGTCAAGA GATTGTCGTC CATATACTCT GGCTGTATTT GTAACTTGGC ATCCCACATC TACTACTACG ACCAGGTTCC TGATGCCATC CAAGAGATCT TGGTCAAGAT CGACACCGTT GTAGACTTTT CATTTGTCGA CAGAGACCAG GACGTTCTGC AAAACATCAA CGCCAACAAC ATCCACAACT TGGTGATGAC ATTGTTGGGC GATGTTTCTG TGATCTTTAG TACTTTGACC AAGAAGTCAA GCTCGATAGC CAGAAACCAT GTAAACTTGG AGCATTGGGA AATAAGCTTG CCGTTGTTGT CACCTGAGAG TCTGTATGAC AACGATACGC TTAGAAAGAC TGTCTTCACA CCTTCACAGA TCATTGACAT TCAAATTAAG TACTTGAAGG TGTTGCACGA TTTCTTAACT AACGAGTTGT CTTCAGCTGG CAACACCCCT CAGCCGGTTA AGAAGAATTC GTTAGATCCA AAGACAATTG AGTCCAAGAG ATCGTCATTT GAGTCTGTCA ACTCTGAGAA TTTGAACGGT ACAGTCAAGG ACTTGTTGAA ACCCAACATC AATCAGTATA TATCCAACCC CAACAACGTG ATCTCTCATT TCTTGCTCTA TGTGCACAAA TTCTTCAACT TCCACGAGTC TCCCAATACT GAAGTTGTAC TTTCTTTGAT TTCTGTGATG AAAGACATGC TCAACATATT GGGTGTCAAC TTTGTAACCA ACTTCTTACC TTTCCTTTAC AACTGGTTGA TTCCCTTGAA CGATATTAGC GATGTTCCAT CCAATGCGAA GTTCAGAGAT TCCATTGCAC ATATTGTGAC CTACTATTGT TTGAAGACAT TGGATGACAA ATACCCCGAC GACTTGGAAG GTTATGCTTG TGGCAGTAAG TTTTTCTCGA AGTTGTTGCG AGCAGTGGAG TACAGAAAAG TCAACAAGTT GTGGATCCAG GGACTTGATT CGTCGCCTAC CGATCTTGAG ATCATCAAGA ACACGGCCAG TATCAACGCC GTTGTTCACA GTTCTGATGT TTACGAGAAC GATACCAATT CGAAGTTCTC CTTTACCAGA AAGGACTACG ACGACTTTGT CTGTGGCAAT AACTTCACCA TTGTCCACAT CAATCCAGCC AAGTCTTTGG ATTTGAACGG AATCTCTACT GTAGTGCATG CTGATGCAAG AGGTTCCTCT GGAAGCGGAA TCTTAGGTCC CGGTGATATT AGCAATAAGT TTGGTGGTTC ATTCCAGGGC TCGGGTTCGA CTCCAGAAAG TGAAGTTCAC TCGCTGATTG AACTGTTAAG GCACAATGGC AACTCTGGCT TAGGCTATGG GTTAGGCACT GTAGGCGATA TCAGCTCTAT CCATTCTGAG ATCTTGCAGA ATACCCAGCA TTTGAACGGT AGAAACTTCA GTTTGACAGC GAACGGAACC TTCAACAGTG ACATCACCAG CTCGTCGATT CTAACCTCTG ATGGCAGATA TGTCAATTCG CCTAGAGTCG CCGACTTGAA GGACTTGATG ACAGACCCCA GAAGAGGCTA CAGAAAGACG TCGTTGATAA GCGATAACAT CTCTACCTCT ACTACAAGCC ACATTGGCCA GACACCAGGT TCTGTTCTTG GAAAGCAGAT GGTGTCTACA GACTTGGAAT CGATTCTCAC CAGCTTGAAC ACTGAAGATG ATTCACGGAT TATAGTCTAG TATTTATTGT TATAGAGATG TTCATGTAAA ATAAAATTGA TAAGATGAA
|
Protein sequence | MHIFTAKHQK LILQVYPPGK AVDKRANPSE LSYLLYYAST RRVKLEKVVT FLEAKTASDV SHNRSGNLQV TLEIISSLIE KCADNLNVFA SYVCSILTSI LNTKDLSLCK SVVNTYGVFC ENLDGGLFSG DKAFVDLFTS LSEDLILTGS KNSQAPGPNQ LEWKMIALMA IRHISPCLSH STKIATKFIN ISIPFLSKTI HSTNSQSNLL SRVRSNINVE GDDRRLQRVL SSKVPQNIHK QIQEDFDNDN LSEDDITEEA LRSLKALFNT TLTSQISSAT RAVVKYNYES KTDSKWGATF LEMCTTWIPV QLRFVSLSTL MARLTTISSE ATNSSPSFGL QTQYASHILA LVSSDVNMIG LSVSDIIQQI LSLQSNLILD QSSFLKPEQV KRLSSIYSGC ICNLASHIYY YDQVPDAIQE ILVKIDTVVD FSFVDRDQDV SQNINANNIH NLVMTLLGDV SVIFSTLTKK SSSIARNHVN LEHWEISLPL LSPESSYDND TLRKTVFTPS QIIDIQIKYL KVLHDFLTNE LSSAGNTPQP RSSFESVNSE NLNGTVKDLL KPNINQYISN PNNVISHFLL YVHKFFNFHE SPNTEVVLSL ISVMKDMLNI LGVNFVTNFL PFLYNWLIPL NDISDVPSNA KFRDSIAHIV TYYCLKTLDD KYPDDLEGYA CGSKFFSKLL RAVEYRKVNK LWIQGLDSSP TDLEIIKNTA SINANDTNSK FSFTRKDYDD FVCGNNFTIV HINPAKSLDL NGISTVVHAD ARGSSGSGIL GPESEVHSSI ESLRHNGNSG LGYGLGTVGD ISSIHSEILQ NTQHLNGRNF SLTANGTFNS DITSSSILTS DGRYVNSPRV ADLKDLMTDP RRGYRKTSHI GQTPGSVLGK QMVSTDLESI LTSLNTEDDS RIIV
|
| |