Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_77822 |
Symbol | |
ID | 4838878 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1507579 |
End bp | 1509304 |
Gene Length | 1726 bp |
Protein Length | 533 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390193 |
Product | predicted protein |
Protein accession | XP_001384244 |
Protein GI | 150865144 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase [COG2145] Hydroxyethylthiazole kinase, sugar kinase family |
TIGRFAM ID | [TIGR00693] thiamine-phosphate pyrophosphorylase [TIGR00694] hydroxyethylthiazole kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00729956 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.350827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACTACTTGTA CAATGGCTAC TGACGGTCAA GGAAAGAGCG CAAGAGCTCC TAATGATAAT GTGGACTACT CCTTATATTT GGTAACAGAC TCTACCATGA TTCCAGAATC ATCTACCTTC ATCAAGCAGG TCCAGGAAGC CGTAGATAGC GGTGTTACTT TGGTTCAATT GCGTGAAAAG TCGCTTTCGA CTCTTGAATT CGTTAAAAGA GCCGAAGAAG TTCACAGTAT AACCAAAAAG AAGGGAATTC CTTTAATTAT CAATGATAGA ATTGATGTCG CACTCGCTAT AGATGCTGAA GGTGTTCATG TTGGTCAGGA TGACATGCCT GCTGAATTGG CTAGAAAGTT GTTGGGCCCA AATAAGATTT TGGGGGTCAC CTGCTCAAAT CCCGAAGAGG TCGATGTTGT AGTCAAACAG AAAGTAGCCG ACTACGTTGG CTTGGGGACC GTGTACAAGA CTAACACCAA GAAGAATGTC AGTGATCCCG AAGGCACGGG ACCCATCGGA ATCAGAAAAA TGCTTCGTGT TTTAGGAGAC TACAACAAGG AAGCTGACTA CAAGATCAAG TGTGTGGCTA TCGGCGGAAT TAATCATTCC AATGCCGATA AAGTGTTGTA CCAATGTCAA GTGACTGAGC AGAAGCTCGA TGGTGTGGCC GTAGTTTCGT GTATAATGGC TTCAGCAGAT GCTAGGAAGG CTACTTTAGA CTTAGCAAAA ATTATTGCTT CTGAGCCTGC CTTTTCCATT ACTACTCGTT TATCTGAATC TAACCTTGAT AGTTTTCTGA AGCGTCATGG CATTAAGACT TTGGGCCAAG TTCAACCATT GGTTCACCAC ATTACCAATA ATGTAGTGAA GAACTTCAGT GCCAATGTGA CCCTATCTAT TGGGGCGTCT CCTATCATGT CTGAACTAGC TGAAGAATTC GACGAGCTTG TTGGCAATGC CAAACATATA GCTTTGGTTT TAAATCTAGG AACACCTTCT CTGGACATGA TGTCAGTCTT TAAACAAGCA ATTCAAGTAT ACAACAAATA TGGCAAACAT ATTATATTTG ATCCCGTCGC CTGTGGGGCT ACCAAAGCTC GTCTAGAGTG TGCCAGAATC CTCTTGAACA CTGGCCATGT CTCTGTTATC AAAGGAAATG TTGGTGAAAT CATGTCTGTA TGGAAGTTGA CTACTTTCTA CATTGCCAAA TCTGTTGGAG AATCCAACAC CATGAGGGGA GTTGACTCCA TTTTCAATCT TCTGGAACAC ATTTTGCTCG AAAGAGCTCG TCAAGTAGCC CTGGAGTTCA AGACAGTTGT TGTAGTCACC GGTAAAAAGA ACTTCACGGT TTCTCCAACT GGTATAGTAA GATCATGTCT TGGAGGAAGT CCATTGATGG GCAAGATAAC AGGTACTGGG TGTTCTTTAG GCAGCACTAT TGCTGCATTC TTGGCAGCTG GAGCTGACGG TGAATTAGAA CAAGACTCGA CGTGTGTATT TGATGCTGCC ACTATTGCAG TAGAGATGTA TAACAGTGCA GGTAAGCAAG CGGCAGTCAG TGTCAGTACT CCAGGATCTT TCAGTACTAA ATTTCTTGAC CATCTAAGCA GGGGACAGGA TCTTATCCCA ACTTTGTTCT CTAGACCCTA AATGCACAAT TGCATCTATG ACTCCATACA TATACTATTG TATTGCATAT AGATAAGAAT TTCGTTAATT ATACATCGAT GTCAGA
|
Protein sequence | MATDGQGKSA RAPNDNVDYS LYLVTDSTMI PESSTFIKQV QEAVDSGVTL VQLREKSLST LEFVKRAEEV HSITKKKGIP LIINDRIDVA LAIDAEGVHV GQDDMPAELA RKLLGPNKIL GVTCSNPEEV DVVVKQKVAD YVGLGTVYKT NTKKNVSDPE GTGPIGIRKM LRVLGDYNKE ADYKIKCVAI GGINHSNADK VLYQCQVTEQ KLDGVAVVSC IMASADARKA TLDLAKIIAS EPAFSITTRL SESNLDSFSK LQPLVHHITN NVVKNFSANV TLSIGASPIM SELAEEFDEL VGNAKHIALV LNLGTPSSDM MSVFKQAIQV YNKYGKHIIF DPVACGATKA RLECARILLN TGHVSVIKGN VGEIMSVWKL TTFYIAKSVG ESNTMRGVDS IFNLSEHILL ERARQVASEF KTVVVVTGKK NFTVSPTGIV RSCLGGSPLM GKITGTGCSL GSTIAAFLAA GADGELEQDS TCVFDAATIA VEMYNSAGKQ AAVSVSTPGS FSTKFLDHLS RGQDLIPTLF SRP
|
| |