Gene PICST_77822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_77822 
Symbol 
ID4838878 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1507579 
End bp1509304 
Gene Length1726 bp 
Protein Length533 aa 
Translation table12 
GC content43% 
IMG OID640390193 
Productpredicted protein 
Protein accessionXP_001384244 
Protein GI150865144 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase
[COG2145] Hydroxyethylthiazole kinase, sugar kinase family 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase
[TIGR00694] hydroxyethylthiazole kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00729956 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.350827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACTACTTGTA CAATGGCTAC TGACGGTCAA GGAAAGAGCG CAAGAGCTCC TAATGATAAT 
GTGGACTACT CCTTATATTT GGTAACAGAC TCTACCATGA TTCCAGAATC ATCTACCTTC
ATCAAGCAGG TCCAGGAAGC CGTAGATAGC GGTGTTACTT TGGTTCAATT GCGTGAAAAG
TCGCTTTCGA CTCTTGAATT CGTTAAAAGA GCCGAAGAAG TTCACAGTAT AACCAAAAAG
AAGGGAATTC CTTTAATTAT CAATGATAGA ATTGATGTCG CACTCGCTAT AGATGCTGAA
GGTGTTCATG TTGGTCAGGA TGACATGCCT GCTGAATTGG CTAGAAAGTT GTTGGGCCCA
AATAAGATTT TGGGGGTCAC CTGCTCAAAT CCCGAAGAGG TCGATGTTGT AGTCAAACAG
AAAGTAGCCG ACTACGTTGG CTTGGGGACC GTGTACAAGA CTAACACCAA GAAGAATGTC
AGTGATCCCG AAGGCACGGG ACCCATCGGA ATCAGAAAAA TGCTTCGTGT TTTAGGAGAC
TACAACAAGG AAGCTGACTA CAAGATCAAG TGTGTGGCTA TCGGCGGAAT TAATCATTCC
AATGCCGATA AAGTGTTGTA CCAATGTCAA GTGACTGAGC AGAAGCTCGA TGGTGTGGCC
GTAGTTTCGT GTATAATGGC TTCAGCAGAT GCTAGGAAGG CTACTTTAGA CTTAGCAAAA
ATTATTGCTT CTGAGCCTGC CTTTTCCATT ACTACTCGTT TATCTGAATC TAACCTTGAT
AGTTTTCTGA AGCGTCATGG CATTAAGACT TTGGGCCAAG TTCAACCATT GGTTCACCAC
ATTACCAATA ATGTAGTGAA GAACTTCAGT GCCAATGTGA CCCTATCTAT TGGGGCGTCT
CCTATCATGT CTGAACTAGC TGAAGAATTC GACGAGCTTG TTGGCAATGC CAAACATATA
GCTTTGGTTT TAAATCTAGG AACACCTTCT CTGGACATGA TGTCAGTCTT TAAACAAGCA
ATTCAAGTAT ACAACAAATA TGGCAAACAT ATTATATTTG ATCCCGTCGC CTGTGGGGCT
ACCAAAGCTC GTCTAGAGTG TGCCAGAATC CTCTTGAACA CTGGCCATGT CTCTGTTATC
AAAGGAAATG TTGGTGAAAT CATGTCTGTA TGGAAGTTGA CTACTTTCTA CATTGCCAAA
TCTGTTGGAG AATCCAACAC CATGAGGGGA GTTGACTCCA TTTTCAATCT TCTGGAACAC
ATTTTGCTCG AAAGAGCTCG TCAAGTAGCC CTGGAGTTCA AGACAGTTGT TGTAGTCACC
GGTAAAAAGA ACTTCACGGT TTCTCCAACT GGTATAGTAA GATCATGTCT TGGAGGAAGT
CCATTGATGG GCAAGATAAC AGGTACTGGG TGTTCTTTAG GCAGCACTAT TGCTGCATTC
TTGGCAGCTG GAGCTGACGG TGAATTAGAA CAAGACTCGA CGTGTGTATT TGATGCTGCC
ACTATTGCAG TAGAGATGTA TAACAGTGCA GGTAAGCAAG CGGCAGTCAG TGTCAGTACT
CCAGGATCTT TCAGTACTAA ATTTCTTGAC CATCTAAGCA GGGGACAGGA TCTTATCCCA
ACTTTGTTCT CTAGACCCTA AATGCACAAT TGCATCTATG ACTCCATACA TATACTATTG
TATTGCATAT AGATAAGAAT TTCGTTAATT ATACATCGAT GTCAGA
 
Protein sequence
MATDGQGKSA RAPNDNVDYS LYLVTDSTMI PESSTFIKQV QEAVDSGVTL VQLREKSLST 
LEFVKRAEEV HSITKKKGIP LIINDRIDVA LAIDAEGVHV GQDDMPAELA RKLLGPNKIL
GVTCSNPEEV DVVVKQKVAD YVGLGTVYKT NTKKNVSDPE GTGPIGIRKM LRVLGDYNKE
ADYKIKCVAI GGINHSNADK VLYQCQVTEQ KLDGVAVVSC IMASADARKA TLDLAKIIAS
EPAFSITTRL SESNLDSFSK LQPLVHHITN NVVKNFSANV TLSIGASPIM SELAEEFDEL
VGNAKHIALV LNLGTPSSDM MSVFKQAIQV YNKYGKHIIF DPVACGATKA RLECARILLN
TGHVSVIKGN VGEIMSVWKL TTFYIAKSVG ESNTMRGVDS IFNLSEHILL ERARQVASEF
KTVVVVTGKK NFTVSPTGIV RSCLGGSPLM GKITGTGCSL GSTIAAFLAA GADGELEQDS
TCVFDAATIA VEMYNSAGKQ AAVSVSTPGS FSTKFLDHLS RGQDLIPTLF SRP