Gene PICST_59866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_59866 
SymbolTHI80 
ID4839110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp825167 
End bp826168 
Gene Length1002 bp 
Protein Length333 aa 
Translation table12 
GC content45% 
IMG OID640390425 
ProductThiamine pyrophosphokinase (TPK) (Thiamine kinase) 
Protein accessionXP_001385167 
Protein GI150865803 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1564] Thiamine pyrophosphokinase 
TIGRFAM ID[TIGR01378] thiamine pyrophosphokinase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.324549 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAGG CTAAGAAGCT CGAGTCGGAA GTGGTGGAAT CCCCGGACGA CATACAGATT 
CCAGCTCCGG ATCTTGCGCA TACTCTCATA CGACCCTTCG AGTTCCTTGT CAACTCCAGA
GACAATTGCC ACCGCAACGC ATTAGTCATT CTAAACCAAC TGCTAACAGG TATAGATGTG
CCGCGACTAT GGTCCAATAC AGAATTGCAT GTTTGTGCGG ATGGAGGCGC CAATCAATTG
TACGATTATT TCGAAGCAGA TACAAAATCA CATTCTCACC TAGCCACTGA ACAAACTAGT
GAACAAGCTA GCGATCTCAT TCGTCAGCAG TATATTCCAC AGTTTATTGT AGGAGACCTA
GACTCGCTAC GTGACGACGT TCGTGACTAC TACGAGAGAA AAGGTGCACG TATAATTCCA
CAGTATACCC AATATTCTAC AGACTTTTCC AAAGCTATAG CTACAGTAAG ATTGTACTAC
TATTCTGAAG CACTGAGACA GGTGTTGGTC AATGATTCTA TAGACACCAA CAACGGACTC
GCAGAGATAA TCGAAAAATA CGAAGCCGGT AGCAAGCAAG AGCAAACTGT CCGAATCTAT
ATCTTAAGCG GAATCGGAGG TCGTTTTGAC CAGACCATCC ATTCGATATC ACAGCTCTAC
ATACTCAACC AGTCCCACCC ATTTCTTCAG CATTTCTTCA TCACAACCAG TGATCTCATT
TTCCTTCTTA AGAAAGGCGT TAATTACGTA GCATATCCCA GTAAAACGAC ATTCCATCTG
GCCCAGGTTC CTACCTGTGG CTTGTTACCT TTGGGCAACT CGAAAGTCAT GATATCCAGT
CACGGATTGA AGTATGATGT CCGCAACTGG GAAAGCGAAA TGCTTGGAAA TGTCAGCTCC
AGTAATGGGA TCAGCGGTGT CGATGGTGTT GTTGTTGAAG TATCTGGCCC ACTTGTCATG
AACATCGAAA TAGAGCATGG TGTAGAAGAT CTGAAATTAT AG
 
Protein sequence
MSKAKKLESE VVESPDDIQI PAPDLAHTLI RPFEFLVNSR DNCHRNALVI LNQSLTGIDV 
PRLWSNTELH VCADGGANQL YDYFEADTKS HSHLATEQTS EQASDLIRQQ YIPQFIVGDL
DSLRDDVRDY YERKGARIIP QYTQYSTDFS KAIATVRLYY YSEASRQVLV NDSIDTNNGL
AEIIEKYEAG SKQEQTVRIY ILSGIGGRFD QTIHSISQLY ILNQSHPFLQ HFFITTSDLI
FLLKKGVNYV AYPSKTTFHS AQVPTCGLLP LGNSKVMISS HGLKYDVRNW ESEMLGNVSS
SNGISGVDGV VVEVSGPLVM NIEIEHGVED SKL