Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_59866 |
Symbol | THI80 |
ID | 4839110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 825167 |
End bp | 826168 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390425 |
Product | Thiamine pyrophosphokinase (TPK) (Thiamine kinase) |
Protein accession | XP_001385167 |
Protein GI | 150865803 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1564] Thiamine pyrophosphokinase |
TIGRFAM ID | [TIGR01378] thiamine pyrophosphokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.324549 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAGG CTAAGAAGCT CGAGTCGGAA GTGGTGGAAT CCCCGGACGA CATACAGATT CCAGCTCCGG ATCTTGCGCA TACTCTCATA CGACCCTTCG AGTTCCTTGT CAACTCCAGA GACAATTGCC ACCGCAACGC ATTAGTCATT CTAAACCAAC TGCTAACAGG TATAGATGTG CCGCGACTAT GGTCCAATAC AGAATTGCAT GTTTGTGCGG ATGGAGGCGC CAATCAATTG TACGATTATT TCGAAGCAGA TACAAAATCA CATTCTCACC TAGCCACTGA ACAAACTAGT GAACAAGCTA GCGATCTCAT TCGTCAGCAG TATATTCCAC AGTTTATTGT AGGAGACCTA GACTCGCTAC GTGACGACGT TCGTGACTAC TACGAGAGAA AAGGTGCACG TATAATTCCA CAGTATACCC AATATTCTAC AGACTTTTCC AAAGCTATAG CTACAGTAAG ATTGTACTAC TATTCTGAAG CACTGAGACA GGTGTTGGTC AATGATTCTA TAGACACCAA CAACGGACTC GCAGAGATAA TCGAAAAATA CGAAGCCGGT AGCAAGCAAG AGCAAACTGT CCGAATCTAT ATCTTAAGCG GAATCGGAGG TCGTTTTGAC CAGACCATCC ATTCGATATC ACAGCTCTAC ATACTCAACC AGTCCCACCC ATTTCTTCAG CATTTCTTCA TCACAACCAG TGATCTCATT TTCCTTCTTA AGAAAGGCGT TAATTACGTA GCATATCCCA GTAAAACGAC ATTCCATCTG GCCCAGGTTC CTACCTGTGG CTTGTTACCT TTGGGCAACT CGAAAGTCAT GATATCCAGT CACGGATTGA AGTATGATGT CCGCAACTGG GAAAGCGAAA TGCTTGGAAA TGTCAGCTCC AGTAATGGGA TCAGCGGTGT CGATGGTGTT GTTGTTGAAG TATCTGGCCC ACTTGTCATG AACATCGAAA TAGAGCATGG TGTAGAAGAT CTGAAATTAT AG
|
Protein sequence | MSKAKKLESE VVESPDDIQI PAPDLAHTLI RPFEFLVNSR DNCHRNALVI LNQSLTGIDV PRLWSNTELH VCADGGANQL YDYFEADTKS HSHLATEQTS EQASDLIRQQ YIPQFIVGDL DSLRDDVRDY YERKGARIIP QYTQYSTDFS KAIATVRLYY YSEASRQVLV NDSIDTNNGL AEIIEKYEAG SKQEQTVRIY ILSGIGGRFD QTIHSISQLY ILNQSHPFLQ HFFITTSDLI FLLKKGVNYV AYPSKTTFHS AQVPTCGLLP LGNSKVMISS HGLKYDVRNW ESEMLGNVSS SNGISGVDGV VVEVSGPLVM NIEIEHGVED SKL
|
| |