Gene PICST_70077 details

Gene Information

Locus tag: PICST_70077
Symbol: THI20
ID: 4837639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 427321
End bp: 429280
Gene Length: 1960 bp
Protein Length: 583 aa
Translation table: 12
GC content: 42%
IMG OID: 640388954
Product: Phosphomethylpyrimidine kinase THI21 (HMP-phosphate kinase) (HMP-P kinase)
Protein accession: XP_001382844
Protein GI: 126132638
COG category: [H] Coenzyme transport and metabolism; [K] Transcription
COG ID: [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase; [COG0819] Putative transcription activator
TIGRFAM ID: [TIGR00097] phosphomethylpyrimidine kinase
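
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 427321
END_BP = 429280
PROTEIN_LENGTH_AA = 583


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def min_coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(START_BP, END_BP)
cds_len = min_coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 1960, matches the Gene Length field
print(cds_len)             # 1752
print(gene_len - cds_len)  # 208 bp of the gene span lie outside the coding region
```

The record itself does not say what the remaining 208 bp represent; the sketch only shows that the listed span is longer than the coding region needed for a 583 aa protein.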


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.583202
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
CTCTTTTCCA TTGTTCTTAC CATTCAATAC AGGCATCGTG TTGAAAATGA CAACATTTTC 
TGTGGTGAAA TTGAGAACTC CCACTGTCAA GTCCAAACCA GTCCTACCAG CTGTATTAAC
TATAGCAGGA TCCGACAACT CAGGTGGAGC CGGCATTGAG GCCGACTTGA AGACATTCAG
TGCCCACAAG GTTTACGGGT TGACTTGTAT TGCTGCTTTG ACGGCCCAAA ATACACAATT
GGTGAAAACA TTTGAAAAAA CTCCAAAAGA ATTGGTTAAA AACATCTTGC AGCTCAACTT
TGACGATTTC CTCTACGGAT ATGAAGACAG TACACAGCCA TTGAAGGTCA TCAAGACTGG
GATGCTCACA GAAGAAGCTG TTCATGTTAT CCAGGACTTT CTACCAGACA TCAAGAAACA
CAATGTGAAA CTAATAGTCG ACCCAGTGAT GATCAGTACT TCTGGATCCA GTCTTTTTGA
TAGCGAAGGC ATGAAGCTTT GTGTGAATAC CTTGATCAGT GGAGCCTATT TGATCACACC
TAATTTTGTA GAAGCCAGAG CACTTTGGGA GATTGCTTGT GGAGAAAGTG CAGCTATCGA
GAAGCTCACT ATCAACTCTT TGGATGACTT TATAGACTTT GTCAAGCAAT TGCAGAAGAC
CCTCAAGTCA CAGAATGTTT TAGTAAAGGG TGGACATATT CCTTGGGATT CCCGAACGGG
CAAGCCATTT GTTGGAACCA ATCTTGCTGA TGTTGAAGAC AGTATTGTTG TTTTGGATGT
GTTATATGAA TCTGAAATTG ATAAGGCGAC TGTGTTTGAG TCCAAGTACA TAAATACCAA
GGATTCACAT GGAACCGGTT GCACTCTTGC TTCAGCCATA TCTGCAAATG TAGCCAAGGG
AAAGAACTTG AAAGAAAGTA TCTCTTTATC CATAGACTAT ATCCACAAGG GAATGTTGAG
TGTAGGCAAG AAATTGGGAT ATGGAAATGG ACCCTTGAAC CACAATGTGG AACCCGAAGA
AAATCTAAGC AATGTCATCA TTGGAAACAG CACAGACACA TACATGAGCG TTAAGAAAGG
GAACCAGTCT TTCTTTGAGT ACTTCAAGAC CCATCCTAGT GTCAAGGAAA GCTGGAAGCT
CTACACTGAG CATAGATTTA TCCAGCAATT GGCTCAGGAC AAGTTGCCAT TCCAGCGCTT
CCTTTACTTC TTGAAGCAAG ATTACTACTA CTTGATCAAC TATGCACAGA TGCACGGGTT
AGCGGCTTCA GTCGCACCAA CATACCATCA AACCCATGCC GAAGCACTTA TCATAGGAGA
AGTCATCACT GAAATTGAGA AGCACAAGGA AAAGCTTTCC AAGAAATACG ACATTGATTA
TGAAAGAGAT ATTGATTTCG ATATCGAGTT GCAACCTGGA AAGGCATGCG TGGACTACTG
CAACTATCTC TTGGAAATTG GAAATAGGGA GAATTTCTTG GGTATCAAGG TAGCTTTGGC
TCCTTGTTTG CATGGATACG CTGAGGCTGG GTTGTATGGT AAGAGCATCA GAGAGAGCTA
TGACAAGAGT ACCTCCAGCT TGGATAAGGT ACTTTCTGAA ACCTACGACA CGTGGTTAGG
AGACTATAGT TCCGAATGGT ATTTGAACGC TCATAAAGAA GGAGAGGCTA CGCTTCAGGA
GTTGATGGAA TCGAACGACG TTTCAAATGA GAGAATGGAC GAACTTGTTG AGATTTTCAG
GAAGGTGACA GAGTTGGAAG TGCACTTTTG GGACGAGGTC TTGGATGTAT TACCATAATT
GTTGGTGAAC ATCATGAGTT GTAAGTTCTG ATGCAATAAC ATAGACTATG AAGTGTTCAC
GAGTCTCTCT AAGAAAGTAT TTCGATAGAG AAGATTGAAG TGCAAAGAAA ATAGAAAATG
TATAGAACTG TTTAACAAAT TGAATATAGA AAAAGCGTGG
 
Protein sequence
MTTFSVVKLR TPTVKSKPVL PAVLTIAGSD NSGGAGIEAD LKTFSAHKVY GLTCIAALTA 
QNTQLVKTFE KTPKELVKNI LQLNFDDFLY GYEDSTQPLK VIKTGMLTEE AVHVIQDFLP
DIKKHNVKLI VDPVMISTSG SSLFDSEGMK LCVNTLISGA YLITPNFVEA RALWEIACGE
SAAIEKLTIN SLDDFIDFVK QLQKTLKSQN VLVKGGHIPW DSRTGKPFVG TNLADVEDSI
VVLDVLYESE IDKATVFESK YINTKDSHGT GCTLASAISA NVAKGKNLKE SISLSIDYIH
KGMLSVGKKL GYGNGPLNHN VEPEENLSNV IIGNSTDTYM SVKKGNQSFF EYFKTHPSVK
ESWKLYTEHR FIQQLAQDKL PFQRFLYFLK QDYYYLINYA QMHGLAASVA PTYHQTHAEA
LIIGEVITEI EKHKEKLSKK YDIDYERDID FDIELQPGKA CVDYCNYLLE IGNRENFLGI
KVALAPCLHG YAEAGLYGKS IRESYDKSTS SLDKVLSETY DTWLGDYSSE WYLNAHKEGE
ATLQELMESN DVSNERMDEL VEIFRKVTEL EVHFWDEVLD VLP
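
The record lists translation table 12, the NCBI "Alternative Yeast Nuclear" code, in which CTG encodes serine rather than leucine. The sketch below shows one way to strip the 10-base block formatting used above, estimate GC content, and translate a coding fragment with that table. It is a hedged illustration: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is built by hand rather than taken from a library, and the test fragment is the first 30 bases of the coding region, copied from the gene sequence starting at its first ATG.

```python
BASES = "TCAG"

# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order
# (standard genetic code, NCBI translation table 1).
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_codon_table(aas: str) -> dict:
    """Map each codon to its one-letter amino acid ('*' marks a stop)."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


STANDARD = build_codon_table(STANDARD_AAS)
# Table 12 differs from the standard code in one codon: CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0 until the first stop codon or the end."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table.get(dna[i:i + 3], "X")  # 'X' for codons with unknown bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 coding bases, copied from the gene sequence above.
fragment = "ATGACAACAT TTTCTGTGGT GAAATTGAGA"
print(translate(fragment, TABLE_12))  # MTTFSVVKLR
print(translate("CTG", STANDARD))     # L
print(translate("CTG", TABLE_12))     # S
```

The translated fragment matches the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field.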