Gene Teth514_0530 details


Gene Information

Locus tag: Teth514_0530
Symbol: thiH
ID: 5876580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 547787
End bp: 549187
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 36%
IMG OID: 641540866
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_001662174
Protein GI: 167039189
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
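
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(547787, 549187)
print(length)                  # 1401
print(protein_length(length))  # 466
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed for this record.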


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAAAG AAAAAGCAGA TTTTATTGAT GATGAAAAGA TAAGACAGGA TTTAGAAAAG 
GCTAAAAAAG CGACAATTAA ATATGCATTA GAAATTATAG AGAAAGCTAA AAAGCTAAAA
GGCATTACTC CTGAAGAAGC GGCGGTACTT TTAAATGTAG AAGATGAGGA TTTGCTTAAT
GAGATGTTTA AAGTAGCAAG GTATATAAAA GAAGAAATAT ATGGAAATAG AATTGTGATA
TTTGCCCCCC TTTATGTAAG TAACTACTGT GTAAACAACT GTAGATATTG TGGTTACAGA
CATTCTAATG AGCAGGAAAG AAAAAAGCTT ACAATGGAAG AAGTGAGAAG AGAAGTTGAG
ATTTTGGAAG AGATGGGACA TAAGAGATTA GCAGTTGAAG CTGGAGAAGA CCCTGTAAAT
TGCCCTATAG ATTATATTAT CGATGTAATA AAGACGATAT ACGATACAAA ACTTAAAAAC
GGAAGCATAA GAAGGGTAAA TGTCAATATA GCAGCGACTA CTGTGGAAAA TTACAAAAAA
CTTAAAGAAG TAGGAATAGG GACTTATATT TTATTCCAAG AAACTTACCA TAGACCTACG
TATGAATACA TGCATCCACA AGGTCCAAAA CACGATTACG ACTACCATTT GACTGCTATG
GATAGGGCTA TGGAGGCAGG CATTGACGAC GTAGGATTAG GGGTTTTGTA TGGGCTTTAT
GATTACAAAT ACGAAACTGT TGCGATGCTT TATCATGCGA ACCATTTAGA GGAGAAATTT
GGAGTTGGGC CACATACTAT TTCAGTACCG CGACTTAGAC CAGCTCTTAA CACTCCCATA
GATAAATTCC CATATATTGT ATCAGATAAA GACTTTAAAA AATTAGTAGC CGTCATAAGA
ATGGCAGTGC CCTATACAGG GATGATTTTG TCTACAAGAG AGAAGCCTAA ATTTAGAGAA
GAAGTAATAA GCATTGGCAT TTCTCAGATT AGTGCAGGTT CTTGTACAGG AGTAGGTGGA
TATCATGAAG AGATATCCAA AAAAGGTGGT TCAAAGCCAC AATTTGAGGT AGAAGACAAA
AGAAGTCCTA ATGAAATTTT GAGGACTTTG TGTGAACAAG GGTATCTCCC AAGTTATTGT
ACTGCCTGCT ACAGAATGGG ACGTACAGGA GACAGGTTTA TGACCTTTGC GAAATCAGGG
CAAATACACA ACTTCTGTCT ACCTAATGCG ATACTAACCT TCAAAGAGTT TTTGATTGAT
TACGGGGATG AAAAAACTAA GGAAATTGGA GAAAAAGCTA TAGCGGTAAA TTTAGAGAAA
ATTCCATCAA TAACTGTAAG GGAAGAGACA AAGAGAAGGC TTACAAGAAT AGAAAATGGA
GAAAGAGATC TTTTCTTTTA A
 
Protein sequence
MIKEKADFID DEKIRQDLEK AKKATIKYAL EIIEKAKKLK GITPEEAAVL LNVEDEDLLN 
EMFKVARYIK EEIYGNRIVI FAPLYVSNYC VNNCRYCGYR HSNEQERKKL TMEEVRREVE
ILEEMGHKRL AVEAGEDPVN CPIDYIIDVI KTIYDTKLKN GSIRRVNVNI AATTVENYKK
LKEVGIGTYI LFQETYHRPT YEYMHPQGPK HDYDYHLTAM DRAMEAGIDD VGLGVLYGLY
DYKYETVAML YHANHLEEKF GVGPHTISVP RLRPALNTPI DKFPYIVSDK DFKKLVAVIR
MAVPYTGMIL STREKPKFRE EVISIGISQI SAGSCTGVGG YHEEISKKGG SKPQFEVEDK
RSPNEILRTL CEQGYLPSYC TACYRMGRTG DRFMTFAKSG QIHNFCLPNA ILTFKEFLID
YGDEKTKEIG EKAIAVNLEK IPSITVREET KRRLTRIENG ERDLFF
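
The sequences in this section are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one possible way to strip the display formatting, compute a GC percentage, and check that a gene sequence ends in a stop codon under translation table 11 (TAA, TAG, TGA). The helper names are hypothetical, and the example runs only on the final displayed line of the gene sequence rather than the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases of a cleaned sequence form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Final displayed line of the gene sequence in this record.
last_line = clean_sequence("GAAAGAGATC TTTTCTTTTA A")
print(len(last_line))             # 21
print(ends_with_stop(last_line))  # True
```

Passing the whole gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field of the record.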