Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_0530 |
Symbol | thiH |
ID | 5876580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 547787 |
End bp | 549187 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641540866 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001662174 |
Protein GI | 167039189 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAG AAAAAGCAGA TTTTATTGAT GATGAAAAGA TAAGACAGGA TTTAGAAAAG GCTAAAAAAG CGACAATTAA ATATGCATTA GAAATTATAG AGAAAGCTAA AAAGCTAAAA GGCATTACTC CTGAAGAAGC GGCGGTACTT TTAAATGTAG AAGATGAGGA TTTGCTTAAT GAGATGTTTA AAGTAGCAAG GTATATAAAA GAAGAAATAT ATGGAAATAG AATTGTGATA TTTGCCCCCC TTTATGTAAG TAACTACTGT GTAAACAACT GTAGATATTG TGGTTACAGA CATTCTAATG AGCAGGAAAG AAAAAAGCTT ACAATGGAAG AAGTGAGAAG AGAAGTTGAG ATTTTGGAAG AGATGGGACA TAAGAGATTA GCAGTTGAAG CTGGAGAAGA CCCTGTAAAT TGCCCTATAG ATTATATTAT CGATGTAATA AAGACGATAT ACGATACAAA ACTTAAAAAC GGAAGCATAA GAAGGGTAAA TGTCAATATA GCAGCGACTA CTGTGGAAAA TTACAAAAAA CTTAAAGAAG TAGGAATAGG GACTTATATT TTATTCCAAG AAACTTACCA TAGACCTACG TATGAATACA TGCATCCACA AGGTCCAAAA CACGATTACG ACTACCATTT GACTGCTATG GATAGGGCTA TGGAGGCAGG CATTGACGAC GTAGGATTAG GGGTTTTGTA TGGGCTTTAT GATTACAAAT ACGAAACTGT TGCGATGCTT TATCATGCGA ACCATTTAGA GGAGAAATTT GGAGTTGGGC CACATACTAT TTCAGTACCG CGACTTAGAC CAGCTCTTAA CACTCCCATA GATAAATTCC CATATATTGT ATCAGATAAA GACTTTAAAA AATTAGTAGC CGTCATAAGA ATGGCAGTGC CCTATACAGG GATGATTTTG TCTACAAGAG AGAAGCCTAA ATTTAGAGAA GAAGTAATAA GCATTGGCAT TTCTCAGATT AGTGCAGGTT CTTGTACAGG AGTAGGTGGA TATCATGAAG AGATATCCAA AAAAGGTGGT TCAAAGCCAC AATTTGAGGT AGAAGACAAA AGAAGTCCTA ATGAAATTTT GAGGACTTTG TGTGAACAAG GGTATCTCCC AAGTTATTGT ACTGCCTGCT ACAGAATGGG ACGTACAGGA GACAGGTTTA TGACCTTTGC GAAATCAGGG CAAATACACA ACTTCTGTCT ACCTAATGCG ATACTAACCT TCAAAGAGTT TTTGATTGAT TACGGGGATG AAAAAACTAA GGAAATTGGA GAAAAAGCTA TAGCGGTAAA TTTAGAGAAA ATTCCATCAA TAACTGTAAG GGAAGAGACA AAGAGAAGGC TTACAAGAAT AGAAAATGGA GAAAGAGATC TTTTCTTTTA A
|
Protein sequence | MIKEKADFID DEKIRQDLEK AKKATIKYAL EIIEKAKKLK GITPEEAAVL LNVEDEDLLN EMFKVARYIK EEIYGNRIVI FAPLYVSNYC VNNCRYCGYR HSNEQERKKL TMEEVRREVE ILEEMGHKRL AVEAGEDPVN CPIDYIIDVI KTIYDTKLKN GSIRRVNVNI AATTVENYKK LKEVGIGTYI LFQETYHRPT YEYMHPQGPK HDYDYHLTAM DRAMEAGIDD VGLGVLYGLY DYKYETVAML YHANHLEEKF GVGPHTISVP RLRPALNTPI DKFPYIVSDK DFKKLVAVIR MAVPYTGMIL STREKPKFRE EVISIGISQI SAGSCTGVGG YHEEISKKGG SKPQFEVEDK RSPNEILRTL CEQGYLPSYC TACYRMGRTG DRFMTFAKSG QIHNFCLPNA ILTFKEFLID YGDEKTKEIG EKAIAVNLEK IPSITVREET KRRLTRIENG ERDLFF
|
| |