Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_19320 |
Symbol | thiH |
ID | 7312747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2069224 |
End bp | 2070669 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643612378 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002509674 |
Protein GI | 220932766 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.604255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAGTT TATCATTTCC AGATCATCGC AGAGGGGAGT CGAGGAGTAA AGTCTGTGAA TACATTAATG GGGATAAAAT CGAGGCCATT CTGGAGGAGG CCAGTAATCC TTCAGAAGAG GAAGTTAACC ACATTATAAA AAAGTCCCTG GAGCTCAAGG GACTATCAGT AGAAGAAGCA GCTGTATTGC TCCAGGTAGA GGACCAGGAA TTAATCAATA AATTCCTGGA AGCAGCCAGA AAGGTGAAAG AAAAGATTTA CGGGAAAAGG CTTGTTCTTT TTGCTCCCCT GTATTTTTCG AATCTATGTA CTAATAGCTG TCTTTACTGT AGCTTCAGGC ATAATAACAA TAAAGTTAAA AGAAAAAAAC TCAGTATAGA AGAGATTAAG GAAGAAGTCA GAGCCCTGGA AAGAGAAGGA CATAAACGCC TGCTTGTTTT GACCGGGGAA ACCCCCGAAA CGGACCTTGA TTATGTGGTT GAAGGTATTA AAGCAGCTTA TGAAACCAGG ACTGAACATG GTGGTGAGAT CAGGAGGATA AATGTAGAGA TTGCCCCCCT GACCACAGAA GATTTCAAAA AACTCAAGGA AGCTAAAATT GGAACCTATA CCTGTTTTCA GGAGACATAT CACCGTCCTA CCTATAAGAA AATGCATCCA TCTGGCCCCA AAGCCGATTA TGACTGGCGG TTGTCAGTTA TGGACCGGGC CCAGCAGGCC GGTATTGATG ATATAGGTAT CGGGGCTCTC TTTGGACTGT ATGACTATAA ATTTGAGGTT ATTGCCCTGT TACTACATTC TGAATACCTG GATAAGACCT ATGGTGTTGG CCCCCATACA ATATCGGTTC CCCGTTTAAA CCCCGCCCTG GGGGCACCTG TCCAGGAGCC ACCATATCCT GTGTCTGATG AGGATTTCAG GAAACTTGTA GCAATCTTGA GGCTGGCTGT TCCCTATACA GGGATAATTT TATCAACCAG GGAGAGTATT GACATGCGTA ATGAATTGTT TTTGCACGGA GTTTCCCAGA TAAGTGCCGG ATCCCGGACT ACTCCAGGGG GGTATAGAGA GGCCCGGGAA CGGGAGCATG ATCTGGAGCA ATTTTCTCTC CATGACATCA GGCCCATGGA TGAAATTATA GCTGAAATTA GCAAACAGGG GTATATACCG AGTTTCTGTA CTGCCTGTTA CCGCCTGGGG CGGACCGGTC AGGATTTTAT GGACCTTGCT AAACCGGGTA AGATTCAGGA ATTTTGTAAA CCCAATGCCA TGTTTACCTT TAAAGAATAC CTGGTTGACT ACGCCAGTCC TGAAACACGC AAACTGGGTG AGGAGTGTTT ACAGGCTCAT TTAAAAGAAA TTAAAGATCT AAATCCCATG CTGGCTCAAA AAGTGAAGAC AAATCTTACT AAAATAGAAA ATGGTGAACA TGACCTGTAC TTTTAA
|
Protein sequence | MGSLSFPDHR RGESRSKVCE YINGDKIEAI LEEASNPSEE EVNHIIKKSL ELKGLSVEEA AVLLQVEDQE LINKFLEAAR KVKEKIYGKR LVLFAPLYFS NLCTNSCLYC SFRHNNNKVK RKKLSIEEIK EEVRALEREG HKRLLVLTGE TPETDLDYVV EGIKAAYETR TEHGGEIRRI NVEIAPLTTE DFKKLKEAKI GTYTCFQETY HRPTYKKMHP SGPKADYDWR LSVMDRAQQA GIDDIGIGAL FGLYDYKFEV IALLLHSEYL DKTYGVGPHT ISVPRLNPAL GAPVQEPPYP VSDEDFRKLV AILRLAVPYT GIILSTRESI DMRNELFLHG VSQISAGSRT TPGGYREARE REHDLEQFSL HDIRPMDEII AEISKQGYIP SFCTACYRLG RTGQDFMDLA KPGKIQEFCK PNAMFTFKEY LVDYASPETR KLGEECLQAH LKEIKDLNPM LAQKVKTNLT KIENGEHDLY F
|
| |