Gene Hore_19320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_19320 
SymbolthiH 
ID7312747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2069224 
End bp2070669 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content44% 
IMG OID643612378 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002509674 
Protein GI220932766 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.604255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGTT TATCATTTCC AGATCATCGC AGAGGGGAGT CGAGGAGTAA AGTCTGTGAA 
TACATTAATG GGGATAAAAT CGAGGCCATT CTGGAGGAGG CCAGTAATCC TTCAGAAGAG
GAAGTTAACC ACATTATAAA AAAGTCCCTG GAGCTCAAGG GACTATCAGT AGAAGAAGCA
GCTGTATTGC TCCAGGTAGA GGACCAGGAA TTAATCAATA AATTCCTGGA AGCAGCCAGA
AAGGTGAAAG AAAAGATTTA CGGGAAAAGG CTTGTTCTTT TTGCTCCCCT GTATTTTTCG
AATCTATGTA CTAATAGCTG TCTTTACTGT AGCTTCAGGC ATAATAACAA TAAAGTTAAA
AGAAAAAAAC TCAGTATAGA AGAGATTAAG GAAGAAGTCA GAGCCCTGGA AAGAGAAGGA
CATAAACGCC TGCTTGTTTT GACCGGGGAA ACCCCCGAAA CGGACCTTGA TTATGTGGTT
GAAGGTATTA AAGCAGCTTA TGAAACCAGG ACTGAACATG GTGGTGAGAT CAGGAGGATA
AATGTAGAGA TTGCCCCCCT GACCACAGAA GATTTCAAAA AACTCAAGGA AGCTAAAATT
GGAACCTATA CCTGTTTTCA GGAGACATAT CACCGTCCTA CCTATAAGAA AATGCATCCA
TCTGGCCCCA AAGCCGATTA TGACTGGCGG TTGTCAGTTA TGGACCGGGC CCAGCAGGCC
GGTATTGATG ATATAGGTAT CGGGGCTCTC TTTGGACTGT ATGACTATAA ATTTGAGGTT
ATTGCCCTGT TACTACATTC TGAATACCTG GATAAGACCT ATGGTGTTGG CCCCCATACA
ATATCGGTTC CCCGTTTAAA CCCCGCCCTG GGGGCACCTG TCCAGGAGCC ACCATATCCT
GTGTCTGATG AGGATTTCAG GAAACTTGTA GCAATCTTGA GGCTGGCTGT TCCCTATACA
GGGATAATTT TATCAACCAG GGAGAGTATT GACATGCGTA ATGAATTGTT TTTGCACGGA
GTTTCCCAGA TAAGTGCCGG ATCCCGGACT ACTCCAGGGG GGTATAGAGA GGCCCGGGAA
CGGGAGCATG ATCTGGAGCA ATTTTCTCTC CATGACATCA GGCCCATGGA TGAAATTATA
GCTGAAATTA GCAAACAGGG GTATATACCG AGTTTCTGTA CTGCCTGTTA CCGCCTGGGG
CGGACCGGTC AGGATTTTAT GGACCTTGCT AAACCGGGTA AGATTCAGGA ATTTTGTAAA
CCCAATGCCA TGTTTACCTT TAAAGAATAC CTGGTTGACT ACGCCAGTCC TGAAACACGC
AAACTGGGTG AGGAGTGTTT ACAGGCTCAT TTAAAAGAAA TTAAAGATCT AAATCCCATG
CTGGCTCAAA AAGTGAAGAC AAATCTTACT AAAATAGAAA ATGGTGAACA TGACCTGTAC
TTTTAA
 
Protein sequence
MGSLSFPDHR RGESRSKVCE YINGDKIEAI LEEASNPSEE EVNHIIKKSL ELKGLSVEEA 
AVLLQVEDQE LINKFLEAAR KVKEKIYGKR LVLFAPLYFS NLCTNSCLYC SFRHNNNKVK
RKKLSIEEIK EEVRALEREG HKRLLVLTGE TPETDLDYVV EGIKAAYETR TEHGGEIRRI
NVEIAPLTTE DFKKLKEAKI GTYTCFQETY HRPTYKKMHP SGPKADYDWR LSVMDRAQQA
GIDDIGIGAL FGLYDYKFEV IALLLHSEYL DKTYGVGPHT ISVPRLNPAL GAPVQEPPYP
VSDEDFRKLV AILRLAVPYT GIILSTRESI DMRNELFLHG VSQISAGSRT TPGGYREARE
REHDLEQFSL HDIRPMDEII AEISKQGYIP SFCTACYRLG RTGQDFMDLA KPGKIQEFCK
PNAMFTFKEY LVDYASPETR KLGEECLQAH LKEIKDLNPM LAQKVKTNLT KIENGEHDLY
F