Gene TRQ2_1767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTRQ2_1767 
SymbolhisD 
ID6093218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga sp. RQ2 
KingdomBacteria 
Replicon accessionNC_010483 
Strand
Start bp1783807 
End bp1785093 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content51% 
IMG OID642488964 
Producthistidinol dehydrogenase 
Protein accessionYP_001739781 
Protein GI170289543 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCTAA TGAACAATCC CGGGGATAAA GAAGTGCTCA GACTATTGAA ACAAAGGATG 
GAATCTGTAT CGCAGGTGGA AGAGACCGTA AAAGAAATCA TCAGGAGAGT GAAAGAAGAA
GGAGACAGAG CCCTCGAGGA ATTCCTCAAA AGGTTCGAGA AGCACCCGGT GGGCATCGAG
AACCTTCGTG TCACAGAGAA GGAAATATCG GAAGCCCAAG TAGAGGAAGA ATTCGTTGAA
ACGATAAAAA TCGTGATAGA AGACCTGAAG GAATTCCATC GAAGACAGGA GGAAAGATCT
TTCTTTTTCA CGACGAAGGG TGGAAGCTTC CTGGGAGAGA TGGTTGTTCC TCTCGAGAGT
GTGGGAATCT ACGTTCCTGG GGGAAAGGTT CCGTACTTTT CCACGCTCCT CATGTGCGCG
GTTCCCGCTA TCGTTGCTGG TGTGGAAAGG ATCGCTGTCA CAACACCACC GAATGAAAAC
GGAGGCATCT CTCCATACAT CCTGAAAACC TGCGAAATCC TCGGTCTGAA GGAGATCTAC
CGGATGGGAG GGGCTCACGC GGTAGCCGCC CTCACATACG GCACGGAAAC AGTGAAACCC
GTTGACAAGA TCGTGGGACC CGGTGGAGTC TTCGTCACGC TCGCAAAGAA ACACGTTTAC
GGAGACGTTG GAATCGATTC CATAGCGGGT CCCAGCGAGA TTGCGATCGT GACAGACGGC
AGTGCCGACC TGGATCTCAT AGCCGCGGAT TTCCTCTCCC AGGCAGAGCA CGACGAGAAC
GCGATGAGTG TGGTGATAAC CACTTCGAAA GAAGTCTTCG AAAAATTACC TCAGGTCATT
GAAAGACACC TGGAAGCTCT TCCAGAAGAG AGAAGAAAAA CGGCCAGGAT TTCAACGGAA
AATTTCGGTA CCATCATCTT GACGGACAGT CTGAAAAGGG CCTTTGAGAT CTCCAACCTC
ATCGCCCCCG AACATCTGGA GGTCCTCGTG GAAAACCCGT TTGAGCCACT GGGACACATA
AAGAACGCGG GATCTGTCTT TCTCGGAAAG TACACCTGTG AGTCTGTGGG AGACTACGGT
GCGGGACCGA ACCACGTTCT TCCCACCTTC AGATCCGCGA GGTTCTCCTC AGGACTCAGG
GTTTCCGATT TCACGAAGAA GATATTCATC ACACACCTCT CCGAAGAAGA TTTCAGAAGA
AAGAGCGAGC TTTACTCGAA GATGGCGCGC TGGGAAGGTT TTGAAGCCCA CGCTCGGGCG
ATAGACGTCA GGAGGGAAAA GCTGTGA
 
Protein sequence
MILMNNPGDK EVLRLLKQRM ESVSQVEETV KEIIRRVKEE GDRALEEFLK RFEKHPVGIE 
NLRVTEKEIS EAQVEEEFVE TIKIVIEDLK EFHRRQEERS FFFTTKGGSF LGEMVVPLES
VGIYVPGGKV PYFSTLLMCA VPAIVAGVER IAVTTPPNEN GGISPYILKT CEILGLKEIY
RMGGAHAVAA LTYGTETVKP VDKIVGPGGV FVTLAKKHVY GDVGIDSIAG PSEIAIVTDG
SADLDLIAAD FLSQAEHDEN AMSVVITTSK EVFEKLPQVI ERHLEALPEE RRKTARISTE
NFGTIILTDS LKRAFEISNL IAPEHLEVLV ENPFEPLGHI KNAGSVFLGK YTCESVGDYG
AGPNHVLPTF RSARFSSGLR VSDFTKKIFI THLSEEDFRR KSELYSKMAR WEGFEAHARA
IDVRREKL