Gene Tmz1t_0900 details


Gene Information

Locus tag: Tmz1t_0900
Symbol: hisD
ID: 7084758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 991829
End bp: 993136
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 71%
IMG OID: 643697923
Product: histidinol dehydrogenase
Protein accession: YP_002354563
Protein GI: 217969329
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.69214
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGA CCCCGATCCG CCGCCTGGCC GCGCGCGAAC CCGAATTCCT GTCCACCCTC 
GATGCCCTGC TCGCCTTCGA GGCCGAGGCC GACGGCCGCA TCGACGCGGC GGTCACCGAG
ATCCTGCAGG CGGTGCGCAC CACCGGCGAC GCCGCGGTGG TCGAATACAC CCGCCGCTTC
GACGGGCTCG ACGTGCAATC CATGGTCGCG CTCGAGCTGC CCAGGAGCGA GCTGCTGCTC
GCGCTCGACA GCCTGCGCCC CGAGCAGCGC GAGGCGCTCA CCATCGCCGC CGACCGCGTG
CGCGTCTATC ACGAGCGCCA GAAGGGCGAG TCCTGGGAAT TCACCGAGGC CGACGGCACC
CGCCTGGGCC AGAAGGTCAC CCCGCTCGAC CGCGTCGGCC TCTACGTGCC GGGCGGGCGC
GCCTCCTACC CGAGCTCGGT GCTGATGAAT GCGATCCCGG CCAAGGTCGC CGGTGTCGGC
GAACTGATCA TGGTCGTGCC CACCCCGCGT GGCGAGAAGA ATCCGCTGGT GCTGGCGGCG
GCGGCGATCA CCGGCGTCGA CCGCGTGTTC ACCATCGGCG GCGCGCAGGC GGTGGCGGCG
CTGGCCTACG GCACGCAGAC CATCCCGCAG GTGGACAAGA TCGTCGGCCC GGGCAATGCC
TACGTGGCCG AGGCCAAGCG CCGCGTGTTC GGCACCGTCG GCATCGACAT GGTCGCCGGC
CCGTCTGAAG TGCTGATCAT CTCCGATGGC TCCGGCCACG CCGACTGGGT GGCAATGGAC
CTCTTCGCCC AGGCCGAGCA CGACGAGCTC GCGCAGTCCA TCCTGCTGTG TACCGACGCC
GGCTTCATCG ACGCGGTGCA CGACGCGATC GACCGCCTGC TGCCCACCAT GCCGCGCCGC
GACACGATCG CCAGGTCGCT TGCCAACCGC GGCGCGCTGA TCCACGTCGA CAGCCTGGAG
CAGGCCTGCG CGCTCGCCAA CCGCATCGCG CCCGAGCACC TCGAGCTGGC GCTGGAAGAT
GCCGAGGACT GGATCGCCCA CATCCGTCAC GCCGGCGCGA TCTTCGTCGG CCACTGGGCG
GTGGAGGCGC TCGGCGACTA CTGCGCCGGC CCCAACCACG TGCTGCCGAC CATGCGCAGC
GCGCGTTTCT CCTCGCCGCT CGGCACCTAC GACTTCCAGA AGCGCACCAG CATCGTCCAC
ATCTCGCAGG CGGGCGCGCA GCACCTGGGC AAGGTGGCTT CCATCCTGGC CCATGGCGAG
GGTCTGCAGG CGCACGCCCG CTCGGCGGAG ATGCGGCTGA AGGTTTGA
 
Protein sequence
MSQTPIRRLA AREPEFLSTL DALLAFEAEA DGRIDAAVTE ILQAVRTTGD AAVVEYTRRF 
DGLDVQSMVA LELPRSELLL ALDSLRPEQR EALTIAADRV RVYHERQKGE SWEFTEADGT
RLGQKVTPLD RVGLYVPGGR ASYPSSVLMN AIPAKVAGVG ELIMVVPTPR GEKNPLVLAA
AAITGVDRVF TIGGAQAVAA LAYGTQTIPQ VDKIVGPGNA YVAEAKRRVF GTVGIDMVAG
PSEVLIISDG SGHADWVAMD LFAQAEHDEL AQSILLCTDA GFIDAVHDAI DRLLPTMPRR
DTIARSLANR GALIHVDSLE QACALANRIA PEHLELALED AEDWIAHIRH AGAIFVGHWA
VEALGDYCAG PNHVLPTMRS ARFSSPLGTY DFQKRTSIVH ISQAGAQHLG KVASILAHGE
GLQAHARSAE MRLKV
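
The record's figures can be cross-checked against each other: the coordinates 991829 to 993136 span 1308 bp, and a 1308 bp CDS holds 436 codons, which is 435 residues plus one stop codon. The Python sketch below illustrates that arithmetic and shows one way to strip the 10-character display blocks before working with the sequences. The helper names (`clean`, `span_length`, `expected_protein_length`, `gc_fraction`) are hypothetical and not part of any database tool; the two sequence strings are copied from the first and last lines of the gene sequence above.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display sequences in blocks of 10."""
    return "".join(seq.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for an unspliced CDS: codon count minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


# First and last display lines of the gene sequence, copied from the record.
first_line = "ATGAGCCAGA CCCCGATCCG CCGCCTGGCC GCGCGCGAAC CCGAATTCCT GTCCACCCTC"
last_line = "GGTCTGCAGG CGCACGCCCG CTCGGCGGAG ATGCGGCTGA AGGTTTGA"

if __name__ == "__main__":
    gene_length = span_length(991829, 993136)
    print(gene_length)                           # coordinates -> gene length in bp
    print(expected_protein_length(gene_length))  # gene length -> protein length in aa
    print(clean(first_line)[:3], clean(last_line)[-3:])  # start and stop codons
```

Passing the full gene sequence to `gc_fraction` gives a value to compare with the listed GC content; the sketch does not assert that figure.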