Gene Rleg2_0239 details


Gene Information

Locus tag: Rleg2_0239
Symbol: hisD
ID: 6978952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 244671
End bp: 245969
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 65%
IMG OID: 643394951
Product: histidinol dehydrogenase
Protein accession: YP_002279765
Protein GI: 209547848
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
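
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes one stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(244671, 245969)
print(length, protein_length(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length and Protein Length fields.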


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0183727
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCAATCT GGCTGGATCA GGCATCGGAA GGTTTCGAGC AGCGTTTTGC CACCTTTCTG 
ACGACGAAGC GTGAAGTCTC CGAGGATGTG AACACCGTCG TTCGCGCCAT CATCGATGAT
GTCAGGGCCC GCGGCGATGT GGCGCTTGCC GAATATTCGT TGAAGTTCGA CGGCATCGAT
TTCGCCACCG TGCCGATGCG CGTCACGCCT GAGGAGTTCG ACGCCGCGAT CGAGGCCGTG
CCTCCGGAAG TGCTGGGTGC GCTGAAGCTC GCAGCACTGC GCATTGAATC CCATCATCGC
CGGCAGCTGC CGAAGGACGA TATTTATGAG GACGACATGG GCGTCGGCCT CGGCTCGCGC
TGGACGGCGA TCGAGGCGGT CGGGCTCTAT GTTCCGGGCG GCACCGCGAG TTATCCGAGC
TCGGTGCTGA TGAATGCCGT GCCGGCCAAG GTTGCAGGCG TCGATCGCAT CGTCATCGCC
GTTCCCGCCA CCGGCGGTGC CGTCAATCCG GCGGTGCTTG CCGCCGCCAG GCTTGTCGGT
GTGACCGAGG TTTATCGTGT CGGCGGCGCC CAGGCGATCG CGGCTCTTGC CTATGGCACC
GAGACGATCG CCCCGGTTGC CAAGATCACC GGTCCCGGCA ATGCCTATGT CGCAGCCGCC
AAGCGCCATG TCTTCGGCAC TGTCGGCATC GATATGATCG CCGGCCCTTC CGAAGTGCTC
GTCATCGCCG ACAAGGACAA CAATCCCGAT TGGATCGCCG CCGACCTGCT GGCGCAGGCC
GAGCACGATG CCAGCGCCCA GGCGATCCTG ATCACCGACG ATGCCGAATT CGGCAAGGCG
GTGGAGCAGG CGGTCGAGCG CCAGCTGAAG ACGCTGAACC GTGCCGAGAC CGCCGCGGCA
AGCTGGCGCG ATTTCGGCGC GGTCATTCTC GTTGCCGATC TGAAGCAGGC CATTCCGCTT
GCCAACCGCA TCGCCGCCGA GCATCTGGAG CTTGCCGTCG CCGATCCGGA TCGGCTGCTC
GACGGCATCC GCAATGCCGG CGCGATCTTC ATCGGCGCTC ATACGCCCGA GGTGATCGGC
GATTATGTCG GCGGGTCCAA CCACGTGCTG CCGACGGCGC GCTCGGCGCG TTTCTCCTCC
GGCCTTTCGG TGCTCGATTT CGTCAAGCGC ACCTCGATCC TGCGGCTCGG CCCGCAGCAA
TTGCGCACCC TCGGCCCGGC GGCGATTGCT CTAGCCGTCT CCGAAGGCCT CGATGCTCAT
GCGCGATCGG TTGCGATCCG CCTCAACCTC GAAAGGTGA
 
Protein sequence
MAIWLDQASE GFEQRFATFL TTKREVSEDV NTVVRAIIDD VRARGDVALA EYSLKFDGID 
FATVPMRVTP EEFDAAIEAV PPEVLGALKL AALRIESHHR RQLPKDDIYE DDMGVGLGSR
WTAIEAVGLY VPGGTASYPS SVLMNAVPAK VAGVDRIVIA VPATGGAVNP AVLAAARLVG
VTEVYRVGGA QAIAALAYGT ETIAPVAKIT GPGNAYVAAA KRHVFGTVGI DMIAGPSEVL
VIADKDNNPD WIAADLLAQA EHDASAQAIL ITDDAEFGKA VEQAVERQLK TLNRAETAAA
SWRDFGAVIL VADLKQAIPL ANRIAAEHLE LAVADPDRLL DGIRNAGAIF IGAHTPEVIG
DYVGGSNHVL PTARSARFSS GLSVLDFVKR TSILRLGPQQ LRTLGPAAIA LAVSEGLDAH
ARSVAIRLNL ER
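
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such a block into a contiguous string and computing its GC fraction follows; the helper names are hypothetical, and the sample is only the first line of the gene sequence, so its GC fraction is not expected to equal the whole-gene value listed in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a grouped sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above.
sample = "TTGGCAATCT GGCTGGATCA GGCATCGGAA GGTTTCGAGC AGCGTTTTGC CACCTTTCTG"
seq = clean_sequence(sample)
print(len(seq), seq[:3], round(gc_fraction(seq), 3))
```

The first codon of the cleaned sequence is TTG, which translation table 11 permits as an initiation codon; this is consistent with the protein sequence beginning with M.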