Gene Rleg2_5007 details


Gene Information

Locus tag: Rleg2_5007
Symbol: hisD
ID: 6978101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 654177
End bp: 655523
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 60%
IMG OID: 643394153
Product: histidinol dehydrogenase
Protein accession: YP_002278971
Protein GI: 209547053
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
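
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (both assumptions, though they agree with the values in this record). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(654177, 655523)
print(length)                  # 1347
print(protein_length(length))  # 448
```

Both results match the Gene Length and Protein Length fields of the record.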


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGACA CTCTCGTTTC GATACACACG CTCAGCGCCT TGGACGACGC CCAGCGCAAA 
GCCCTATTAC AACGAACCGA AAGCGATCTG AGCGCTTTCA TCGATAAGGT AAATCCGATC
ATTGAAGCCG TCCGCATTGA TGGCGACGCA GCCCTTGCGG ATTTCGCCCG CCAATTTGAC
AAAGCGCAGG TCGCGGCCGA CCAGCTTCGC GCGACCACCG AGGAGTTCTC CGCGGCGCGC
GCCAGCATTG ATCCAGAGCT CATCACCACT CTTGAATTCG CTGCTGGAAA CATCCGCCAC
TTCCACGAAA AGCAGATGCC GGAGACGCTT GTTTTGCACG AGACGCATCG GGGCGTCCTC
GTAGGCGACC GATGGAACGC GATCGACTCC GTCGCCTGCT ACGTTCCCCG CGGCAAAGGC
TCATTTCCCA GTTCGGTGCT GATGACAGCC ATTCCGGCAA AGGTGGCTGG CGTCAGAAAG
GTTGTTATCA TCACCCCGCC CGGCCCCGAC GGCACCGTCG ATCCGGCGAC GCTTGTCGCA
GCAGAGATCG CTGGGGTCTC AGACGTCTTC AAGTGCGGTG GTGCGCAAGC CATCGCGGCC
GTCGCCTTCG GGACTCAAAC GGTTCCAAAA TGTGACAAGG TCGTCGGTCC GGGCAGCCCA
TGGGTGGTTG CGGCAAAGAA GCAGCTGTCA TCCTTGATCG ATCCGGGAAG CCCCGCAGGT
CCGAGCGAAC TTATCATCTT TTCCGATGGT TCGGTGCCCG CCGAGCTGGT GGCGCTGGAT
CTCTGCGTAG AGTCGGAGCA CGGTCCGGAC TCTTCCGTCT TCTTCGTAAC CGACAATGCG
GACTTCGCAA ATGCGGTCGC TTCCTGTGTC CCAGCCCTCT GGTCACGAAT GGGTAACATC
CGGGCGCAAT ATTCCAAGAC TGTGCTTTCC GGTCCAAGGG GAGGGATCGT CATTGCCCGT
ACCCGGGCGG ACGCTCTCGC CTTCGTTAAC GACTACGCTC CGGAACACCT CGCAGTCCTC
GCCGATAATG CCTGGCAATA TCTCGCCTGC TTTGAGCATG CAGGCGAAAT CCTTCTCGGC
CCGCATTCCG CAATCAGCGT TGCCAACTTC GTGCTCGGAC CCAGTCACGT TCTGCCCACA
GGAGGAGCCG CCAAAACGAC CTCTCCATTG TCGGTCTTCG ACTTTCTGAA GCGCACTTCG
ATCGCTTCGC TGTCTAAAGA GGCCTATGGG GGCTTTGCCG CTCATGCCGA ACGGCTCGCT
CGGTACGAGG GCTTCGACGG TCATGCCAAC GCTGTTTCGA CGATCCGTGA CGAAGCGCTT
CGCTCTGCCG CCGGCGCCAG CACCTAA
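
A GC content figure like the one reported for this gene can be recomputed from a sequence printed in blocks of ten bases, as it is here. The following is a minimal sketch, assuming that whitespace and line breaks are only layout and that GC content means the percentage of G and C bases over all bases. The helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGTCCGACA CTCTCGTTTC GATACACACG CTCAGCGCCT TGGACGACGC CCAGCGCAAA"
print(round(gc_content(first_line), 1))
```

Passing the complete gene sequence instead of the first line gives the figure to compare against the GC content field.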
 
Protein sequence
MSDTLVSIHT LSALDDAQRK ALLQRTESDL SAFIDKVNPI IEAVRIDGDA ALADFARQFD 
KAQVAADQLR ATTEEFSAAR ASIDPELITT LEFAAGNIRH FHEKQMPETL VLHETHRGVL
VGDRWNAIDS VACYVPRGKG SFPSSVLMTA IPAKVAGVRK VVIITPPGPD GTVDPATLVA
AEIAGVSDVF KCGGAQAIAA VAFGTQTVPK CDKVVGPGSP WVVAAKKQLS SLIDPGSPAG
PSELIIFSDG SVPAELVALD LCVESEHGPD SSVFFVTDNA DFANAVASCV PALWSRMGNI
RAQYSKTVLS GPRGGIVIAR TRADALAFVN DYAPEHLAVL ADNAWQYLAC FEHAGEILLG
PHSAISVANF VLGPSHVLPT GGAAKTTSPL SVFDFLKRTS IASLSKEAYG GFAAHAERLA
RYEGFDGHAN AVSTIRDEAL RSAAGAST
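
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below is a plain-Python illustration, not the database's own method. It assumes the sequence is given 5' to 3' on the coding strand and in frame, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) paired with
# the amino-acid string shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: translation ends here
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGTCCGACA CTCTCGTTTC GATACACACG"))  # MSDTLVSIHT
```

The output matches the first ten residues of the protein sequence. Translating the full 1347 bp gene sequence the same way allows a residue-by-residue comparison with the 448 aa protein.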