Gene Smed_0208 details

Gene Information

Locus tag: Smed_0208
Symbol: hisD
ID: 5321039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 235116
End bp: 236420
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 65%
IMG OID: 640789142
Product: histidinol dehydrogenase
Protein accession: YP_001325902
Protein GI: 150395435
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
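
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and do not come from the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 235116
end_bp = 236420
gene_length_bp = 1305
protein_length_aa = 434

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon, so N codons encode N - 1 residues.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks hold for this record: 1305 bp is 435 codons, which is 434 residues plus one stop codon.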


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCAATCA GGCTGAACTA TCTCGATACC GGCTTTGAGC GCGATTTCGC CGCATTCCTG 
ACGACCAAGC GGGAAGTTTC CGAAGACGTA AACGCCGTCG TGCGCGACAT TATCGACGAT
GTCCGCGCGC GCGGCGATGC GGCGCTTGCA GATTATTCGG CGCGTTTTGA CGGAATAGAC
TTCAATGTCA CGGGCATGGC GGTGACGGCG GCGGAGATAG ATGCGGCGAT CCACGCCGTT
GCTCCGGAGG TTCTCGGCGC CCTGAAGGTC GCCGCGACCC GCATCGAGGC GCATCACCGG
CGGCAATTGC CGAAGGACGA CATCTATGAA GACCAGATGG GCGTCGGCCT CGGCTCCCGC
TGGACGCCGA TCGATGCGGT GGGCCTCTAT GTTCCGGGTG GCACGGCGAG CTATCCGAGC
TCGGTTCTGA TGAACGCTCT GCCGGCAAAG GTCGCCGGCG TCCCCCGCAT CGTCATGGTC
GTGCCGGCAA TGGGCGGTGC GGTCAATCCT GCGGTGCTTG CGGCGGCGCG GCTCGCCGGC
GTGGAAGAAA TCTATCGCAT CGGTGGTGCC CAGGCCGTCG CGGCCCTTGC CTACGGGACC
GGGACGATCG CGCCGGTGGC CAAAATCATG GGCCCCGGAA ACGCCTATGT CGCGGCCGCC
AAGCGACAGG TTTTCGGCAC CGTCGGCATC GACATGATCG CCGGACCTTC GGAAGTGCTG
GTGATTGCGG ATCGCGACAA CGATCCGGAT TGGATCGCCG CGGACATGCT TGCTCAGGCA
GAGCACGATG CCGGCGCTCA GGCGATCCTG ATCACCGACG ATGCCGCTTT CGGCGATGCA
GTCGAAGAGG CTGTGGAGCG TCAGTTGAAG ACGCTGCCGC GTGCCGACAC GGCGGCAGCG
AGCTGGCGCG ATTTCGGTGC CATCATTCTG GTTCCGGATT TCGACAAGGC CATCCCGCTC
GCCAACCGCA TCGCTCCCGA ACATCTCGAA CTGGCGACGG CCGATCCGGA CGCGATGGTC
CCCGCCATCC GCAATGCCGG CGCGATCTTC ATCGGCAGGC ACACGCCCGA AGTCATCGGC
GATTATGTGG GCGGTTCCAA CCACGTGCTG CCGACGGCGC GTTCGGCGCG CTTCTCGTCC
GGCCTCGGCG TGCTCGACTA TATGAAGCGA ACGTCTATCC TGCGGCTCGA TCCGGAACAG
TTGCGCATAC TCGGCCCCGC CGCGATCGCG CTGGCGAGAT CGGAAGGGCT CGAGGCTCAC
GCCCGATCGG TCGCAATCCG CCTCAACCTC GGGGAAAAGG GATGA
 
Protein sequence
MAIRLNYLDT GFERDFAAFL TTKREVSEDV NAVVRDIIDD VRARGDAALA DYSARFDGID 
FNVTGMAVTA AEIDAAIHAV APEVLGALKV AATRIEAHHR RQLPKDDIYE DQMGVGLGSR
WTPIDAVGLY VPGGTASYPS SVLMNALPAK VAGVPRIVMV VPAMGGAVNP AVLAAARLAG
VEEIYRIGGA QAVAALAYGT GTIAPVAKIM GPGNAYVAAA KRQVFGTVGI DMIAGPSEVL
VIADRDNDPD WIAADMLAQA EHDAGAQAIL ITDDAAFGDA VEEAVERQLK TLPRADTAAA
SWRDFGAIIL VPDFDKAIPL ANRIAPEHLE LATADPDAMV PAIRNAGAIF IGRHTPEVIG
DYVGGSNHVL PTARSARFSS GLGVLDYMKR TSILRLDPEQ LRILGPAAIA LARSEGLEAH
ARSVAIRLNL GEKG