Gene EcHS_A2159 details

Gene Information

Locus tag: EcHS_A2159
Symbol: hisD
ID: 5594907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2136952
End bp: 2138256
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 58%
IMG OID: 640921292
Product: histidinol dehydrogenase
Protein accession: YP_001458831
Protein GI: 157161513
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
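
The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2136952
end_bp = 2138256
protein_length_aa = 434

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)   # 1305
print(expected_cds_bp)  # 1305
```

Both values agree with the listed gene length of 1305 bp.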


Plasmid Coverage information

Num covering plasmid clones: 63
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTTTA ACACAATCAT TGACTGGAAT AGCTGTACTG CGGAGCAACA ACGCCAGCTG 
TTAATGCGCC CGGCGATCTC CGCTTCTGAA AGCATTACCC GCACTGTTAA CGATATTCTC
GATAACGTGA AAGCACGCGG CGATGAGGCC CTGCGGGAAT ACAGCGCGAA GTTTGATAAA
ACCACGGTTA CCGCGCTGAA GGTGTCTGCT GAGGAAATTG CCGCCGCCAG CGAACGCCTG
AGCGACGAGC TAAAACAGGC GATGGCGGTG GCAGTAAAGA ATATTGAAAC CTTCCACACT
GCGCAAAAAC TGCCGCCGGT AGATGTAGAA ACGCAGCCAG GCGTACGTTG CCAGCAAGTC
ACGCGCCCGG TAGATTCAGT GGGTTTGTAT ATTCCTGGCG GCTCCGCCCC GCTCCTCTCA
ACGGTATTAA TGCTGGCAAC TCCGGCGCGT ATTGCGAACT GTAAAAAAGT GGTGCTGTGC
TCACCGCCGC CGATTGCCGA TGAGATCCTT TACGCGGCGC AGCTGTGCGG TGTACAGGAC
GTGTTTAACG TCGGCGGCGC ACAGGCCATT GCCGCGCTGG CGTTTGGTAC GGAATCTGTG
CCGAAAGTGG ACAAAATCTT CGGGCCGGGT AACGCCTTTG TCACCGAGGC AAAACGTCAG
GTGAGCCAGC GTCTGGACGG TGCGGCGATC GATATGCCCG CAGGCCCTTC GGAAGTGCTG
GTGATTGCTG ACAGCGGCGC AACGCCGGAT TTCGTGGCTT CTGATTTGCT TTCCCAGGCT
GAACACGGCC CGGACTCACA GGTGATTTTA CTGACGCCCG CTGCTGATAT GGCGCGTCGC
GTAGCCGAAG CTGTCGAACG CCTGCTGGCA GAACTGCCGC GAGCTGAAAC CGCCCGCCAG
GCACTGAACG CCAGCCGCCT GATCGTGACT AAAGATTTAG CGCAGTGCGT AGAGATCTCC
AACCAGTACG GCCCGGAGCA CCTGATCATT CAGACCCGCA ACGCCCGCGA TCTGGTCGAT
GGCATCACCA GCGCCGGTTC GGTATTTCTT GGTGACTGGT CACCGGAATC CGCAGGTGAT
TACGCCTCCG GCACCAACCA CGTTCTGCCG ACTTACGGTT ACACCGCCAC CTGTTCCAGC
CTCGGGCTGG CAGATTTCCA GAAGCGTATG ACCGTGCAGG AACTGTCGAA AGAAGGCTTC
TCCGCGCTGG CTTCAACCAT TGAAACACTG GCCGCCGCCG AGCGCCTGAC CGCCCATAAA
AATGCCGTTA CTTTGCGTGT TAACGCCCTT AAGGAGCAAG CATGA
 
Protein sequence
MSFNTIIDWN SCTAEQQRQL LMRPAISASE SITRTVNDIL DNVKARGDEA LREYSAKFDK 
TTVTALKVSA EEIAAASERL SDELKQAMAV AVKNIETFHT AQKLPPVDVE TQPGVRCQQV
TRPVDSVGLY IPGGSAPLLS TVLMLATPAR IANCKKVVLC SPPPIADEIL YAAQLCGVQD
VFNVGGAQAI AALAFGTESV PKVDKIFGPG NAFVTEAKRQ VSQRLDGAAI DMPAGPSEVL
VIADSGATPD FVASDLLSQA EHGPDSQVIL LTPAADMARR VAEAVERLLA ELPRAETARQ
ALNASRLIVT KDLAQCVEIS NQYGPEHLII QTRNARDLVD GITSAGSVFL GDWSPESAGD
YASGTNHVLP TYGYTATCSS LGLADFQKRM TVQELSKEGF SALASTIETL AAAERLTAHK
NAVTLRVNAL KEQA
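
The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a block could be joined, measured for GC fraction, and translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of the record. The codon table is the standard one; translation table 11 assigns the same amino acids and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG.

```python
def clean_sequence(text):
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCTTTA ACACAATCAT TGACTGGAAT AGCTGTACTG CGGAGCAACA ACGCCAGCTG"
dna = clean_sequence(first_line)

print(len(dna))        # 60
print(translate(dna))  # MSFNTIIDWNSCTAEQQRQL
print(round(gc_fraction(dna), 2))
```

The translated prefix matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through the same helpers would allow the listed protein length and GC content to be checked in the same way.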