Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2159 |
Symbol | hisD |
ID | 5594907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2136952 |
End bp | 2138256 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640921292 |
Product | histidinol dehydrogenase |
Protein accession | YP_001458831 |
Protein GI | 157161513 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 63 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTA ACACAATCAT TGACTGGAAT AGCTGTACTG CGGAGCAACA ACGCCAGCTG TTAATGCGCC CGGCGATCTC CGCTTCTGAA AGCATTACCC GCACTGTTAA CGATATTCTC GATAACGTGA AAGCACGCGG CGATGAGGCC CTGCGGGAAT ACAGCGCGAA GTTTGATAAA ACCACGGTTA CCGCGCTGAA GGTGTCTGCT GAGGAAATTG CCGCCGCCAG CGAACGCCTG AGCGACGAGC TAAAACAGGC GATGGCGGTG GCAGTAAAGA ATATTGAAAC CTTCCACACT GCGCAAAAAC TGCCGCCGGT AGATGTAGAA ACGCAGCCAG GCGTACGTTG CCAGCAAGTC ACGCGCCCGG TAGATTCAGT GGGTTTGTAT ATTCCTGGCG GCTCCGCCCC GCTCCTCTCA ACGGTATTAA TGCTGGCAAC TCCGGCGCGT ATTGCGAACT GTAAAAAAGT GGTGCTGTGC TCACCGCCGC CGATTGCCGA TGAGATCCTT TACGCGGCGC AGCTGTGCGG TGTACAGGAC GTGTTTAACG TCGGCGGCGC ACAGGCCATT GCCGCGCTGG CGTTTGGTAC GGAATCTGTG CCGAAAGTGG ACAAAATCTT CGGGCCGGGT AACGCCTTTG TCACCGAGGC AAAACGTCAG GTGAGCCAGC GTCTGGACGG TGCGGCGATC GATATGCCCG CAGGCCCTTC GGAAGTGCTG GTGATTGCTG ACAGCGGCGC AACGCCGGAT TTCGTGGCTT CTGATTTGCT TTCCCAGGCT GAACACGGCC CGGACTCACA GGTGATTTTA CTGACGCCCG CTGCTGATAT GGCGCGTCGC GTAGCCGAAG CTGTCGAACG CCTGCTGGCA GAACTGCCGC GAGCTGAAAC CGCCCGCCAG GCACTGAACG CCAGCCGCCT GATCGTGACT AAAGATTTAG CGCAGTGCGT AGAGATCTCC AACCAGTACG GCCCGGAGCA CCTGATCATT CAGACCCGCA ACGCCCGCGA TCTGGTCGAT GGCATCACCA GCGCCGGTTC GGTATTTCTT GGTGACTGGT CACCGGAATC CGCAGGTGAT TACGCCTCCG GCACCAACCA CGTTCTGCCG ACTTACGGTT ACACCGCCAC CTGTTCCAGC CTCGGGCTGG CAGATTTCCA GAAGCGTATG ACCGTGCAGG AACTGTCGAA AGAAGGCTTC TCCGCGCTGG CTTCAACCAT TGAAACACTG GCCGCCGCCG AGCGCCTGAC CGCCCATAAA AATGCCGTTA CTTTGCGTGT TAACGCCCTT AAGGAGCAAG CATGA
|
Protein sequence | MSFNTIIDWN SCTAEQQRQL LMRPAISASE SITRTVNDIL DNVKARGDEA LREYSAKFDK TTVTALKVSA EEIAAASERL SDELKQAMAV AVKNIETFHT AQKLPPVDVE TQPGVRCQQV TRPVDSVGLY IPGGSAPLLS TVLMLATPAR IANCKKVVLC SPPPIADEIL YAAQLCGVQD VFNVGGAQAI AALAFGTESV PKVDKIFGPG NAFVTEAKRQ VSQRLDGAAI DMPAGPSEVL VIADSGATPD FVASDLLSQA EHGPDSQVIL LTPAADMARR VAEAVERLLA ELPRAETARQ ALNASRLIVT KDLAQCVEIS NQYGPEHLII QTRNARDLVD GITSAGSVFL GDWSPESAGD YASGTNHVLP TYGYTATCSS LGLADFQKRM TVQELSKEGF SALASTIETL AAAERLTAHK NAVTLRVNAL KEQA
|
| |