Gene ECH74115_2953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2953 
SymbolhisD 
ID6967860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2727997 
End bp2729301 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content57% 
IMG OID643386793 
Producthistidinol dehydrogenase 
Protein accessionYP_002271261 
Protein GI209397361 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.525891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00000000725252 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCTTTA ACACAATCAT TGACTGGAAT AGCTGTACTG CAAAGCAACA ACGCCAGCTG 
TTAATGCGCC CGGCGATCTC CGCTTCTGAA AGCATTACCC GCACTGTTAA CGATATTCTC
GATAGCGTGA AAGCACGCGG TGATGACGCC CTGCGGGAAT ATAGCGCGAA GTTTGATAAA
ACCACGGTTA CCGCACTGAA GGTGTCTGCT GAGGAAATTG CCGCCGCCAG CGAACGCCTG
AGCGACGAGC TAAAACAGGC GATGGCGGTG GCAGTAAAGA ATATTGAAAC CTTCCACACT
GCGCAAAAAC TGCCGCCGGT AGATGTAGAA ACGCAGCCAG GCGTGCGTTG CCAGCAAGTC
ACGCGCCCGG TAGCTTCAGT TGGGTTGTAT ATTCCTGGCG GCTCCGCCCC GCTCTTCTCA
ACGGTATTAA TGCTGGCAAC TCCGGCGCGT ATTGCGGGCT GTAAAAAAGT GGTGTTGTGC
TCACCGCCGC CGATTGCCGA TGAGATCCTT TATGCGGCGC AGCTGTGCGG TGTGCAGGAC
GTGTTTAACG TCGGCGGCGC ACAGGCCATT GCCGCGCTGG CGTTTGGTAC GGAATCTGTG
CCGAAAGTGG ACAAAATCTT CGGGCCGGGT AACGCCTTTG TCACCGAGGC AAAACGTCAG
GTGAGCCAGC GTCTGGACGG TGCGGCGATC GATATGCCCG CAGGCCCGTC GGAAGTGTTG
GTCATTGCCG ACAGCGGCGC AACGCCGGAT TTCGTGGCTT CTGATTTGCT TTCTCAGGCT
GAACACGGCC CGGACTCACA GGTGATTTTA CTGACGCCTG ACGCTGATAT GGCGCATCAA
GTTGCCGAAG CCGTCGAACG CCAGTTAGCA GAACTGCCGC GTGCCGAAAC CGCCCGCCAG
GCACTGAACG CCAGCCGCCT GATCGTGACT AAAGATTTAG CGCAGTGCGT GGAGATCTCC
AACCAGTACG GCCCGGAGCA CCTGATCATT CAGACCCGCA ACGCCCGCGA ACTGGTCGAT
GGCATCACCA GCGCCGGTTC GGTATTTCTT GGTGACTGGT CACCGGAATC GGCAGGTGAT
TACGCCTCCG GAACCAACCA TGTTCTACCG ACTTACGGCT ACACCGCCAC CTGTTCCAGC
CTCGGACTGG CGGATTTCCA GAAGCGGATG ACCGTGCAGG AACTGTCGAA AGTAGGTTTC
TCCGCGCTGG CTTCGACCAT TGAAACACTG GCCGCCGCCG AGCGCCTGAC CGCCCACAAA
AATGCCGTTA CTTTGCGTGT TAACGCCCTT AAGGAGCAAG CATGA
 
Protein sequence
MSFNTIIDWN SCTAKQQRQL LMRPAISASE SITRTVNDIL DSVKARGDDA LREYSAKFDK 
TTVTALKVSA EEIAAASERL SDELKQAMAV AVKNIETFHT AQKLPPVDVE TQPGVRCQQV
TRPVASVGLY IPGGSAPLFS TVLMLATPAR IAGCKKVVLC SPPPIADEIL YAAQLCGVQD
VFNVGGAQAI AALAFGTESV PKVDKIFGPG NAFVTEAKRQ VSQRLDGAAI DMPAGPSEVL
VIADSGATPD FVASDLLSQA EHGPDSQVIL LTPDADMAHQ VAEAVERQLA ELPRAETARQ
ALNASRLIVT KDLAQCVEIS NQYGPEHLII QTRNARELVD GITSAGSVFL GDWSPESAGD
YASGTNHVLP TYGYTATCSS LGLADFQKRM TVQELSKVGF SALASTIETL AAAERLTAHK
NAVTLRVNAL KEQA