Gene EcolC_1622 details


Gene Information

Locus tag: EcolC_1622
Symbol: hisD
ID: 6066383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1802973
End bp: 1804277
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 58%
IMG OID: 641601037
Product: histidinol dehydrogenase
Protein accession: YP_001724607
Protein GI: 170019653
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
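
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the variable names are not part of the record, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 1802973
end_bp = 1804277

# Inclusive coordinates: both the start and end positions belong to the gene.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1  # stop codon does not encode an amino acid

print(gene_length, "bp,", protein_length, "aa, leftover bases:", remainder)
```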


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.426183
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTTA ACACAATCAT TGACTGGAAT AGCTGTACTG CGGAGCAACA ACGCCAGCTG 
TTAATGCGCC CGGCGATCTC CGCTTCTGAA AGCATTACCC GCACCGTTAA CGATATTCTC
GATAACGTGA AAGCACGCGG CGATGACGCC CTGCGGGAAT ACAGCGCAAA GTTTGATAAA
ACCACGGTTA CCGCGCTGAA GGTGTCTGCT GAGGAGATCG CCGCCGCCAG CGAACGCCTT
AGCGACGAGC TAAAACAGGC GATGGCGGTG GCAGTAAAAA ATATTGAAAC CTTCCACACT
GCGCAAAAAC TGCCGCCGGT AGATGTAGAA ACCCAGCCAG GCGTGCGTTG CCAGCAGGTC
ACGCGCCCAG TCGCTTCAGT TGGGTTGTAT ATTCCTGGCG GCTCCGCCCC GCTCTTCTCA
ACGGTATTAA TGCTGGCAAC TCCGGCGCGT ATTGCGGGCT GTAAAAAAGT GGTGCTGTGC
TCACCGCCGC CGATTGCCGA TGAGATCCTT TACGCGGCGC AGCTGTGCGG TGTACAGGAC
GTGTTTAACG TCGGCGGCGC ACAGGCCATT GCCGCACTGG CGTTTGGTAC GGAATCTGTG
CCGAAAGTGG ACAAAATCTT CGGGCCGGGT AACGCCTTTG TCACCGAAGC GAAACGTCAG
GTGAGCCAGC GTCTGGACGG TGCGGCGATC GATATGCCCG CAGGCCCGTC GGAAGTGCTG
GTGATTGCTG ACAGCGGCGC TACGCCGGAT TTCGTGGCTT CTGATTTGCT TTCTCAGGCT
GAACACGGCC CGGACTCACA GGTGATTTTA CTGACGCCCG CTGCTGATAT GGCGCGTCGC
GTTGCCGAGG CCGTCGAACG CCAACTGGCA GAACTGCCGC GTGCCGAAAC CGCCCGTCAG
GCACTGAGCG CCAGCCGCCT GATCGTGACC AACGATTTAG CGCAGTGCGT AGCGATTTCC
AACCAGTACG GCCCGGAGCA CCTGATCATT CAGACCCGCA ACGCCCGCGA ACTGGTCGAT
AGCATCACCA GCGCCGGTTC GGTATTTCTT GGTGACTGGT CACCGGAATC CGCAGGTGAT
TACGCCTCCG GCACCAACCA CGTTCTGCCG ACTTACGGTT ACACCGCCAC CTGTTCCAGC
CTCGGGCTGG CGGATTTCCA GAAGCGTATG ACCGTGCAGG AACTGTCGAA AGAAGGCTTC
TCCGCGCTGG CTTCAACCAT TGAAACATTG GCCGCCGCCG AGCGCCTGAC CGCCCATAAA
AATGCCGTTA CTTTGCGTGT TAACGCCCTT AAGGAGCAAG CATGA
 
Protein sequence
MSFNTIIDWN SCTAEQQRQL LMRPAISASE SITRTVNDIL DNVKARGDDA LREYSAKFDK 
TTVTALKVSA EEIAAASERL SDELKQAMAV AVKNIETFHT AQKLPPVDVE TQPGVRCQQV
TRPVASVGLY IPGGSAPLFS TVLMLATPAR IAGCKKVVLC SPPPIADEIL YAAQLCGVQD
VFNVGGAQAI AALAFGTESV PKVDKIFGPG NAFVTEAKRQ VSQRLDGAAI DMPAGPSEVL
VIADSGATPD FVASDLLSQA EHGPDSQVIL LTPAADMARR VAEAVERQLA ELPRAETARQ
ALSASRLIVT NDLAQCVAIS NQYGPEHLII QTRNARELVD SITSAGSVFL GDWSPESAGD
YASGTNHVLP TYGYTATCSS LGLADFQKRM TVQELSKEGF SALASTIETL AAAERLTAHK
NAVTLRVNAL KEQA
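
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: `CODON_TABLE` and `translate` are hypothetical names, and the table is built by hand from the standard codon assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as starts). It assumes the sequence is given 5' to 3' on the coding strand and begins in frame.

```python
from itertools import product

# Codon assignments in TCAG order (standard code; table 11 uses the same ones).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the gene sequence shown above.
print(translate("ATGAGCTTTA ACACAATCAT TGACTGGAAT"))
```

Applied to the final line of the gene sequence, which starts at a codon boundary, the same function yields the last 14 residues of the protein and stops at the terminal TGA.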