Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1040 |
Symbol | hisD |
ID | 6142608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1060699 |
End bp | 1062003 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641615927 |
Product | histidinol dehydrogenase |
Protein accession | YP_001743119 |
Protein GI | 170683888 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.204161 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTTA ACACAATCAT TGACTGGAAT AGCTGTACTG CGGAGCAACA ACGCCAGCTG TTAATGCGCC CGGCGATCTC CGCTTCTGAA AGCATTACCC GCACTGTTAA CGATATTCTC GATAACGTGA AAGCACGCGG TGATGACGCC CTGCGGGAAT ACAGCGCGAA GTTTGATAAA ACCACGGTTA CCGCACTGAA GGTGTCTGCT GAGGAAATTG CCGCCGCCAG CGAACGCCTG AGCGACGAGC TAAAACAGGC GATGGCGGTG GCAGTAAAGA ATATTGAAAC CTTCCACACT GCGCAAAAAC TGCCGCCGGT AGATGTTGAA ACTCAGCCTG GTGTGCGTTG CCAGCAGGTC ACGCGCCCGG TAGCTTCGGT TGGTTTGTAT ATTCCTGGCG GTTCCGCCCC GCTCTTCTCA ACGGTATTAA TGCTGGCGAC TCCGGCGCGT ATTGCGGGTT GCAATAAAGT GGTGCTGTGC TCACCGCCGC CGATTGCCGA TGAAATTCTT TACGCGGCGC AGCTATGCGG TGTGCAGGAC GTGTTTAACG TCGGCGGCGC ACAGGCCATT GCCGCGCTGG CGTTTGGTAC GGAATCTGTG CCGAAAGTGG ACAAAATCTT CGGGCCGGGT AACGCCTTTG TCACCGAAGC GAAACGTCAG GTGAGCCAGC GTCTGGACGG TGCGGCGATC GATATGCCCG CAGGCCCGTC GGAAGTGCTG GTGATTGCTG ACAGCGGCGC TACGCCGGAT TTCGTGGCTT CTGATTTGCT TTCTCAGGCT GAACACGGCC CGGATTCACA GGTGATTTTA CTGACGCCTG ACGCTGATAT GGCGCATCAA GTTGCCGAAG CCGTCGAACG CCAGTTAGCA GAACTGCCGC GTGCCGAAAC CGCACGTCAG GCACTGAGCG CCAGCCGCCT GATCGTGACC AACGATTTAG CGCAGTGCGT GGCAATCTCC AACCAGTACG GCCCGGAGCA CCTGATCATT CAGACCCGCA ACGCCCGCGA ACTGGTCGAT AGCATCACCA GCGCCGGTTC GGTATTTCTT GGTGACTGGT CACCGGAATC GGCAGGTGAT TACGCCTCCG GCACCAACCA CGTTCTGCCG ACTTACGGTT ACACCGCCAC CTGTTCCAGC CTCGGACTGG CGGATTTCCA GAAGCGGATG ACCGTGCAGG AACTGTCGAA AGTAGGTTTC TCCGCGCTGG CTTCGACCAT TGAAACACTG GCCGCCGCCG AGCGCCTGAC CGCCCACAAA AATGCCGTTA CTTTGCGTGT TAACGCCCTT AAGGAGCAAG CATGA
|
Protein sequence | MSFNTIIDWN SCTAEQQRQL LMRPAISASE SITRTVNDIL DNVKARGDDA LREYSAKFDK TTVTALKVSA EEIAAASERL SDELKQAMAV AVKNIETFHT AQKLPPVDVE TQPGVRCQQV TRPVASVGLY IPGGSAPLFS TVLMLATPAR IAGCNKVVLC SPPPIADEIL YAAQLCGVQD VFNVGGAQAI AALAFGTESV PKVDKIFGPG NAFVTEAKRQ VSQRLDGAAI DMPAGPSEVL VIADSGATPD FVASDLLSQA EHGPDSQVIL LTPDADMAHQ VAEAVERQLA ELPRAETARQ ALSASRLIVT NDLAQCVAIS NQYGPEHLII QTRNARELVD SITSAGSVFL GDWSPESAGD YASGTNHVLP TYGYTATCSS LGLADFQKRM TVQELSKVGF SALASTIETL AAAERLTAHK NAVTLRVNAL KEQA
|
| |