Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01908 |
Symbol | hisD |
ID | 8116338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1986213 |
End bp | 1987517 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644848123 |
Product | hypothetical protein |
Protein accession | YP_002999696 |
Protein GI | 251785392 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTA ACACAATCAT TGACTGGAAT AGCTGTACTG CGGAGCAACA ACGCCAGCTG TTAATGCGCC CGGCGATCTC CGCTTCTGAA AGCATTACCC GCACTGTTAA CGATATTCTC GATAACGTGA AAACGCGTGG CGATGAGGCC CTTCGGGAAT ACAGCGCGAA GTTTGATAAA ACCACGGTTA CCGCGCTGAA GGTGTCTGCT GAGGAGATCG CCGCCGCCAG CGAACGCCTG AGCGACGAGC TAAAACAGGC GATGGCGGTG GCAGTAAAGA ATATTGAAAC CTTCCACACT GCGCAAAAAC TGCCGCCGGT AGATGTAGAA ACGCAGCCAG GCGTACGTTG CCAGCAAGTC ACGCGTCCGG TAGCTTCAGT TGGGTTGTAT ATTCCTGGCG GCTCCGCCCC GCTCTTCTCA ACGGTATTAA TGCTGGCAAC TCCGGCGCGT ATTGCGGGCT GTAAAAAAGT GGTGTTGTGC TCACCGCCGC CGATTGCCGA TGAGATCCTT TATGCGGCGC AGCTGTGCGG TGTGCAGGAC GTGTTTAACG TCGGCGGCGC ACAGGCCATT GCCGCGCTGG CGTTTGGTAC GGAATCTGTG CCGAAAGTGG ACAAAATCTT CGGGCCGGGT AACGCCTTTG TCACCGAAGC AAAACGCCAG GTAAGCCAGC GTCTGGACGG TGCGGCGATC GATATGCCCG CAGGCCCGTC GGAAGTGCTG GTGATTGCTG ACAGCGGCGC TACGCCGGAT TTCGTGGCTT CTGATTTGCT TTCTCAGGCT GAACACGGCC CGGACTCACA GGTGATTTTA CTGACGCCCG ACGCCGATAT GGCGCGTCGC GTTGCCGAGG CTGTCGAACG CCAACTGGCA GAACTGCCGC GAGCTGAAAC CGCCCGCCAG GCACTGAACG CCAGCCGCCT GATCGTGACT AAAGATTTAG CGCAGTGCGT AGAGATCTCC AACCAGTACG GCCCGGAGCA CCTGATCATT CAGACCCGCA ACGCCCGCGA ACTGGTCGAT GGCATCACCA GCGCCGGTTC GGTATTTCTT GGTGACTGGT CACCGGAATC GGCAGGCGAC TATGCCTCCG GCACCAACCA CGTTCTGCCG ACTTACGGTT ACACCGCCAC CTGTTCCAGC CTCGGGCTGG CGGATTTCCA GAAGCGCATG ACCGTGCAGG AACTGTCGAA AGTAGGTTTC TCCGCTCTGG CGTCGACCAT TGAAACACTG GCCGCCGCCG AGCGCCTGAC CGCCCACAAA AATGCCGTTA CTTTGCGTGT TAACGCCCTT AAGGAGCAAG CATGA
|
Protein sequence | MSFNTIIDWN SCTAEQQRQL LMRPAISASE SITRTVNDIL DNVKTRGDEA LREYSAKFDK TTVTALKVSA EEIAAASERL SDELKQAMAV AVKNIETFHT AQKLPPVDVE TQPGVRCQQV TRPVASVGLY IPGGSAPLFS TVLMLATPAR IAGCKKVVLC SPPPIADEIL YAAQLCGVQD VFNVGGAQAI AALAFGTESV PKVDKIFGPG NAFVTEAKRQ VSQRLDGAAI DMPAGPSEVL VIADSGATPD FVASDLLSQA EHGPDSQVIL LTPDADMARR VAEAVERQLA ELPRAETARQ ALNASRLIVT KDLAQCVEIS NQYGPEHLII QTRNARELVD GITSAGSVFL GDWSPESAGD YASGTNHVLP TYGYTATCSS LGLADFQKRM TVQELSKVGF SALASTIETL AAAERLTAHK NAVTLRVNAL KEQA
|
| |