Gene Information |
Locus tag | ECH74115_3773 |
Symbol | hcaB |
ID | 6968101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3495843 |
End bp | 3496655 |
Gene Length | 813 bp |
Protein Length | 270 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643387562 |
Product | 2,3-dihydroxy-2,3-dihydrophenylpropionate dehydrogenase |
Protein accession | YP_002272015 |
Protein GI | 209397724 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
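The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the stated 813 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
START_BP = 3495843
END_BP = 3496655


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 813 270
```

Both results agree with the Gene Length (813 bp) and Protein Length (270 aa) fields in the record.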
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATC TGCATAACGA GTCCATTTTT ATTACCGGCG GCGGATCGGG ATTAGGGCTG GCGCTGGTCG AGCGATTTAT CGAAGAAGGC GCGCAGGTTG CCACGCTGGA ACTGTCGGCG GCGAAAGTCG CCAGTCTGCG TCAGCGATTT GGTGAACATA TTCTGGCGGT GGAAGGCAAC GTGACCTGTT ATGCCGATTA TCAACGCGCG GTCGATCAGA TCCTGACTCG TTCCGGCAAG CTGGATTGTT TTATCGGCAA TGCGGGCATC TGGGATCACA ATGCCTCACT GGTTAATACT CCCGCAGAGA CGCTCGAAAC CGGCTTCCAC GAGCTGTTTA ACGTCAACGT ACTCGGTTAC CTGCTGGGTG CAAAAGCCTG CGCTCCGGCG TTAATCGCCG GTGAAGGCAG CATGATTTTC ACACTATCAA ATGCCGCCTG GTATCCCGGC GGCGGTGGCC CGCTGTACAC CGCCAGTAAA CATGCCGCAA CCGGGCTTAT TCGCCAACTG GCTTATGAAC TGGCTCCGAA AGTACGGGTG AATGGCGTCG GCCCTTGTGG TATGGCCAGC GACCTGCGCG GCCCACAGGC GCTCGGGCAA AGTGAAACCT CAATAATGCA GTCTCTGACG CCGGAGAAAA TTGCCGCCAT TTTACCGCTG CAATTTTTCC CGCAACCGGC TGATTTTACG GGTCCGTATG TGATGTTGGC ATCGCGGCGC AATAATCGCG CATTAAGCGG TGTGATGATC AACGCTGATG CGGGTTTAGC AATTCGCGGC ATTCGCCACG TAGCGGCTGG GTTGGATCTT TAA
|
Protein sequence | MSDLHNESIF ITGGGSGLGL ALVERFIEEG AQVATLELSA AKVASLRQRF GEHILAVEGN VTCYADYQRA VDQILTRSGK LDCFIGNAGI WDHNASLVNT PAETLETGFH ELFNVNVLGY LLGAKACAPA LIAGEGSMIF TLSNAAWYPG GGGPLYTASK HAATGLIRQL AYELAPKVRV NGVGPCGMAS DLRGPQALGQ SETSIMQSLT PEKIAAILPL QFFPQPADFT GPYVMLASRR NNRALSGVMI NADAGLAIRG IRHVAAGLDL
|
| |
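The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and scored for GC fraction. It uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Standard genetic code in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
SAMPLE = "ATGAGCGATC TGCATAACGA GTCCATTTTT"
print(translate(SAMPLE))  # MSDLHNESIF
```

The translated sample matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.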