Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2694 |
Symbol | hcaB |
ID | 6143216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2766559 |
End bp | 2767371 |
Gene Length | 813 bp |
Protein Length | 270 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617565 |
Product | 2,3-dihydroxy-2,3-dihydrophenylpropionate dehydrogenase |
Protein accession | YP_001744730 |
Protein GI | 170683398 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATC TGCATAACGA GTCCATCTTT ATTACCGGCG GCGGATCGGG ATTAGGCCTG GCGCTGGTCG AGCGATTTAT CGAAGAAGGC GCGCAGGTTG CCACGCTGGA ACTGTCGGCG GCGAAAGTCG CCAGTCTGCG TCAGCGATTT GGTGAACATA TTCTGGCGGT GGAAGGCAAC GTGACCTGTT ATGCCGATTA TCAACGCGCG CTCGATCAGA TCCTGACTCG TTCTGGCAAA CTGGATTGTT TTATCGGCAA TGCGGGCATC TGGGATCACA ATGCCTCACT GGTTAATACT CCCGCAGAGA CGCTCGAAAC CGGCTTCCAC GAGCTGTTTA ACGTCAACGT ACTCGGTTAC CTGCTGGGCG CAAAAGCCTG CGCTCCGGCG TTAATCGCCA GTGAAGGCAG CATGATTTTC ACACTGTCAA ATGCCGCCTG GTATCCCGGC GGCGGTGGCC CGCTGTACAC CGCCAGTAAA CATGCCGCAA CCGGACTTAT TCGCCAACTG GCTTATGAAC TGGCACCGAA AGTGCGGGTG AATGGCGTCG GCCCGTGTGG TATGGCCAGC GACCTGCGCG GCCCACAGGC GCTCGGGCAA AGTGAAACCT CGATAATGCA GTCTCTGACG CCGGAGAAAA TTGCCGCCAT ATTACCGCTG CAATTTTTCC CGCAACCGGC GGATTTTACG GGTCCGTATG TGATGTTGGC ATCGCGGCGC AATAATCGCG CATTAAGCGG TGTGATGATC AACGCTGATG CGGGTTTAGC GATTCGCGGC ATTCGCCACG TAGCGGCTGG GCTGGATCTT TAA
|
Protein sequence | MSDLHNESIF ITGGGSGLGL ALVERFIEEG AQVATLELSA AKVASLRQRF GEHILAVEGN VTCYADYQRA LDQILTRSGK LDCFIGNAGI WDHNASLVNT PAETLETGFH ELFNVNVLGY LLGAKACAPA LIASEGSMIF TLSNAAWYPG GGGPLYTASK HAATGLIRQL AYELAPKVRV NGVGPCGMAS DLRGPQALGQ SETSIMQSLT PEKIAAILPL QFFPQPADFT GPYVMLASRR NNRALSGVMI NADAGLAIRG IRHVAAGLDL
|
| |