Gene Information |
Locus tag | EcolC_3703 |
Symbol | |
ID | 6065126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4053496 |
End bp | 4054962 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641603121 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_001726641 |
Protein GI | 170021687 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
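The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(4053496, 4054962)  # Start bp / End bp from the record
    print(length, protein_length_aa(length))
```

With the record's values this gives 1467 bp and 488 aa, which agrees with the Gene Length and Protein Length fields.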
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.195699 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG TAAATCATTG GATCAACGGC AAAAATGTTG CAGGTAACGA CTACTTCCAG ACCACCAATC CGGCAACGGG TGAAGTGCTG GCGGATGTGG CCTCTGGCGG TGAAGCGGAG ATCAATCAGG CGGTAGCGGC AGCGAAAGAG GCGTTCCCGA AATGGGCCAA TCTGCCGATG AAAGAGCGTG CGCGCCTGAT GCGCCGTCTG GGCGATCTGA TCGACCAGAA CGTGCCAGAG ATCGCCGCGA TGGAAACCGC GGACACCGGC CTGCCGATCC ATCAGACCAA AAATGTGTTG ATCCCACGCG CTTCCCACAA CTTTGAATTT TTCGCGGAAG TCTGCCAGCA GATGAACGGC AAGACCTATC CGGTTGACGA CAAGATGCTC AACTACACGC TGGTGCAGCC GGTGGGCGTT TGTGCGCTGG TATCGCCGTG GAACGTACCG TTTATGACCG CCACATGGAA GGTCGCGCCG TGTCTGGCGC TGGGCAATAC CGCGGTACTG AAAATGTCGG AACTCTCCCC GCTGACCGCT GACCGCCTGG GTGAGCTGGC GCTGGAAGCC GGTATTCCGG CAGGCGTGCT GAACGTGGTA CAGGGCTACG GCGCAACCGC AGGGGATGCG CTGGTTCGTC ATCATGACGT ACGTGCCGTG TCGTTCACCG GCGGTACGGC CACCGGGCGC AACATCATGA AAAACGCCGG GCTGAAAAAA TACTCCATGG AACTGGGCGG TAAATCGCCG GTGCTGATTT TTGAAGATGC CGATATTGAA CGCGCGCTGG ACGCCGCCCT GTTCACCATC TTCTCGATCA ACGGCGAGCG CTGCACCGCC GGTTCGCGCA TCTTTATTCA GCAAAGCATC TACCCGGAAT TCGTTAAACG CTTTGCCGAA CGCGCCAACC GTCTGCGCGT GGGCGATCCG AACGATCCGA ATACCCAGGT TGGGGCGCTT ATCAGCCAGC AACACTGGGA TAAAGTCTCC GGCTATATCC GTCTCGGCAT TGAAGAAGGC GCAACCCTGC TGGCGGGCGG CCCGGATAAA CCGTCTGACC TGCCTGCACA CCTGAAAGGC GGCAACTTCC TGCGCCCAAC GGTGCTGGCG GACGTAGATA ACCGTATGCG CGTTGCCCAG GAAGAGATTT TCGGGCCGGT CGCCTGCCTG CTGCCGTTTA AAGACGAAGC CGAAGGCTTA CGTCTGGCAA ACGACGTGGA GTACGGCCTC GCGTCGTACA TCTGGACACA GGATGTCAGC AAAGTGTTAC GCCTGGCGCG TGGCATTGAA GCAGGCATGG TGTTCGTCAA CACCCAGAAC GTGCGTGACC TGCGCCAGCC ATTTGGCGGC GTAAAAGCCT CCGGCACCGG GCGTGAAGGC GGTGAGTACA GCTTCGAAGT GTTCGCGGAA ATGAAGAACG TCTGCATTTC CATGGGCGAC CATCCAATTC CGAAATGGGG AGTCTGA
|
Protein sequence | MKKVNHWING KNVAGNDYFQ TTNPATGEVL ADVASGGEAE INQAVAAAKE AFPKWANLPM KERARLMRRL GDLIDQNVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA DRLGELALEA GIPAGVLNVV QGYGATAGDA LVRHHDVRAV SFTGGTATGR NIMKNAGLKK YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE RANRLRVGDP NDPNTQVGAL ISQQHWDKVS GYIRLGIEEG ATLLAGGPDK PSDLPAHLKG GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS KVLRLARGIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD HPIPKWGV
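The relationship between the two sequence fields and the GC content field can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, the space-separated 10-base blocks are simply stripped before processing, and the codon-to-amino-acid mapping used is the standard one, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons).

```python
from itertools import product

# Standard codon table built in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    first_blocks = "ATGAAAAAAG TAAATCATTG GATCAACGGC"  # first 30 bases of the gene
    print(translate(first_blocks))
```

Running `translate` on the full Gene sequence field should reproduce the Protein sequence field, and `gc_fraction` on the same input can be compared against the reported GC content.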