Gene Information |
Locus tag | EcolC_2243 |
Symbol | |
ID | 6067691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2461672 |
End bp | 2463111 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641601648 |
Product | aldehyde dehydrogenase A |
Protein accession | YP_001725207 |
Protein GI | 170020253 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
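The coordinate and length fields above can be cross-checked against each other. The sketch below is only a sanity check, not part of the record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in Protein Length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 2461672
END_BP = 2463111


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                   # 1440
print(protein_length(gene_length(START_BP, END_BP)))   # 479
```

Under these assumptions both results agree with the Gene Length (1440 bp) and Protein Length (479 aa) fields.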
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTAC CCGTTCAACA TCCTATGTAT ATCGATGGAC AGTTTGTTAC CTGGCGTGGA GACGCATGGA TTGATGTGGT AAACCCTGCT ACAGAGGCTG TCATTTCCCG CATACCCGAT GGTCAGGCCG AGGATGCCCG TAAGGCAATC GATGCAGCAG AACATGCACA ACCAGAATGG GAAGCGTTGC CTGCTATTGA ACGCGCCAGT TGGTTGCGCA AAATCTCCGC CGGGATCCGC GAACGCGCCA GTGAAATCAG TGCGCTGATT GTTGAAGAAG GGGGCAAGAT CCAGCAGCTG GCTGAAGTCG AAGTGGCTTT TACTGCCGAC TATATCGATT ACATGGCGGA GTGGGCACGG CGTTACGAGG GCGAGATTAT TCAAAGCGAT CGTCCAGGAG AAAATATTCT TCTGTTTAAA CGTGCGCTTG GTGTGACTAC CGGCATTCTG CCGTGGAACT TCCCGTTCTT CCTCATTGCC CGCAAAATGG CTCCCGCTCT TTTGACTGGT AATACCATCG TCATTAAACC CAGTGAATTT ACGCCAAACA ATGCGATTGC ATTCGCCAAA ATCGTCGATG AAATAGGCCT TCCGCGCGGC GTGTTTAACC TTGTACTGGG GCGTGGTGAA ACCGTTGGGC AAGAACTGGC GGGTAACCCA AAGGTCGCAA TGGTCAGTAT GACAGGCAGC GTCTCTGCAG GTGAGAAGAT TATGGCGACT GCGGCGAAAA ACATCACCAA AGTGTGCCTG GAATTGGGTG GTAAAGCACC AGCTATCGTA ATGGACGATG CCGATCTTGA ACTGGCAGTC AAAGCCATCG TTGATTCACG CGTCATTAAT AGTGGGCAAG TGTGTAACTG TGCAGAACGT GTTTATGTAC AGAAAGGCAT TTATGATCAG TTCGTCAATC GGCTGGGTGA AGCGATGCAG GCGGTTCAAT TTGGTAACCC CGCTGAACGC AACGACATTG CGATGGGGCC GTTGATTAAC GCCGCGGCGC TGGAAAGGGT CGAGCAAAAA GTGGCGCGCG CAGTAGAAGA AGGGGCGAGA GTGGCGTTGG GTGGCAAAGC AGTAGACGGG AAAGGATATT ATTATCCGCC GACATTGCTG CTGGATGTTC GCCAGGAAAT GTCGATTATG CATGAGGAAA CCTTTGGCCC GGTTCTGCCG GTGGTCGCAT TTGACACGCT GGAAGATGCT ATCTCAATGG CTAATGACAG TGATTACGGC CTGACCTCAT CAATCTATAC CCAAAATCTG AACGTCGCGA TGAAAGCCAT TAAAGGGCTG AAGTTTGGTG AAACTTACAT CAACCGTGAA AACTTCGAAG CTATGCAAGG CTTCCACGCC GGATGGCGTA AATCCGGTAT TGGCGGCGCA GATGGTAAAC ATGGCTTGCA TGAATATCTG CAGACCCAGG TGGTTTATTT ACAGTCTTAA
|
Protein sequence | MSVPVQHPMY IDGQFVTWRG DAWIDVVNPA TEAVISRIPD GQAEDARKAI DAAEHAQPEW EALPAIERAS WLRKISAGIR ERASEISALI VEEGGKIQQL AEVEVAFTAD YIDYMAEWAR RYEGEIIQSD RPGENILLFK RALGVTTGIL PWNFPFFLIA RKMAPALLTG NTIVIKPSEF TPNNAIAFAK IVDEIGLPRG VFNLVLGRGE TVGQELAGNP KVAMVSMTGS VSAGEKIMAT AAKNITKVCL ELGGKAPAIV MDDADLELAV KAIVDSRVIN SGQVCNCAER VYVQKGIYDQ FVNRLGEAMQ AVQFGNPAER NDIAMGPLIN AAALERVEQK VARAVEEGAR VALGGKAVDG KGYYYPPTLL LDVRQEMSIM HEETFGPVLP VVAFDTLEDA ISMANDSDYG LTSSIYTQNL NVAMKAIKGL KFGETYINRE NFEAMQGFHA GWRKSGIGGA DGKHGLHEYL QTQVVYLQS
|
| |
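The gene and protein sequences above can be related with a few lines of code. This is a minimal sketch, assuming the displayed gene sequence is already the coding strand (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and that the spaces are only display grouping. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, so the standard mapping is used here. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Standard codon table in NCBI order (first, second, third base each cycle T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon (fewer than 3 bases) is ignored.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two 10-base groups of the gene sequence shown above.
print(translate("ATGTCAGTAC CCGTTCAACA"))  # MSVPVQ
```

The printed residues match the start of the protein sequence (MSVPVQ...). Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the Protein sequence and GC content fields.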