Gene Information |
Locus tag | EcE24377A_1596 |
Symbol | aldA |
ID | 5587726 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 1588203 |
End bp | 1589642 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640925285 |
Product | aldehyde dehydrogenase A |
Protein accession | YP_001462690 |
Protein GI | 157154889 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
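The coordinate and length fields in the record above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming 1-based inclusive coordinates and a single terminal stop codon (both are assumptions, although they are consistent with the listed values). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Protein length in aa for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
print(gene_length(1588203, 1589642))  # 1440
print(protein_length(1440))           # 479
```

Both results agree with the "Gene Length" and "Protein Length" fields of the record.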
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGTAC CCGTTCAACA TTCTATGTAT ATCGATGGAC AGTTTGTTAC CTGGCGTGGA GACGCATGGA TTGATGTGGT AAACCCTGCT ACAGAGGCTG TCATTTCCCG CATACCCGAT GGTCAGGCCG AGGATGCCCG TAAGGCAATC GATGCAGCAG AACGTGCACA ACCAGAATGG GAAGCGTTGC CTGCTATTGA ACGCGCCAGT TGGTTGCGCA AAATCTCCGC CGGGATCCGC GAACGCGCCA GTGAAATCAG TGCGCTGATT GTTGAAGAAG GGGGCAAGAT CCAGCAGCTG GCTGAAGTCG AAGTGGCTTT TACTGCCGAC TATATCGATT ACATGGCGGA GTGGGCACGG CGTTACGAGG GCGAGATTAT TCAAAGCGAT CGTCCAGGAG AAAATATTCT TTTGTTTAAA CGTGCGCTTG GTGTGACTAC CGGCATTCTG CCGTGGAACT TCCCGTTCTT CCTCATTGCC CGCAAAATGG CTCCCGCTCT TTTGACCGGT AATACCATCG TCATTAAACC TAGTGAATTT ACGCCAAACA ATGCGATTGC ATTCGCCAAA ATCGTCGATG AAATAGGACT TCCGCGCGGC GTGTTTAACC TTGTACTGGG GCGTGGTGAA ACCGTTGGGC AAGAACTGGC GGGTAACCCA AAGGTCGCAA TGGTCAGTAT GACAGGCAGC GTCTCTGCAG GTGAGAAGAT CATGGCGACT GCGGCGAAAA ACATCACCAA AGTGTGTCTG GAATTGGGGG GTAAAGCACC AGCTATCGTA ATGGACGATG CCGATCTTGA ACTGGCAGTC AAAGCCATCG TTGATTCACG CGTCATTAAT AGTGGGCAAG TGTGTAACTG TGCAGAACGT GTTTATGTAC AGAAAGGCAT TTATGATCAG TTCGTCAATC GGCTGGGTGA AGCGATGCAG GCGGTTCAAT TTGGTAACCC CGCTGAACGC AACGACATTG CGATGGGGCC GTTGATTAAC GCCGCGGCGC TGGAAAGGGT TGAGCAAAAA GTGGCGCGCG CAGTAGAAGA AGGGGCGAGA GTGGCGTTGG GTGGCAAAGC GGTAGAGGGG AAAGGATATT ATTATCCGCC GACATTGCTG CTGGATGTTC GCCAGGAAAT GTCGATTATG CATGAGGAAA CCTTTGGCCC GGTTCTGCCG GTGGTCGCAT TTGACACGCT GGAAGAGGCT ATCTCAATGG CTAATGACAG TGATTACGGC CTGACCTCAT CAATCTATAC CCAAAATCTG AACGTCGCGA TGAAAGCCAT TAAAGGGCTG AAGTTTGGTG AAACTTACAT CAACCGTGAA AACTTCGAAG CTATGCAAGG CTTCCACGCC GGATGGCGTA AATCCGGTAT TGGCGGCGCA GATGGTAAAC ATGGCCTGCA TGAATATCTG CAGACCCAGG TGGTTTATTT ACAGTCGTAA
|
Protein sequence | MSVPVQHSMY IDGQFVTWRG DAWIDVVNPA TEAVISRIPD GQAEDARKAI DAAERAQPEW EALPAIERAS WLRKISAGIR ERASEISALI VEEGGKIQQL AEVEVAFTAD YIDYMAEWAR RYEGEIIQSD RPGENILLFK RALGVTTGIL PWNFPFFLIA RKMAPALLTG NTIVIKPSEF TPNNAIAFAK IVDEIGLPRG VFNLVLGRGE TVGQELAGNP KVAMVSMTGS VSAGEKIMAT AAKNITKVCL ELGGKAPAIV MDDADLELAV KAIVDSRVIN SGQVCNCAER VYVQKGIYDQ FVNRLGEAMQ AVQFGNPAER NDIAMGPLIN AAALERVEQK VARAVEEGAR VALGGKAVEG KGYYYPPTLL LDVRQEMSIM HEETFGPVLP VVAFDTLEEA ISMANDSDYG LTSSIYTQNL NVAMKAIKGL KFGETYINRE NFEAMQGFHA GWRKSGIGGA DGKHGLHEYL QTQVVYLQS
|
| |
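The gene and protein sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below shows one way to do that and to recompute fields such as the GC content. All function names are hypothetical, and the start-codon check is a simplification: translation table 11 also permits alternative start codons, which this sketch ignores.

```python
def clean_sequence(raw):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(raw):
    """Rough check: length divisible by 3, ATG start, standard stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Toy inputs, not the full record.
print(clean_sequence("ATGTCAGTAC CCGTTCAACA"))  # ATGTCAGTACCCGTTCAACA
print(gc_fraction("ATGC GCTA"))                 # 0.5
print(looks_like_complete_cds("ATG TCA TAA"))   # True
```

To apply this to the record, pass the full "Gene sequence" value to `gc_fraction` and compare the rounded percentage with the "GC content" field.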