Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1757 |
Symbol | aldA |
ID | 6144472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1761256 |
End bp | 1762695 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616633 |
Product | aldehyde dehydrogenase A |
Protein accession | YP_001743811 |
Protein GI | 170680762 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0000000000249904 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAGTAC CCGTTCAACA TCCTATGTAT ATTGATGGAC AGTTTGTTAC CTGGCGTGGA GACGCATGGA TTGATGTGGT AAACCCTGCA ACAGAGGCTG TCATTTCCCG CATTCCCGAT GGTCAGGCCG AGGATGTCCG TAAGGCAATC GATGCAGCAG AACGTGCACA ACCAGAATGG GAAGCGTTGC CTGCTATTGA ACGCGCCAGT TGGTTGCGCA AAATCTCCGC CGGGATCCGC AAACGCGCCA GCGAAATCAG TGCGCTGATT GTTGAAGAAG GGGGCAAGAT CCAGCAGCTG GCTGAAGTCG AAGTGGCTTT TACTGCGGAC TATATCGATT ACATGGCGGA GTGGGCACGG CGTTATGAAG GCGAGATTAT TCAAAGCGAT CGTCCAGGAG AAAATATTCT TCTGTTTAAA CGTGCGCTTG GCGTGACTAC CGGCATTCTG CCGTGGAACT TCCCGTTCTT CCTCATTGCC CGCAAAATGG CCCCCGCTCT TTTGACCGGT AATACCATCG TCATTAAACC CAGTGAATTT ACGCCAAACA ATGCGATTGC ATTCGCCAAA ATTGTCGATG AAATAGGCCT TCCGCGCGGC GTGTTTAACC TTGTTCTGGG GCGTGGTGAA ACCGTTGGGC AAGAACTGGC GGGTAACCCA AAGGTCGCAA TGGTCAGTAT GACAGGCAGC GTCTCTGCAG GTGAAAAGAT TATGGCGACT GCGGCGAAAA ACATCACCAA AGTGTGCCTG GAATTGGGGG GTAAAGCACC GGCTATCGTA ATGGACGATG CCGATCTTGA ACTGGCAGTC AAAGCCATCG TTGATTCACG CGTTATTAAT AGTGGGCAAG TGTGTAACTG TGCAGAACGT GTTTATGTAC AGAAAGGCAT TTATGATCAG TTTGTCAATC GGCTGGGTGA AGCGTTGCAG GCGGTTCAAT TTGGTAACCC CGCTGAACGC AACGACATTG CGATGGGGCC GTTGATTAAC GCCGCGGCGC TGGAAAGGGT CGAGCAAAAA GTGGCGCGCG CAGTAGAAGA AGGGGCGAGA GTGGCGTTGG GTGGCAAAGC GGTAGAGGGG AAAGGATATT ATTATCCGCC GACATTGCTG CTGGATGTTC GCCAGGAAAT GTCGATTATG CATGAGGAAA CCTTTGGCCC GGTTCTGCCG GTGGTCGCAT TTGATACGCT GGAAGAGGCT ATCTCAATGG CGAATGATAG TGATTACGGC CTGACCTCAT CAATCTATAC CCAAAATCTG AACGTCGCGA TGAAAGCCAT TAACGGGCTG AAGTTTGGTG AAACTTACAT CAACCGTGAA AACTTCGAAG CTATGCAAGG CTTCCACGCC GGATGGCGTA AATCTGGTAT TGGCGGCGCA GATGGTAAAC ATGGCCTGCA TGAATATCTG CAGACCCAGG TGGTTTATTT ACAGTCTTAA
|
Protein sequence | MSVPVQHPMY IDGQFVTWRG DAWIDVVNPA TEAVISRIPD GQAEDVRKAI DAAERAQPEW EALPAIERAS WLRKISAGIR KRASEISALI VEEGGKIQQL AEVEVAFTAD YIDYMAEWAR RYEGEIIQSD RPGENILLFK RALGVTTGIL PWNFPFFLIA RKMAPALLTG NTIVIKPSEF TPNNAIAFAK IVDEIGLPRG VFNLVLGRGE TVGQELAGNP KVAMVSMTGS VSAGEKIMAT AAKNITKVCL ELGGKAPAIV MDDADLELAV KAIVDSRVIN SGQVCNCAER VYVQKGIYDQ FVNRLGEALQ AVQFGNPAER NDIAMGPLIN AAALERVEQK VARAVEEGAR VALGGKAVEG KGYYYPPTLL LDVRQEMSIM HEETFGPVLP VVAFDTLEEA ISMANDSDYG LTSSIYTQNL NVAMKAINGL KFGETYINRE NFEAMQGFHA GWRKSGIGGA DGKHGLHEYL QTQVVYLQS
|
| |