Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3915 |
Symbol | aldB |
ID | 6143619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3985280 |
End bp | 3986818 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618741 |
Product | aldehyde dehydrogenase B |
Protein accession | YP_001745880 |
Protein GI | 170679828 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAATA ATCCCCCTTC AGCACAGATT AAGCCCGGCG AGTATGGTTT CCCCCTCAAG TTAAAAGCCC GCTACGACAA CTTTATTGGC GGCGAATGGG TAGCCCCTGC CGACGGCGAG TATTACCAGA ACCTGACGCC GGTGACCGGA CAGCTGCTGT GCGAAGTGGC GTCTTCGGGC AAACGAGACA TCGATCTGGC GCTGGATGCT GCGCACAAAG TGAAAGATAA ATGGGCGCAT ACTTCGGTGC AGGATCGCGC GGCGATTCTG TTTAAGATTG CCGATCGGAT GGAGCAGAAC CTCGAACTGT TAGCGACCGC TGAAACCTGG GATAACGGCA AACCCATTCG CGAAACCAGT GCTGCTGATG TGCCGCTGGC GATTGACCAT TTTCGCTATT TTGCCTCGTG TATTCGGGCG CAGGAAGGCG GTATTAGTGA AGTGGATAGC GAAACTGTGG CTTATCACTT CCACGAACCG TTAGGCGTGG TGGGGCAGAT TATTCCGTGG AACTTCCCGC TGCTGATGGC GAGCTGGAAA ATGGCTCCCG CGCTGGCGGC GGGCAACTGT GTGGTGCTGA AACCCGCACG TCTTACCCCG CTCTCAGTAC TGCTGCTAAT GGAAATCGTC GGTGATTTAC TGCCGCCGGG CGTGGTGAAC GTGGTCAACG GTGCGGGCGG AGAAATTGGC GAATATCTGG CGACCTCGAA ACGCATCGCC AAAGTGGCGT TTACCGGCTC AACGGAAGTG GGCCAACAAA TTATGCAGTA CGCCACGCAA AACATTATTC CGGTGACGCT GGAGCTGGGC GGCAAATCCC CAAATATCTT CTTTGCTGAT GTGATGGATG AAGAAGATGC CTTTTTCGAT AAAGCGCTGG AAGGCTTTGC ACTGTTTGCC TTTAACCAGG GCGAAGTTTG CACCTGTCCA AGCCGTGCTT TAGTGCAGGA GTCTATCTAC GAACGCTTTA TGGAACGCGC CATCCGCCGT GTCGAAAGCA TTCGCAGCGG TAACCCGCTC GACAGCGTGA CGCAAATGGG GGCGCAGGTT TCGCACGGGC AATTGGAAAC CATCCTCAAC TACATTGATA TTGGTAAAAA AGAGGGCGCA GACGTGCTCA CCGGCGGGCG GCGCAAGCTG CTGGAAGGCG AACTGAAAGA CGGCTACTAC CTCGAACCGA CGATTCTGTT TGGTCAGAAT AATATGCGCG TGTTCCAGGA AGAAATTTTT GGTCCGGTGC TGGCGGTGAC CACCTTCAAA ACGATGGAAG AAGCGCTGGA GCTGGCGAAC GACACGCAAT ATGGCCTGGG CGCGGGTGTC TGGAGCCGCA ACGGTAATCT GGCCTATAAG ATGGGGCGCG GCATTCAGGC CGGGCGCGTG TGGACCAACT GTTATCACGC TTACCCGGCA CATGCGGCGT TTGGCGGCTA CAAACAGTCG GGCATCGGAC GCGAAACCCA TAAGATGATG CTTGAGCATT ACCAGCAAAC CAAGTGCCTG CTGGTGAGCT ACTCGGATAA ACCGTTGGGG CTGTTCTGA
|
Protein sequence | MTNNPPSAQI KPGEYGFPLK LKARYDNFIG GEWVAPADGE YYQNLTPVTG QLLCEVASSG KRDIDLALDA AHKVKDKWAH TSVQDRAAIL FKIADRMEQN LELLATAETW DNGKPIRETS AADVPLAIDH FRYFASCIRA QEGGISEVDS ETVAYHFHEP LGVVGQIIPW NFPLLMASWK MAPALAAGNC VVLKPARLTP LSVLLLMEIV GDLLPPGVVN VVNGAGGEIG EYLATSKRIA KVAFTGSTEV GQQIMQYATQ NIIPVTLELG GKSPNIFFAD VMDEEDAFFD KALEGFALFA FNQGEVCTCP SRALVQESIY ERFMERAIRR VESIRSGNPL DSVTQMGAQV SHGQLETILN YIDIGKKEGA DVLTGGRRKL LEGELKDGYY LEPTILFGQN NMRVFQEEIF GPVLAVTTFK TMEEALELAN DTQYGLGAGV WSRNGNLAYK MGRGIQAGRV WTNCYHAYPA HAAFGGYKQS GIGRETHKMM LEHYQQTKCL LVSYSDKPLG LF
|
| |