Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4958 |
Symbol | aldB |
ID | 6968293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4599411 |
End bp | 4600949 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643388640 |
Product | aldehyde dehydrogenase B |
Protein accession | YP_002273067 |
Protein GI | 209397913 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAATA ATCCCCCTTC AGCACAGATT AAGCCCGGCG AGTATGGTTT CCCCCTCAAG TTAAAAAACC GCTATGACAA CTTTATTGGC GGCGAATGGG TAGCCCCTGC CGATGGTGAG TATTACCAGA ACCTGACGCC GGTGACCGGG CAGCTGCTGT GCGAAGTGGC GTCTTCGGGC AAACGAGACA TCGATCTGGC GCTGGATGCT GCGCACAAAG TGAAAGATAA ATGGGCGCAC ACCTCGGTGC AGGATCGCGC GGCGATTCTG TTTAAGATTG CCGATCGAAT GGAACAAAAC CTCGAGCTGT TAGCGACAGC TGAAGCCTGG GATAACGGCA AACCCATTCG CGAAACCAGT GCTGCGGATG TACCGCTGGC GATTGACCAT TTCCGCTATT TCGCCTCGTG TATTCGGGCG CAGGAAGGTG GGATCAGTGA AGTTGATAGC GAAACCGTGG CCTATCATTT CCATGAACCG TTAGGCGTGG TGGGGCAGAT TATCCCGTGG AACTTCCCGC TGCTGATGGC GAGCTGGAAA ATGGCTCCCG CGCTGGCGGC GGGCAACTGT GTGGTGCTGA AACCCGCACG TCTTACCCCG CTTTCTGTAC TGCTGCTAAT GGAAATCGTC GGTGATTTAC TGCCGCCGGG CGTGGTGAAC GTGGTCAACG GTGCAGGTGG GGAAATTGGC GAATATCTGG CGACCTCGAA ACGCATCGCC AAAGTGGCGT TTACCGGCTC AACGGAAGTG GGTCAACAAA TTATGCAATA CGCCACGCAA AACATTATTC CGGTGACGCT GGAGCTGGGC GGTAAGTCGC CAAATATCTT CTTTGCTGAT GTGATGGATG AAGAAGATGC CTTTTTCGAT AAAGCGCTGG AAGGCTTTGC ACTGTTTGCC TTTAACCAGG GCGAAGTTTG CACCTGTCCG AGTCGTGCCT TAGTGCAGGA GTCAATCTAC GAACGCTTTA TGGAACGCGC CATCCGCCGT GTCGAAAGCA TTCGCAGCGG TAACCCACTC GACAGCGTGA CGCAAATGGG TGCGCAGGTT TCTCACGGGC AACTGGAAAC CATCCTCAAC TACATTGATA TCGGTAAAAA AGAGGGCGCT GATGTGCTCA CCGGCGGGCG GCGCAAGCTG CTGGAAGGTG AACTGAAAGA CGGCTACTAC CTCGAACCGA CGATTCTGTT TGGTCAGAAC AATATGCGCG TGTTCCAGGA GGAGATTTTT GGCCCGGTGC TGGCGGTGAC TACCTTCAAA ACGATGGAAG AAGCGCTGGA GCTGGCGAAC GATACGCAAT ATGGCCTGGG CGCGGGCGTC TGGAGCCGCA ACGGTAATCT GGCCTATAAG ATGGGGCGCG GCATACAGGC TGGGCGCGTG TGGACCAACT GTTATCATGC TTACCCGGCA CATGCGGCGT TTGGTGGCTA CAAACAATCA GGTATCGGTC GCGAAACCCA CAAGATGATG CTGGAGCATT ACCAGCAAAC CAAGTGCCTG CTAGTGAGCT ACTCAGATAA ACCGTTGGGG CTGTTCTGA
|
Protein sequence | MTNNPPSAQI KPGEYGFPLK LKNRYDNFIG GEWVAPADGE YYQNLTPVTG QLLCEVASSG KRDIDLALDA AHKVKDKWAH TSVQDRAAIL FKIADRMEQN LELLATAEAW DNGKPIRETS AADVPLAIDH FRYFASCIRA QEGGISEVDS ETVAYHFHEP LGVVGQIIPW NFPLLMASWK MAPALAAGNC VVLKPARLTP LSVLLLMEIV GDLLPPGVVN VVNGAGGEIG EYLATSKRIA KVAFTGSTEV GQQIMQYATQ NIIPVTLELG GKSPNIFFAD VMDEEDAFFD KALEGFALFA FNQGEVCTCP SRALVQESIY ERFMERAIRR VESIRSGNPL DSVTQMGAQV SHGQLETILN YIDIGKKEGA DVLTGGRRKL LEGELKDGYY LEPTILFGQN NMRVFQEEIF GPVLAVTTFK TMEEALELAN DTQYGLGAGV WSRNGNLAYK MGRGIQAGRV WTNCYHAYPA HAAFGGYKQS GIGRETHKMM LEHYQQTKCL LVSYSDKPLG LF
|
| |