Gene ECH74115_4958 details


Gene Information

Locus tag: ECH74115_4958
Symbol: aldB
ID: 6968293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4599411
End bp: 4600949
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 55%
IMG OID: 643388640
Product: aldehyde dehydrogenase B
Protein accession: YP_002273067
Protein GI: 209397913
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
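
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields in this record.
start_bp = 4599411
end_bp = 4600949
gene_length_bp = 1539
protein_length_aa = 512

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not
# contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions, all three checks agree with the values listed in the record.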


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAATA ATCCCCCTTC AGCACAGATT AAGCCCGGCG AGTATGGTTT CCCCCTCAAG 
TTAAAAAACC GCTATGACAA CTTTATTGGC GGCGAATGGG TAGCCCCTGC CGATGGTGAG
TATTACCAGA ACCTGACGCC GGTGACCGGG CAGCTGCTGT GCGAAGTGGC GTCTTCGGGC
AAACGAGACA TCGATCTGGC GCTGGATGCT GCGCACAAAG TGAAAGATAA ATGGGCGCAC
ACCTCGGTGC AGGATCGCGC GGCGATTCTG TTTAAGATTG CCGATCGAAT GGAACAAAAC
CTCGAGCTGT TAGCGACAGC TGAAGCCTGG GATAACGGCA AACCCATTCG CGAAACCAGT
GCTGCGGATG TACCGCTGGC GATTGACCAT TTCCGCTATT TCGCCTCGTG TATTCGGGCG
CAGGAAGGTG GGATCAGTGA AGTTGATAGC GAAACCGTGG CCTATCATTT CCATGAACCG
TTAGGCGTGG TGGGGCAGAT TATCCCGTGG AACTTCCCGC TGCTGATGGC GAGCTGGAAA
ATGGCTCCCG CGCTGGCGGC GGGCAACTGT GTGGTGCTGA AACCCGCACG TCTTACCCCG
CTTTCTGTAC TGCTGCTAAT GGAAATCGTC GGTGATTTAC TGCCGCCGGG CGTGGTGAAC
GTGGTCAACG GTGCAGGTGG GGAAATTGGC GAATATCTGG CGACCTCGAA ACGCATCGCC
AAAGTGGCGT TTACCGGCTC AACGGAAGTG GGTCAACAAA TTATGCAATA CGCCACGCAA
AACATTATTC CGGTGACGCT GGAGCTGGGC GGTAAGTCGC CAAATATCTT CTTTGCTGAT
GTGATGGATG AAGAAGATGC CTTTTTCGAT AAAGCGCTGG AAGGCTTTGC ACTGTTTGCC
TTTAACCAGG GCGAAGTTTG CACCTGTCCG AGTCGTGCCT TAGTGCAGGA GTCAATCTAC
GAACGCTTTA TGGAACGCGC CATCCGCCGT GTCGAAAGCA TTCGCAGCGG TAACCCACTC
GACAGCGTGA CGCAAATGGG TGCGCAGGTT TCTCACGGGC AACTGGAAAC CATCCTCAAC
TACATTGATA TCGGTAAAAA AGAGGGCGCT GATGTGCTCA CCGGCGGGCG GCGCAAGCTG
CTGGAAGGTG AACTGAAAGA CGGCTACTAC CTCGAACCGA CGATTCTGTT TGGTCAGAAC
AATATGCGCG TGTTCCAGGA GGAGATTTTT GGCCCGGTGC TGGCGGTGAC TACCTTCAAA
ACGATGGAAG AAGCGCTGGA GCTGGCGAAC GATACGCAAT ATGGCCTGGG CGCGGGCGTC
TGGAGCCGCA ACGGTAATCT GGCCTATAAG ATGGGGCGCG GCATACAGGC TGGGCGCGTG
TGGACCAACT GTTATCATGC TTACCCGGCA CATGCGGCGT TTGGTGGCTA CAAACAATCA
GGTATCGGTC GCGAAACCCA CAAGATGATG CTGGAGCATT ACCAGCAAAC CAAGTGCCTG
CTAGTGAGCT ACTCAGATAA ACCGTTGGGG CTGTTCTGA
 
Protein sequence
MTNNPPSAQI KPGEYGFPLK LKNRYDNFIG GEWVAPADGE YYQNLTPVTG QLLCEVASSG 
KRDIDLALDA AHKVKDKWAH TSVQDRAAIL FKIADRMEQN LELLATAEAW DNGKPIRETS
AADVPLAIDH FRYFASCIRA QEGGISEVDS ETVAYHFHEP LGVVGQIIPW NFPLLMASWK
MAPALAAGNC VVLKPARLTP LSVLLLMEIV GDLLPPGVVN VVNGAGGEIG EYLATSKRIA
KVAFTGSTEV GQQIMQYATQ NIIPVTLELG GKSPNIFFAD VMDEEDAFFD KALEGFALFA
FNQGEVCTCP SRALVQESIY ERFMERAIRR VESIRSGNPL DSVTQMGAQV SHGQLETILN
YIDIGKKEGA DVLTGGRRKL LEGELKDGYY LEPTILFGQN NMRVFQEEIF GPVLAVTTFK
TMEEALELAN DTQYGLGAGV WSRNGNLAYK MGRGIQAGRV WTNCYHAYPA HAAFGGYKQS
GIGRETHKMM LEHYQQTKCL LVSYSDKPLG LF