Gene EcSMS35_3915 details


Gene Information

Locus tag: EcSMS35_3915
Symbol: aldB
ID: 6143619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3985280
End bp: 3986818
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 55%
IMG OID: 641618741
Product: aldehyde dehydrogenase B
Protein accession: YP_001745880
Protein GI: 170679828
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
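
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but does not encode an amino acid.
    return codons - 1 if has_stop else codons


length = gene_length(3985280, 3986818)
print(length)                  # 1539
print(protein_length(length))  # 512
```

Both values agree with the Gene Length and Protein Length fields of this record.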


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAATA ATCCCCCTTC AGCACAGATT AAGCCCGGCG AGTATGGTTT CCCCCTCAAG 
TTAAAAGCCC GCTACGACAA CTTTATTGGC GGCGAATGGG TAGCCCCTGC CGACGGCGAG
TATTACCAGA ACCTGACGCC GGTGACCGGA CAGCTGCTGT GCGAAGTGGC GTCTTCGGGC
AAACGAGACA TCGATCTGGC GCTGGATGCT GCGCACAAAG TGAAAGATAA ATGGGCGCAT
ACTTCGGTGC AGGATCGCGC GGCGATTCTG TTTAAGATTG CCGATCGGAT GGAGCAGAAC
CTCGAACTGT TAGCGACCGC TGAAACCTGG GATAACGGCA AACCCATTCG CGAAACCAGT
GCTGCTGATG TGCCGCTGGC GATTGACCAT TTTCGCTATT TTGCCTCGTG TATTCGGGCG
CAGGAAGGCG GTATTAGTGA AGTGGATAGC GAAACTGTGG CTTATCACTT CCACGAACCG
TTAGGCGTGG TGGGGCAGAT TATTCCGTGG AACTTCCCGC TGCTGATGGC GAGCTGGAAA
ATGGCTCCCG CGCTGGCGGC GGGCAACTGT GTGGTGCTGA AACCCGCACG TCTTACCCCG
CTCTCAGTAC TGCTGCTAAT GGAAATCGTC GGTGATTTAC TGCCGCCGGG CGTGGTGAAC
GTGGTCAACG GTGCGGGCGG AGAAATTGGC GAATATCTGG CGACCTCGAA ACGCATCGCC
AAAGTGGCGT TTACCGGCTC AACGGAAGTG GGCCAACAAA TTATGCAGTA CGCCACGCAA
AACATTATTC CGGTGACGCT GGAGCTGGGC GGCAAATCCC CAAATATCTT CTTTGCTGAT
GTGATGGATG AAGAAGATGC CTTTTTCGAT AAAGCGCTGG AAGGCTTTGC ACTGTTTGCC
TTTAACCAGG GCGAAGTTTG CACCTGTCCA AGCCGTGCTT TAGTGCAGGA GTCTATCTAC
GAACGCTTTA TGGAACGCGC CATCCGCCGT GTCGAAAGCA TTCGCAGCGG TAACCCGCTC
GACAGCGTGA CGCAAATGGG GGCGCAGGTT TCGCACGGGC AATTGGAAAC CATCCTCAAC
TACATTGATA TTGGTAAAAA AGAGGGCGCA GACGTGCTCA CCGGCGGGCG GCGCAAGCTG
CTGGAAGGCG AACTGAAAGA CGGCTACTAC CTCGAACCGA CGATTCTGTT TGGTCAGAAT
AATATGCGCG TGTTCCAGGA AGAAATTTTT GGTCCGGTGC TGGCGGTGAC CACCTTCAAA
ACGATGGAAG AAGCGCTGGA GCTGGCGAAC GACACGCAAT ATGGCCTGGG CGCGGGTGTC
TGGAGCCGCA ACGGTAATCT GGCCTATAAG ATGGGGCGCG GCATTCAGGC CGGGCGCGTG
TGGACCAACT GTTATCACGC TTACCCGGCA CATGCGGCGT TTGGCGGCTA CAAACAGTCG
GGCATCGGAC GCGAAACCCA TAAGATGATG CTTGAGCATT ACCAGCAAAC CAAGTGCCTG
CTGGTGAGCT ACTCGGATAA ACCGTTGGGG CTGTTCTGA
 
Protein sequence
MTNNPPSAQI KPGEYGFPLK LKARYDNFIG GEWVAPADGE YYQNLTPVTG QLLCEVASSG 
KRDIDLALDA AHKVKDKWAH TSVQDRAAIL FKIADRMEQN LELLATAETW DNGKPIRETS
AADVPLAIDH FRYFASCIRA QEGGISEVDS ETVAYHFHEP LGVVGQIIPW NFPLLMASWK
MAPALAAGNC VVLKPARLTP LSVLLLMEIV GDLLPPGVVN VVNGAGGEIG EYLATSKRIA
KVAFTGSTEV GQQIMQYATQ NIIPVTLELG GKSPNIFFAD VMDEEDAFFD KALEGFALFA
FNQGEVCTCP SRALVQESIY ERFMERAIRR VESIRSGNPL DSVTQMGAQV SHGQLETILN
YIDIGKKEGA DVLTGGRRKL LEGELKDGYY LEPTILFGQN NMRVFQEEIF GPVLAVTTFK
TMEEALELAN DTQYGLGAGV WSRNGNLAYK MGRGIQAGRV WTNCYHAYPA HAAFGGYKQS
GIGRETHKMM LEHYQQTKCL LVSYSDKPLG LF
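
The sequences above are laid out in blocks of ten characters, so they need to be normalised before any analysis. The following is a minimal sketch of that step together with a GC-content calculation of the kind summarised in the GC content field; the helper names are hypothetical, and only the first line of the gene sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as shown in this record.
first_line = "ATGACCAATA ATCCCCCTTC AGCACAGATT AAGCCCGGCG AGTATGGTTT CCCCCTCAAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length equals the Gene Length field, which is a quick way to confirm that nothing was lost when copying the blocks.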