Gene EcHS_A3793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3793 
SymbolaldB 
ID5591730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3784276 
End bp3785814 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content55% 
IMG OID640922907 
Productaldehyde dehydrogenase B 
Protein accessionYP_001460385 
Protein GI157163067 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAATA ATCCCCCTTC AGCACAGATT AAGCCCGGCG AGTATGGTTT CCCCCTCAAG 
TTAAAAGCCC GCTATGACAA CTTTATTGGC GGCGAATGGG TAGCCCCTGC CGACGGCGAG
TATTACCAGA ATCTGACGCC GGTGACCGGG CAGCTGCTGT GCGAAGTGGC GTCTTCGGGC
AAACGAGACA TCGATCTGGC GCTGGATGCT GCGCACAAAG TGAAAGATAA ATGGGCGCAC
ACCTCGGTGC AGGATCGTGC GGCGATTCTG TTTAAGATTG CCGATCGAAT GGAACAAAAC
CTCGAGCTGT TAGCGACAGC TGAAACCTGG GATAACGGCA AACCCATTCG CGAAACCAGT
GCTGCTGATG TACCGCTGGC GATTGACCAT TTCCGCTATT TCGCCTCGTG TATCCGGGCA
CAGGAAGGCG GTATCAGTGA AGTTGATAGC GAAACCGTGG CCTATCATTT CCACGAACCG
TTAGGCGTGG TGGGGCAGAT TATTCCGTGG AACTTCCCGC TGCTGATGGC GAGCTGGAAA
ATGGCTCCCG CGCTGGCGGC GGGCAACTGT GTGGTGCTGA AACCCGCACG TCTTACCCCG
CTTTCTGTAC TGCTGCTAAT GGAAATCGTC GGTGATTTAC TGCCGCCGGG CGTGGTGAAC
GTGGTCAACG GCGCAGGTGG GGAAATTGGC GAATATCTGG CGACCTCGAA ACGCATCGCC
AAAGTGGCGT TTACCGGCTC AACGGAAGTG GGCCAACAAA TTATGCAATA CGCCACGCAA
AACATTATTC CGGTGACGCT GGAGCTGGGC GGCAAATCGC CAAATATCTT CTTTGCTGAT
GTGATGGATG AAGAAGATGC CTTTTTCGAT AAAGCGCTGG AAGGCTTTGC ACTGTTTGCC
TTTAACCAGG GCGAAGTTTG CACCTGTCCG AGTCGTGCTT TAGTGCAGGA ATCTATCTAC
GAACGCTTTA TGGAACGCGC CATCCGCCGT GTCGAAAGCA TTCGTAGCGG TAACCCGCTC
GACAGCGTGA CGCAAATGGG CGCGCAGGTT TCTCACGGGC AACTGGAAAC CATCCTCAAC
TACATTGATA TCGGTAAAAA AGAGGGCGCT GACGTGCTCA CAGGCGGGCG GCGCAAGCTG
CTGGAAGGTG AACTGAAAGA CGGCTACTAC CTCGAACCGA CGATTTTGTT TGGTCAGAAC
AATATGCGCG TGTTCCAGGA GGAGATTTTT GGCCCGGTGC TGGCGGTGAC CACCTTTAAA
ACGATGGATG AGGCGCTGGA GCTGGCGAAC GACACGCAAT ATGGCCTGGG CGCAGGTGTC
TGGAGCCGCA ACGGTAATCT GGCCTATAAG ATGGGGCGCG GCATACAGGC TGGGCGCGTG
TGGACCAACT GCTATCACGC TTACCCGGCA CATGCTGCGT TTGGTGGCTA CAAACAATCA
GGTATCGGTC GCGAAACCCA CAAGATGATG CTGGAGCATT ACCAGCAAAC CAAGTGCCTG
CTGGTGAGCT ACTCGGATAA ACCGTTGGGG CTGTTCTGA
 
Protein sequence
MTNNPPSAQI KPGEYGFPLK LKARYDNFIG GEWVAPADGE YYQNLTPVTG QLLCEVASSG 
KRDIDLALDA AHKVKDKWAH TSVQDRAAIL FKIADRMEQN LELLATAETW DNGKPIRETS
AADVPLAIDH FRYFASCIRA QEGGISEVDS ETVAYHFHEP LGVVGQIIPW NFPLLMASWK
MAPALAAGNC VVLKPARLTP LSVLLLMEIV GDLLPPGVVN VVNGAGGEIG EYLATSKRIA
KVAFTGSTEV GQQIMQYATQ NIIPVTLELG GKSPNIFFAD VMDEEDAFFD KALEGFALFA
FNQGEVCTCP SRALVQESIY ERFMERAIRR VESIRSGNPL DSVTQMGAQV SHGQLETILN
YIDIGKKEGA DVLTGGRRKL LEGELKDGYY LEPTILFGQN NMRVFQEEIF GPVLAVTTFK
TMDEALELAN DTQYGLGAGV WSRNGNLAYK MGRGIQAGRV WTNCYHAYPA HAAFGGYKQS
GIGRETHKMM LEHYQQTKCL LVSYSDKPLG LF