Gene EcolC_0125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0125 
Symbol 
ID6068348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp138314 
End bp139852 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content55% 
IMG OID641599527 
Productaldehyde dehydrogenase 
Protein accessionYP_001723136 
Protein GI170018182 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAATA ATCCCCCTTC AGCACAGATT AAGCCCGGCG AGTATGGTTT CCCCCTCAAG 
TTAAAAGCCC GCTATGACAA CTTTATTGGC GGCGAATGGG TAGCCCCTGC CGACGGCGAG
TATTACCAGA ATCTGACGCC GGTGACCGGG CAGCTGCTGT GCGAAGTGGC GTCTTCGGGC
AAACGAGACA TCGATCTGGC GCTGGATGCT GCGCACAAAG TGAAAGATAA ATGGGCGCAC
ACCTCGGTGC AGGATCGTGC GGCGATTCTG TTTAAGATTG CCGATCGAAT GGAACAAAAC
CTCGAGCTGT TAGCGACAGC TGAAACCTGG GATAACGGCA AACCCATTCG CGAAACCAGT
GCTGCTGATG TACCGCTGGC GATTGACCAT TTCCGCTATT TCGCCTCGTG TATCCGGGCA
CAGGAAGGCG GTATCAGTGA AGTTGATAGC GAAACCGTGG CCTATCATTT CCACGAACCG
TTAGGCGTGG TGGGGCAGAT TATTCCGTGG AACTTCCCGC TGCTGATGGC GAGCTGGAAA
ATGGCTCCCG CGCTGGCGGC GGGCAACTGT GTGGTGCTGA AACCCGCACG TCTTACCCCG
CTTTCTGTAC TGCTGCTAAT GGAAATCGTC GGTGATTTAC TGCCGCCGGG CGTGGTGAAC
GTGGTCAACG GCGCAGGTGG GGAAATTGGC GAATATCTGG CGACCTCGAA ACGCATCGCC
AAAGTGGCGT TTACCGGCTC AACGGAAGTG GGCCAACAAA TTATGCAATA CGCCACGCAA
AACATTATTC CGGTGACGCT GGAGCTGGGC GGCAAATCGC CAAATATCTT CTTTGCTGAT
GTGATGGATG AAGAAGATGC CTTTTTCGAT AAAGCGCTGG AAGGCTTTGC ACTGTTTGCC
TTTAACCAGG GCGAAGTTTG CACCTGTCCG AGTCGTGCTT TAGTGCAGGA ATCTATCTAC
GAACGCTTTA TGGAACGCGC CATCCGCCGT GTCGAAAGCA TTCGTAGCGG TAACCCGCTC
GACAGCGTGA CGCAAATGGG CGCGCAGGTT TCTCACGGGC AACTGGAAAC CATCCTCAAC
TACATTGATA TCGGTAAAAA AGAGGGCGCT GACGTGCTCA CAGGCGGGCG GCGCAAGCTG
CTGGAAGGTG AACTGAAAGA CGGCTACTAC CTCGAACCGA CGATTCTGTT TGGTCAGAAC
AATATGCGGG TGTTCCAGGA GGAGATTTTT GGCCCGGTGC TGGCGGTGAC CACCTTCAAA
ACGATGGAAG AAGCGCTGGA GCTGGCGAAC GATACGCAAT ATGGCCTGGG CGCGGGCGTC
TGGAGCCGCA ACGGTAATCT GGCCTATAAG ATGGGGCGCG GCATACAGGC TGGGCGCGTG
TGGACCAACT GTTATCACGC TTACCCGGCA CATGCGGCGT TTGGTGGCTA CAAACAATCA
GGTATCGGTC GCGAAACCCA CAAGATGATG CTGGAGCATT ACCAGCAAAC CAAGTGCCTG
CTGGTGAGCT ACTCGGATAA ACCGTTGGGG CTGTTCTGA
 
Protein sequence
MTNNPPSAQI KPGEYGFPLK LKARYDNFIG GEWVAPADGE YYQNLTPVTG QLLCEVASSG 
KRDIDLALDA AHKVKDKWAH TSVQDRAAIL FKIADRMEQN LELLATAETW DNGKPIRETS
AADVPLAIDH FRYFASCIRA QEGGISEVDS ETVAYHFHEP LGVVGQIIPW NFPLLMASWK
MAPALAAGNC VVLKPARLTP LSVLLLMEIV GDLLPPGVVN VVNGAGGEIG EYLATSKRIA
KVAFTGSTEV GQQIMQYATQ NIIPVTLELG GKSPNIFFAD VMDEEDAFFD KALEGFALFA
FNQGEVCTCP SRALVQESIY ERFMERAIRR VESIRSGNPL DSVTQMGAQV SHGQLETILN
YIDIGKKEGA DVLTGGRRKL LEGELKDGYY LEPTILFGQN NMRVFQEEIF GPVLAVTTFK
TMEEALELAN DTQYGLGAGV WSRNGNLAYK MGRGIQAGRV WTNCYHAYPA HAAFGGYKQS
GIGRETHKMM LEHYQQTKCL LVSYSDKPLG LF