Gene EcDH1_2248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2248 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2416380 
End bp2417585 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content56% 
IMG OID 
Productbeta-ketoadipyl CoA thiolase 
Protein accessionACX39896 
Protein GI260449474 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.0780819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGAAG CCTTTATTTG TGACGGAATT CGTACGCCAA TTGGTCGCTA CGGCGGGGCA 
TTATCAAGTG TTCGGGCTGA TGATCTGGCT GCTATCCCTT TGCGGGAACT GCTGGTGCGA
AACCCGCGTC TCGATGCGGA GTGTATCGAT GATGTGATCC TCGGCTGTGC TAATCAGGCG
GGAGAAGATA ACCGTAACGT AGCCCGGATG GCGACTTTAC TGGCGGGGCT GCCGCAGAGT
GTTTCCGGCA CAACCATTAA CCGCTTGTGT GGTTCCGGGC TGGACGCACT GGGGTTTGCC
GCACGGGCGA TTAAAGCGGG CGATGGCGAT TTGCTGATCG CCGGTGGCGT GGAGTCAATG
TCACGGGCAC CGTTTGTTAT GGGCAAGGCA GCCAGTGCAT TTTCTCGTCA GGCTGAGATG
TTCGATACCA CTATTGGCTG GCGATTTGTG AACCCGCTCA TGGCTCAGCA ATTTGGAACT
GACAGCATGC CGGAAACGGC AGAGAATGTA GCTGAACTGT TAAAAATCTC ACGAGAAGAT
CAAGATAGTT TTGCGCTACG CAGTCAGCAA CGTACGGCAA AAGCGCAATC CTCAGGCATT
CTGGCTGAGG AGATTGTTCC GGTTGTGTTG AAAAACAAGA AAGGTGTTGT AACAGAAATA
CAACATGATG AGCATCTGCG CCCGGAAACG ACGCTGGAAC AGTTACGTGG GTTAAAAGCA
CCATTTCGTG CCAATGGGGT GATTACCGCA GGCAATGCTT CCGGGGTGAA TGACGGAGCC
GCTGCGTTGA TTATTGCCAG TGAACAGATG GCAGCAGCGC AAGGACTGAC ACCGCGGGCG
CGTATCGTAG CCATGGCAAC CGCCGGGGTG GAACCGCGCC TGATGGGGCT TGGTCCGGTG
CCTGCAACTC GCCGGGTGCT GGAACGCGCA GGGCTGAGTA TTCACGATAT GGACGTGATT
GAACTGAACG AAGCGTTCGC GGCCCAGGCG TTGGGTGTAC TACGCGAATT GGGGCTGCCT
GATGATGCCC CACATGTTAA CCCCAACGGA GGCGCTATCG CCTTAGGCCA TCCGTTGGGA
ATGAGTGGTG CCCGCCTGGC ACTGGCTGCC AGCCATGAGC TGCATCGGCG TAACGGTCGT
TACGCATTGT GCACCATGTG CATCGGTGTC GGTCAGGGCA TCGCCATGAT TCTGGAGCGT
GTTTGA
 
Protein sequence
MREAFICDGI RTPIGRYGGA LSSVRADDLA AIPLRELLVR NPRLDAECID DVILGCANQA 
GEDNRNVARM ATLLAGLPQS VSGTTINRLC GSGLDALGFA ARAIKAGDGD LLIAGGVESM
SRAPFVMGKA ASAFSRQAEM FDTTIGWRFV NPLMAQQFGT DSMPETAENV AELLKISRED
QDSFALRSQQ RTAKAQSSGI LAEEIVPVVL KNKKGVVTEI QHDEHLRPET TLEQLRGLKA
PFRANGVITA GNASGVNDGA AALIIASEQM AAAQGLTPRA RIVAMATAGV EPRLMGLGPV
PATRRVLERA GLSIHDMDVI ELNEAFAAQA LGVLRELGLP DDAPHVNPNG GAIALGHPLG
MSGARLALAA SHELHRRNGR YALCTMCIGV GQGIAMILER V