Gene EcDH1_1333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1333 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1435367 
End bp1436587 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content57% 
IMG OID 
ProductBeta-ketoacyl synthase 
Protein accessionACX39005 
Protein GI260448583 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGTG CAGTGATTAC TGGCCTGGGC ATTGTTTCCA GCATCGGTAA TAACCAGCAG 
GAAGTCCTGG CATCTCTGCG TGAAGGACGT TCAGGGATCA CTTTCTCTCA GGAGCTGAAG
GATTCCGGCA TGCGTAGCCA CGTCTGGGGC AACGTAAAAC TGGATACCAC TGGCCTCATT
GACCGCAAAG TTGTGCGCTT TATGAGCGAC GCATCCATTT ATGCATTCCT TTCTATGGAG
CAGGCAATCG CTGATGCGGG CCTCTCTCCG GAAGCTTACC AGAATAACCC GCGCGTTGGC
CTGATTGCAG GTTCCGGCGG CGGCTCCCCG CGTTTCCAGG TGTTCGGCGC TGACGCAATG
CGCGGCCCGC GCGGCCTGAA AGCGGTTGGC CCGTATGTGG TCACCAAAGC GATGGCATCC
GGCGTTTCTG CCTGCCTCGC CACCCCGTTT AAAATTCATG GCGTTAACTA CTCCATCAGC
TCCGCGTGTG CGACTTCCGC ACACTGTATC GGTAACGCAG TAGAGCAGAT CCAACTGGGC
AAACAGGACA TCGTGTTTGC TGGCGGCGGC GAAGAGCTGT GCTGGGAAAT GGCTTGCGAA
TTCGACGCAA TGGGTGCGCT GTCTACTAAA TACAACGACA CCCCGGAAAA AGCCTCCCGT
ACTTACGACG CTCACCGTGA CGGTTTCGTT ATCGCTGGCG GCGGCGGTAT GGTAGTGGTT
GAAGAGCTGG AACACGCGCT GGCGCGTGGT GCTCACATCT ATGCTGAAAT CGTTGGCTAC
GGCGCAACCT CTGATGGTGC AGACATGGTT GCTCCGTCTG GCGAAGGCGC AGTACGCTGC
ATGAAGATGG CGATGCATGG CGTTGATACC CCAATCGATT ACCTGAACTC CCACGGTACT
TCGACTCCGG TTGGCGACGT GAAAGAGCTG GCAGCTATCC GTGAAGTGTT CGGCGATAAG
AGCCCGGCGA TTTCTGCAAC CAAAGCCATG ACCGGTCACT CTCTGGGCGC TGCTGGCGTA
CAGGAAGCTA TCTACTCTCT GCTGATGCTG GAACACGGCT TTATCGCCCC GAGCATCAAC
ATTGAAGAGC TGGACGAGCA GGCTGCGGGT CTGAACATCG TGACCGAAAC GACCGATCGC
GAACTGACCA CCGTTATGTC TAACAGCTTC GGCTTCGGCG GCACCAACGC CACGCTGGTA
ATGCGCAAGC TGAAAGATTA A
 
Protein sequence
MKRAVITGLG IVSSIGNNQQ EVLASLREGR SGITFSQELK DSGMRSHVWG NVKLDTTGLI 
DRKVVRFMSD ASIYAFLSME QAIADAGLSP EAYQNNPRVG LIAGSGGGSP RFQVFGADAM
RGPRGLKAVG PYVVTKAMAS GVSACLATPF KIHGVNYSIS SACATSAHCI GNAVEQIQLG
KQDIVFAGGG EELCWEMACE FDAMGALSTK YNDTPEKASR TYDAHRDGFV IAGGGGMVVV
EELEHALARG AHIYAEIVGY GATSDGADMV APSGEGAVRC MKMAMHGVDT PIDYLNSHGT
STPVGDVKEL AAIREVFGDK SPAISATKAM TGHSLGAAGV QEAIYSLLML EHGFIAPSIN
IEELDEQAAG LNIVTETTDR ELTTVMSNSF GFGGTNATLV MRKLKD