Gene EcSMS35_2992 details

Gene Information

Locus tag: EcSMS35_2992
Symbol:
ID: 6144728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3073044
End bp: 3074225
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 53%
IMG OID: 641617861
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001745013
Protein GI: 170680410
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
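
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3073044
END_BP = 3074225
GENE_LENGTH_BP = 1182
PROTEIN_LENGTH_AA = 393


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1182
print(protein_length(GENE_LENGTH_BP))  # 393
```

Under those assumptions the reported gene length (1182 bp) and protein length (393 aa) are consistent with the coordinates.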


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.936911
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.139342
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGACG TTGTGATTGT CGGGGCGTTA CGGACACCTA TCGGCTGCTT TCGTGGTGCG 
TTAGCGGGTC ATTCCGCCGT GGAACTTGGC AGCCTGGTCG TGAAAGCGTT AATAGAACGT
ACCGGTGTTC CTGCATATGC GGTGGATGAA GTGATTCTTG GTCAGGTGTT GACTGCAGGG
GCAGGGCAGA ATCCGGCAAG GCAATCGGCT ATTAAAGGTG GTCTTCCTAA CAGCGTTTCT
GCAATCACTA TTAATGACGT TTGCGGTTCC GGGCTTAAAG CACTGCATCT GGCTACTCAG
GCGATACAGT GTGGCGAGGC TGATATTGTC ATCGCCGGTG GCCAGGAAAA CATGAGCCGC
GCACCACATG TTCTGACTGA TAGCCGCACC GGTGCACAGC TTGGCAATAG CCAGTTGGTT
GATAGTCTTG TGCACGATGG GTTGTGGGAT GCCTTCAATG ATTATCATAT TGGTGTCACC
GCCGAAAATC TGGCTCGCGA ATATGGCATC AGCCGTCAGT TGCAGGATGC TTACGCACTT
AGCTCGCAAC AAAAAGCGCG AGCGGCGATT GACGCCGGAC GATTTAAAGA TGAGATCGTC
CCGGTAATGA CCCAAAGTAA CGGTCAGACG TTGGTTGTAG ATACCGATGA ACAGCCACGC
ACTGACGCCA GTGCAGAAGG CTTAGCCCGT TTAAATCCTT CATTTGATAG TCTCGGCTCT
GTGACAGCGG GTAATGCATC ATCCATAAAC GATGGCGCAG CTGCAGTAAT GATGATGAGC
GAAGCCAAAG CACGAGCGTT GAATTTACCC GTGCTGGCCC GCATCCGCGC ATTTGCCAGC
GTTGGTGTAG ATCCGGCATT GATGGGAATT GCGCCGGTGT ATGCGACCCG CCGTTGCCTG
GAGCGTGTTG GCTGGCAGTT GGCTGATGTC GATCTTATCG AGGCTAATGA AGCGTTTGCT
GCACAGGCGC TTTCGGTTGG CAAGATGCTT GAATGGGATG AGCGTCGGGT CAATGTCAAT
GGTGGCGCGA TCGCACTCGG TCATCCAATA GGCGCTTCCG GTTGCCGAAT CCTCGTTTCT
CTGGTTCATG AAATGGTGAA ACGTAATGCC CGCAAAGGAC TGGCTACGCT TTGTATCGGC
GGGGGCCAGG GCGTGGCATT GACCATTGAA CGTGACGAAT AG
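
A GC content figure such as the one reported in the Gene Information fields is conventionally the percentage of G and C bases in the sequence. The sketch below shows one way to compute it from a sequence laid out in blocks of ten as above. The function name `gc_content` is hypothetical, and how the database rounds its value is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace used for display."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First block of ten bases from the gene sequence above (4 of 10 are G or C).
print(gc_content("ATGAAAGACG"))  # 40.0
```

Passing the full gene sequence as a single string gives the figure to compare against the reported GC content.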
 
Protein sequence
MKDVVIVGAL RTPIGCFRGA LAGHSAVELG SLVVKALIER TGVPAYAVDE VILGQVLTAG 
AGQNPARQSA IKGGLPNSVS AITINDVCGS GLKALHLATQ AIQCGEADIV IAGGQENMSR
APHVLTDSRT GAQLGNSQLV DSLVHDGLWD AFNDYHIGVT AENLAREYGI SRQLQDAYAL
SSQQKARAAI DAGRFKDEIV PVMTQSNGQT LVVDTDEQPR TDASAEGLAR LNPSFDSLGS
VTAGNASSIN DGAAAVMMMS EAKARALNLP VLARIRAFAS VGVDPALMGI APVYATRRCL
ERVGWQLADV DLIEANEAFA AQALSVGKML EWDERRVNVN GGAIALGHPI GASGCRILVS
LVHEMVKRNA RKGLATLCIG GGQGVALTIE RDE
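
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It builds the codon assignments from the compact NCBI-style amino acid string, which are the same for translation table 11 as for the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAAGACG TTGTGATTGT CGGGGCGTTA"))  # MKDVVIVGAL
# Last twelve bases of the gene sequence, ending in the TAG stop codon.
print(translate("CGTGACGAAT AG"))  # RDE
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.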