Gene EcSMS35_2373 details

Gene Information

Locus tag: EcSMS35_2373
Symbol: atoB
ID: 6146927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2409706
End bp: 2410890
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 53%
IMG OID: 641617246
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001744418
Protein GI: 170683363
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
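
The coordinates and lengths listed above are consistent with one another. A minimal sketch of the arithmetic follows. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record does not state either convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2409706
end_bp = 2410890

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # matches the listed Gene Length
print(protein_length, "aa")  # matches the listed Protein Length
```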


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATT GTGTCATCGT CAGTGCGGTA CGTACTGCTA TCGGTAGTTT TAACGGTTCA 
CTCGCTTCCA CCAGCGCCAT CGACCTGGGG GCGACAGTAA TTAAAGCCGC CATTGAACGT
GCAAAAATCG ATTCACTACA CATTGATGAA GTGATTATGG GTAACGTGTT GCAAGCCGGG
CTGGGGCAAA ATCCGGCGCG TCAGGCACTG TTAAAAAGCG GGCTGGCAGA AACGGTGTGC
GGATTCACGG TCAATAAAGT GTGTGGTTCG GGTCTTAAAA GTGTGGCGCT TGCCGCCCAG
GCAATTCAGG CAGGTCAGGC GCAAAGCATT GTGGCGGGGG GTATGGAAAA TATGAGTTTA
GCGCCCTACT TACTCGATGC AAAAGCACGC TCTGGTTATC GTCTTGGAGA CGGACAGGTT
TATGACGTAA TCCTGCGCGA TGGTCTGATG TGCGCCACTC ATGGTTATCA TATGGGGATT
ACCGCCGAGA ACGTGGCCAA AGAGTACGGA ATTACCCGAG AAATGCAGGA TGAACTGGCG
CTACATTCAC AGCGTAAAGC GGCAGCCGCA ATTGAGTCCG GTGCTTTTAC AGCCGAAATC
GTCCCGGTAA ATATTGTCAC CCGGAAGAAA ACCTTCGTCT TCAGTCAAGA CGAATTCCCG
AAAGCGAATT CAACGGCTGA AGCGTTAGGT GCATTGCGCC CGGCCTTCGA TAAAGCAGGA
ACAGTCACCG CCGGGAACGC GTCTGGTATT AATGACGGTG CTGCCGCTCT GGTGATTATG
GAAGAATCTG CGGCGCTGGC AGCAGGCCTT ACCCCCCTGG CTCGCATTAA AAGTTATGCC
AGCGGTGGCG TGCCCCCCGC ATTGATGGGT ATGGGACCAG TACCTGCCAC GCAAAAAGCG
TTACAACTGG CGGGGCTGCA ACTGGCGGAT ATTGATCTCA TTGAGGCTAA TGAAGCATTT
GCCGCACAGT TCCTTGCCGT TGGGAAAACC CTGGGCTTTG ATTCTGAGAA AGTGAATGTC
AACGGCGGGG CCATCGCGCT CGGGCATCCT ATCGGTGCCA GTGGTGCTCG TATTCTGGTC
ACACTATTAC ATGCCATGCA GGCACGCGAT AAAACGCTGG GGCTGGCAAC ACTGTGCATT
GGTGGCGGCC AGGGAATTGC GATGGTGATT GAACGGTTGA ATTAA
 
Protein sequence
MKNCVIVSAV RTAIGSFNGS LASTSAIDLG ATVIKAAIER AKIDSLHIDE VIMGNVLQAG 
LGQNPARQAL LKSGLAETVC GFTVNKVCGS GLKSVALAAQ AIQAGQAQSI VAGGMENMSL
APYLLDAKAR SGYRLGDGQV YDVILRDGLM CATHGYHMGI TAENVAKEYG ITREMQDELA
LHSQRKAAAA IESGAFTAEI VPVNIVTRKK TFVFSQDEFP KANSTAEALG ALRPAFDKAG
TVTAGNASGI NDGAAALVIM EESAALAAGL TPLARIKSYA SGGVPPALMG MGPVPATQKA
LQLAGLQLAD IDLIEANEAF AAQFLAVGKT LGFDSEKVNV NGGAIALGHP IGASGARILV
TLLHAMQARD KTLGLATLCI GGGQGIAMVI ERLN
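
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that and to compute basic properties such as GC content. The function names are hypothetical helpers, not part of any tool associated with this record, and the sample input is only the first two blocks of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """Check that the length is a multiple of 3 and the last codon is a stop.

    TAA, TAG and TGA are the stop codons of translation table 11.
    """
    return len(seq) % 3 == 0 and seq[-3:] in {"TAA", "TAG", "TGA"}


# Sample input: the first two blocks of the gene sequence above.
sample = clean_sequence("ATGAAAAATT GTGTCATCGT")
print(len(sample), gc_fraction(sample), ends_with_stop(sample))
```

Pasting the full gene sequence into `clean_sequence` should give a string whose length and GC fraction can be compared with the Gene Length and GC content fields of the record.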