Gene EcSMS35_2737 details


Gene Information

Locus tag: EcSMS35_2737
Symbol:
ID: 6143910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2814997
End bp: 2817657
Gene length: 2661 bp
Protein length: 886 aa
Translation table: 11
GC content: 54%
IMG OID: 641617608
Product: CoA-binding domain/acetyltransferase domain-containing protein
Protein accession: YP_001744773
Protein GI: 170681301
COG category: [C] Energy production and conversion; [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1042] Acyl-CoA synthetase (NDP forming); [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins
TIGRFAM ID:
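The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from this record.
START_BP, END_BP = 2814997, 2817657

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 2661
print(protein_length_aa(length))  # 886
```

Both results match the "Gene length" and "Protein length" fields of the record.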


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.028573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAGC GAGGACTGGA AGCACTACTG CGACCAAAAT CGATTGCGGT AATTGGCGCA 
TCGATGAAAC CCAATCGCGC AGGTTACCTG ATGATGCGTA ACCTGCTGGC GGGAGGCTTT
AACGGACCGG TACTCCCGGT GACGCCTGCC TGGAAAGCAG TGTTGGGTGT GTTGGCCTGG
CCGGATATTG CCAGCTTGCC CTTTACACCA GACCTTGCGG TTTTATGTAC CAATGCCAGC
CGTAATCTTG CTCTTCTGGA AGAGCTCGGC GAGAAAGGCT GTAAAACCTG CATTATTCTT
TCCGCCCCGT CATCGCAACA CGAAGATCTC CGCGCCTGCG CCCTGCGCCA TAACATGCGC
CTGCTTGGAC CAAACAGTCT GGGTTTACTG GCTCCCTGGC AAGGTCTGAA TGCCAGCTTT
TCGCCTGTGC CGATTAAACG CGGCAAGCTG GCGTTTATTT CGCAATCGGC TGCTGTCTCC
AACACCATCC TCGACTGGGC GCAACAGCGT GAGATGGGCT TTTCCTACTT TATTGCGCTC
GGCGACAGCC TGGATATCGA CGTTGATGAA TTGCTCGACT ATCTGGCGCG CGACAGTAAA
ACCAGCGCCA TCCTGCTCTA TCTCGAACAG TTAAGCGACG CGCGACGCTT TGTTTCGGCG
GCCCGTAGTG CCTCGCGCAA TAAACCGATT CTGGTGATTA AAAGCGGACG TAGCCCGGCG
GCTCAACGTC TGTTGAACAC CACGGCAGGA ATGGATCCAG CGTGGGATGC GGCTATTCAG
CGTGCCGGTT TGTTGCGGGT ACAGGATACC CACGAACTGT TTTCGGCGGT GGAAACCCTT
AGCCATATGC GCCCGCTGCG TGGTGACCGG TTGATGATTA TCAGCAACGG TGCTGCGCCT
GCCGCGCTGG CGCTGGATGC CTTATGGTCA CGCAATGGCA AGCTGGCAAC GCTAAGCGAA
GAAACCTGCC AGAAACTGCG CGATGCACTG CCAGAACATG TGGCAATCTC TAACCCGCTC
GATCTACGCG ATGACGCCAG CAGTGAGCAC TATATTAAAA CGCTGGATAT CCTGCTCCAC
AGCCAGGATT TTGACGCGCT GATGGTTATT CATTCGCCCA GCGCCGCTGC TCCCGCAACA
GAAAGCGCGC AAGCATTAAT TGAAGCGGTA AAGCATCATC CCCGCAGCAA GTATGTTTCT
CTGCTGACGA ACTGGTGTGG CGAGCACTCC TCGCAAGAGG CACGACGATT ATTCAGTGAA
GCCGGGCTGC CGACCTATCG CACGCCGGAA GGAACCATCA CTGCTTTTAT GCATATGGTG
GAGTACCGGC GTAATCAGAA GCAACTACGC GAAACGCCGG CGTTGCCCAG CAATCTGACC
TCCAATACCG CAGAAGCGCA TCTTCTGTTG CAACAGGCGA TTGCCGAAGG AGCAACGTCA
CTCGATACCC ATGAAGTTCA GCCCATCCTG CAAGCATATG GCATGAACAC GCTCCCCACC
TGGATTGCCA GCGATAGCAC CGAAGCGGTG CATATTGCCG AACAGATTGG TTATCCGGTG
GCGCTGAAAT TGCGTTCGCC GGATATTCCA CATAAATCGG AAGTTCAGGG CGTCATGCTC
TACCTGCGTA CAGCTAATGA AGTCCAGCAA GCGGCAAACG CTATTTTCGA TCGCGTAAAA
ATGGCCTGGC CGCAGGCGCG GATCCACGGC CTGTTGGTGC AAAGTATGGC TAACCGTGCT
GGCGCTCAGG AGTTGCGGGT TGTGGTTGAG CACGATCCGG TTTTCGGGCC GTTGATCATG
CTGGGCGAAG GCGGTGTGGA GTGGCGTCCT GAAGATCAAG CCGTCGTCGC GTTACCACCG
CTGAACATGA ACCTGGCCCG CTATCTGGTT ATTCAGGGGA TCAAAAGTAA AAAGATTCGT
GCGCGCAGTG CGCTACGCCC ATTGGATGTT GCAGGCTTGA GCCAGCTTCT GGTACAGGTT
TCCAACTTGA TTGTCGATTG TCCGGAAATT CAACGTCTGG ATATTCATCC TTTGCTGGCT
TCTGGCAGTG AATTTACCGC GCTGGATGTC ACGCTGGATA TCGCGCCGTT TGAAGGCGAT
AACGAGAGCC GGCTGGCAGT GCGCCCTTAT CCGCATCAGC TGGAAGAGTG GGTAGAATTG
AAAAACGGTG AACGCTGCTT GTTCCGCCCG ATTTTGCCAG AAGATGAGCC ACAACTTCAG
CAGTTCATTT CGCGAGTCAC CAAAGAAGAT CTTTATTACC GCTACTTTAG CGAGATCAAC
GAATTTACCC ATGAAGATTT AGCCAACATG ACGCAGATCG ACTACGATCG GGAAATGGCA
TTTGTAGCGG TACGACGTAT TGATCAAACG GAAGAGATCC TCGGCGTCAC GCGTGCGATC
TCCGACCCTG ATAACATCGA TGCCGAATTT GCCGTGCTGG TTCGCTCGGA TCTCAAAGGG
TTAGGCTTAG GTCGACGCTT AATGGAAAAG TTGATTACCT ATACGCGAGA TCACGGACTG
CAACGTCTGA ATGGTATTAC GATGCCAAAC AATCGTGGCA TGGTGGCGCT GGCCCGCAAG
CTCGGGTTTA ACGTTGATAT CCAGCTCGAA GAGGGGATCG TTGGGCTTAC GCTAAATCTT
GCCCAGCGCG AGGAATCATG A
 
Protein sequence
MSQRGLEALL RPKSIAVIGA SMKPNRAGYL MMRNLLAGGF NGPVLPVTPA WKAVLGVLAW 
PDIASLPFTP DLAVLCTNAS RNLALLEELG EKGCKTCIIL SAPSSQHEDL RACALRHNMR
LLGPNSLGLL APWQGLNASF SPVPIKRGKL AFISQSAAVS NTILDWAQQR EMGFSYFIAL
GDSLDIDVDE LLDYLARDSK TSAILLYLEQ LSDARRFVSA ARSASRNKPI LVIKSGRSPA
AQRLLNTTAG MDPAWDAAIQ RAGLLRVQDT HELFSAVETL SHMRPLRGDR LMIISNGAAP
AALALDALWS RNGKLATLSE ETCQKLRDAL PEHVAISNPL DLRDDASSEH YIKTLDILLH
SQDFDALMVI HSPSAAAPAT ESAQALIEAV KHHPRSKYVS LLTNWCGEHS SQEARRLFSE
AGLPTYRTPE GTITAFMHMV EYRRNQKQLR ETPALPSNLT SNTAEAHLLL QQAIAEGATS
LDTHEVQPIL QAYGMNTLPT WIASDSTEAV HIAEQIGYPV ALKLRSPDIP HKSEVQGVML
YLRTANEVQQ AANAIFDRVK MAWPQARIHG LLVQSMANRA GAQELRVVVE HDPVFGPLIM
LGEGGVEWRP EDQAVVALPP LNMNLARYLV IQGIKSKKIR ARSALRPLDV AGLSQLLVQV
SNLIVDCPEI QRLDIHPLLA SGSEFTALDV TLDIAPFEGD NESRLAVRPY PHQLEEWVEL
KNGERCLFRP ILPEDEPQLQ QFISRVTKED LYYRYFSEIN EFTHEDLANM TQIDYDREMA
FVAVRRIDQT EEILGVTRAI SDPDNIDAEF AVLVRSDLKG LGLGRRLMEK LITYTRDHGL
QRLNGITMPN NRGMVALARK LGFNVDIQLE EGIVGLTLNL AQREES
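
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any further analysis. The following is a minimal sketch, assuming the text is pasted in exactly as displayed; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample input is only the first row of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, used only as sample input.
row = "ATGAGTCAGC GAGGACTGGA AGCACTACTG CGACCAAAAT CGATTGCGGT AATTGGCGCA"
dna = clean_sequence(row)
print(len(dna))
print(round(gc_fraction(dna), 2))
```

Applying `gc_fraction` to the complete gene sequence should give a value comparable to the "GC content" field, although the record does not state how that figure was rounded.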