Gene EcSMS35_3053 details

Gene Information

Locus tag: EcSMS35_3053
Symbol: msm
ID: 6143445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3141616
End bp: 3143760
Gene Length: 2145 bp
Protein Length: 714 aa
Translation table: 11
GC content: 55%
IMG OID: 641617922
Product: methylmalonyl-CoA mutase
Protein accession: YP_001745073
Protein GI: 170683251
COG category: [I] Lipid transport and metabolism
COG ID: [COG1884] Methylmalonyl-CoA mutase, N-terminal domain/subunit
        [COG2185] Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding)
TIGRFAM ID: [TIGR00640] methylmalonyl-CoA mutase C-terminal domain
            [TIGR00641] methylmalonyl-CoA mutase N-terminal domain
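
The start and end coordinates, gene length, and protein length listed above are consistent with one another, which a short calculation can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(3141616, 3143760)
print(length)                     # 2145
print(protein_length_aa(length))  # 714
```

Both results match the Gene Length and Protein Length fields of this record.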


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAACG TGCAGGAGTG GCAACAGCTT GCCAACAAGG AATTGAGCCG TCGGGAGAAA 
ACTGTCGACT CGCTGGTTCA GCAAACCGCG GAAGGGATCG CCATCAAGCC GCTGTATACC
GAAGCCGATC TCGATAATCT GGAGGTGACA GGTACCCTTC CTGGTTTGCC CCCCTACGTT
CGTGGCCCGC GTGCCACTAT GTATACCGCC CAACCGTGGA CCATCCGTCA GTATGCTGGT
TTTTCAACAG CAAAAGAGTC CAACGCTTTT TATCGCCGTA ACCTGGCCGC CGGGCAAAAA
GGTCTTTCCG TTGCGTTTGA CCTTGCCACC CACCGAGGCT ACGACTCCGA TAACCCGCGC
GTGGCGGGCG ACGTCGGCAA AGCGGGCGTT GCTATCGACA CCGTGGAAGA TATGAAAGTC
CTGTTCGACC AGATCCCGCT GGATAAAATG TCGGTTTCGA TGACCATGAA TGGCGCAGTA
CTACCAGTAC TGGCGTTTTA TATTGTCGCC GCAGAAGAGC AAGGTGTTAC ACCCGATAAA
CTGACCGGCA CTATTCAAAA CGATATCCTC AAAGAGTATC TTTGCCGCAA CACCTATATT
TACCCGCCAA AACCGTCAAT GCGTATTATC GCAGACATCA TCGCCTGGTG TTCCGGCAAC
ATGCCGCGAT TTAATACCAT CAGTATCAGC GGTTACCATA TGGGTGAAGC GGGTGCCAAC
TGCGTGCAGC AGGTAGCATT TACGCTCGCT GATGGGATTG AGTACATCAA AGCAGCAATC
TCCGCCGGGC TGAAAATTGA TGACTTCGCT CCTCGCCTGT CGTTCTTCTT CGGCATTGGC
ATGGATCTGT TTATGAACGT CGCCATGTTG CGTGCGGCAC GTTATTTATG GAGCGAAGCG
GTCAGTGGAT TTGGCGCACA GGATCCGAAA TCACTGGCGC TGCGTACCCA CTGCCAGACC
TCAGGCTGGA GCCTGACTGA ACAGGATCCG TATAACAACG TTATCCGCAC CACCATTGAA
GCCCTGGCTG CAACGCTGGG CGGTACTCAG TCACTGCATA CCAACGCCTT TGATGAAGCG
CTTGGTTTGC CTACCGATTT CTCAGCACGC ATTGCCCGCA ATACCCAGAT CATCATTCAG
GAAGAATCAG AACTCTGCCG CACCGTCGAT CCACTGGCCG GATCCTATTA CGTTGAATCG
CTGACCGATC AAATCGTCAA ACAAGCCAGA GCCATTATCC AACAGATCGA CGAAGCCGGT
GGCATGGCGA AAGCGATCGA AGCAGGCCTG CCAAAACGAA TGATCGAAGA GGCCTCAGCG
CGCGAGCAGT CGCTGATCGA CCAGGGCAAG CGTGTCATCG TTGGTGTCAA CAAGTACAAA
CTGGATCACG AAGACGAAAC CGACGTCCTT GAGATCGACA ACGTGATGGT GCGTAACGAG
CAGATTGCTT CACTGGAACG CATTCGCGCC ACCCGTGATG ATGCTGCCGT AACCGCCACG
TTGAACGCCC TGACTCACGC CGCACAGCAT AACGAAAACC TGCTGGCTGC CGCTGTTAAT
GCCGCTCGCG TTCGCGCAAC GCTTGGTGAA ATTTCCGATG CGCTGGAAGC GGCATTTGAC
CGTTATCTGG TGCCAAGCCA GTGTGTTACC GGCGTGATTG CGCAAAGCTA TCATCAGTCT
GAGAAATCGG CCACCGAGTT CGATGCCATT GTTGCGCAAA CGGAGCAGTT CCTTGCCGAC
AATGGTCGTC GTCCGCGCAT TCTGATCGCC AAAATGGGCC AGGATGGGCA CGATCGCGGC
GCGAAAGTGA TCGCCAGCGC CTATTCCGAT CTCGGTTTCG ACGTAGATTT AAGCCCGATG
TTCTCTACAC CTGAAGAGAT CGCCCGCCTG GCAGTAGAAA ACGACGTTCA CGTAGTGGGC
GCATCTTCAC TGGCTGCCGG TCATAAGACG CTGATCCCGG AACTGGTCGA AGCGCTGAAA
AAATGGGGAC GCGAAGATAT CTGCGTGGTC GCGGGTGGCG TCATTCCACC GCAGGATTAC
GCCTTCCTGC AAGAGCGCGG CGTGGCGGCG ATTTATGGTC CAGGTACGCC TATGCTCGAC
AGCGTGCGCG ACGTACTGAA TCTAATAAGC CAGCATCATG ATTAA
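
The GC content reported for this gene can be recomputed from the sequence block above. The following is a minimal sketch, assuming the sequence is pasted as text in the same space-separated, 10-base grouping; `gc_percent` is a hypothetical helper name. Only the first line of the gene sequence is used here for brevity, so the printed value describes that excerpt and not the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence shown above (60 bases).
first_line = "ATGTCTAACG TGCAGGAGTG GCAACAGCTT GCCAACAAGG AATTGAGCCG TCGGGAGAAA"
print(round(gc_percent(first_line), 1))
```

Passing the full 2145 bp sequence to the same function gives the gene-wide value to compare with the GC content field.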
 
Protein sequence
MSNVQEWQQL ANKELSRREK TVDSLVQQTA EGIAIKPLYT EADLDNLEVT GTLPGLPPYV 
RGPRATMYTA QPWTIRQYAG FSTAKESNAF YRRNLAAGQK GLSVAFDLAT HRGYDSDNPR
VAGDVGKAGV AIDTVEDMKV LFDQIPLDKM SVSMTMNGAV LPVLAFYIVA AEEQGVTPDK
LTGTIQNDIL KEYLCRNTYI YPPKPSMRII ADIIAWCSGN MPRFNTISIS GYHMGEAGAN
CVQQVAFTLA DGIEYIKAAI SAGLKIDDFA PRLSFFFGIG MDLFMNVAML RAARYLWSEA
VSGFGAQDPK SLALRTHCQT SGWSLTEQDP YNNVIRTTIE ALAATLGGTQ SLHTNAFDEA
LGLPTDFSAR IARNTQIIIQ EESELCRTVD PLAGSYYVES LTDQIVKQAR AIIQQIDEAG
GMAKAIEAGL PKRMIEEASA REQSLIDQGK RVIVGVNKYK LDHEDETDVL EIDNVMVRNE
QIASLERIRA TRDDAAVTAT LNALTHAAQH NENLLAAAVN AARVRATLGE ISDALEAAFD
RYLVPSQCVT GVIAQSYHQS EKSATEFDAI VAQTEQFLAD NGRRPRILIA KMGQDGHDRG
AKVIASAYSD LGFDVDLSPM FSTPEEIARL AVENDVHVVG ASSLAAGHKT LIPELVEALK
KWGREDICVV AGGVIPPQDY AFLQERGVAA IYGPGTPMLD SVRDVLNLIS QHHD
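
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming the codon-to-amino-acid assignments of table 11, which are the same as those of the standard genetic code (the two tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical, and the codon table is built from the conventional TCAG-ordered amino acid string.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above (60 bases, 20 codons).
first_line = "ATGTCTAACG TGCAGGAGTG GCAACAGCTT GCCAACAAGG AATTGAGCCG TCGGGAGAAA"
print(translate(first_line))  # MSNVQEWQQLANKELSRREK
```

The output matches the first 20 residues of the protein sequence listed above.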