Gene Information |
Locus tag | EcSMS35_3053 |
Symbol | msm |
ID | 6143445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3141616 |
End bp | 3143760 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617922 |
Product | methylmalonyl-CoA mutase |
Protein accession | YP_001745073 |
Protein GI | 170683251 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1884] Methylmalonyl-CoA mutase, N-terminal domain/subunit [COG2185] Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) |
TIGRFAM ID | [TIGR00640] methylmalonyl-CoA mutase C-terminal domain [TIGR00641] methylmalonyl-CoA mutase N-terminal domain |
|
|
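The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3141616, 3143760)
print(length)                  # 2145, matching "Gene Length"
print(protein_length(length))  # 714, matching "Protein Length"
```

Both results agree with the Gene Length (2145 bp) and Protein Length (714 aa) rows of the record.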
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAACG TGCAGGAGTG GCAACAGCTT GCCAACAAGG AATTGAGCCG TCGGGAGAAA ACTGTCGACT CGCTGGTTCA GCAAACCGCG GAAGGGATCG CCATCAAGCC GCTGTATACC GAAGCCGATC TCGATAATCT GGAGGTGACA GGTACCCTTC CTGGTTTGCC CCCCTACGTT CGTGGCCCGC GTGCCACTAT GTATACCGCC CAACCGTGGA CCATCCGTCA GTATGCTGGT TTTTCAACAG CAAAAGAGTC CAACGCTTTT TATCGCCGTA ACCTGGCCGC CGGGCAAAAA GGTCTTTCCG TTGCGTTTGA CCTTGCCACC CACCGAGGCT ACGACTCCGA TAACCCGCGC GTGGCGGGCG ACGTCGGCAA AGCGGGCGTT GCTATCGACA CCGTGGAAGA TATGAAAGTC CTGTTCGACC AGATCCCGCT GGATAAAATG TCGGTTTCGA TGACCATGAA TGGCGCAGTA CTACCAGTAC TGGCGTTTTA TATTGTCGCC GCAGAAGAGC AAGGTGTTAC ACCCGATAAA CTGACCGGCA CTATTCAAAA CGATATCCTC AAAGAGTATC TTTGCCGCAA CACCTATATT TACCCGCCAA AACCGTCAAT GCGTATTATC GCAGACATCA TCGCCTGGTG TTCCGGCAAC ATGCCGCGAT TTAATACCAT CAGTATCAGC GGTTACCATA TGGGTGAAGC GGGTGCCAAC TGCGTGCAGC AGGTAGCATT TACGCTCGCT GATGGGATTG AGTACATCAA AGCAGCAATC TCCGCCGGGC TGAAAATTGA TGACTTCGCT CCTCGCCTGT CGTTCTTCTT CGGCATTGGC ATGGATCTGT TTATGAACGT CGCCATGTTG CGTGCGGCAC GTTATTTATG GAGCGAAGCG GTCAGTGGAT TTGGCGCACA GGATCCGAAA TCACTGGCGC TGCGTACCCA CTGCCAGACC TCAGGCTGGA GCCTGACTGA ACAGGATCCG TATAACAACG TTATCCGCAC CACCATTGAA GCCCTGGCTG CAACGCTGGG CGGTACTCAG TCACTGCATA CCAACGCCTT TGATGAAGCG CTTGGTTTGC CTACCGATTT CTCAGCACGC ATTGCCCGCA ATACCCAGAT CATCATTCAG GAAGAATCAG AACTCTGCCG CACCGTCGAT CCACTGGCCG GATCCTATTA CGTTGAATCG CTGACCGATC AAATCGTCAA ACAAGCCAGA GCCATTATCC AACAGATCGA CGAAGCCGGT GGCATGGCGA AAGCGATCGA AGCAGGCCTG CCAAAACGAA TGATCGAAGA GGCCTCAGCG CGCGAGCAGT CGCTGATCGA CCAGGGCAAG CGTGTCATCG TTGGTGTCAA CAAGTACAAA CTGGATCACG AAGACGAAAC CGACGTCCTT GAGATCGACA ACGTGATGGT GCGTAACGAG CAGATTGCTT CACTGGAACG CATTCGCGCC ACCCGTGATG ATGCTGCCGT AACCGCCACG TTGAACGCCC TGACTCACGC CGCACAGCAT AACGAAAACC TGCTGGCTGC CGCTGTTAAT GCCGCTCGCG TTCGCGCAAC GCTTGGTGAA ATTTCCGATG CGCTGGAAGC GGCATTTGAC CGTTATCTGG TGCCAAGCCA GTGTGTTACC GGCGTGATTG CGCAAAGCTA TCATCAGTCT GAGAAATCGG CCACCGAGTT CGATGCCATT GTTGCGCAAA CGGAGCAGTT CCTTGCCGAC AATGGTCGTC GTCCGCGCAT TCTGATCGCC AAAATGGGCC AGGATGGGCA CGATCGCGGC 
GCGAAAGTGA TCGCCAGCGC CTATTCCGAT CTCGGTTTCG ACGTAGATTT AAGCCCGATG TTCTCTACAC CTGAAGAGAT CGCCCGCCTG GCAGTAGAAA ACGACGTTCA CGTAGTGGGC GCATCTTCAC TGGCTGCCGG TCATAAGACG CTGATCCCGG AACTGGTCGA AGCGCTGAAA AAATGGGGAC GCGAAGATAT CTGCGTGGTC GCGGGTGGCG TCATTCCACC GCAGGATTAC GCCTTCCTGC AAGAGCGCGG CGTGGCGGCG ATTTATGGTC CAGGTACGCC TATGCTCGAC AGCGTGCGCG ACGTACTGAA TCTAATAAGC CAGCATCATG ATTAA
|
Protein sequence | MSNVQEWQQL ANKELSRREK TVDSLVQQTA EGIAIKPLYT EADLDNLEVT GTLPGLPPYV RGPRATMYTA QPWTIRQYAG FSTAKESNAF YRRNLAAGQK GLSVAFDLAT HRGYDSDNPR VAGDVGKAGV AIDTVEDMKV LFDQIPLDKM SVSMTMNGAV LPVLAFYIVA AEEQGVTPDK LTGTIQNDIL KEYLCRNTYI YPPKPSMRII ADIIAWCSGN MPRFNTISIS GYHMGEAGAN CVQQVAFTLA DGIEYIKAAI SAGLKIDDFA PRLSFFFGIG MDLFMNVAML RAARYLWSEA VSGFGAQDPK SLALRTHCQT SGWSLTEQDP YNNVIRTTIE ALAATLGGTQ SLHTNAFDEA LGLPTDFSAR IARNTQIIIQ EESELCRTVD PLAGSYYVES LTDQIVKQAR AIIQQIDEAG GMAKAIEAGL PKRMIEEASA REQSLIDQGK RVIVGVNKYK LDHEDETDVL EIDNVMVRNE QIASLERIRA TRDDAAVTAT LNALTHAAQH NENLLAAAVN AARVRATLGE ISDALEAAFD RYLVPSQCVT GVIAQSYHQS EKSATEFDAI VAQTEQFLAD NGRRPRILIA KMGQDGHDRG AKVIASAYSD LGFDVDLSPM FSTPEEIARL AVENDVHVVG ASSLAAGHKT LIPELVEALK KWGREDICVV AGGVIPPQDY AFLQERGVAA IYGPGTPMLD SVRDVLNLIS QHHD