Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3075 |
Symbol | msm |
ID | 5594128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3088361 |
End bp | 3090505 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640922194 |
Product | methylmalonyl-CoA mutase |
Protein accession | YP_001459694 |
Protein GI | 157162376 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1884] Methylmalonyl-CoA mutase, N-terminal domain/subunit [COG2185] Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) |
TIGRFAM ID | [TIGR00640] methylmalonyl-CoA mutase C-terminal domain [TIGR00641] methylmalonyl-CoA mutase N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAACG TGCAGGAGTG GCAACAGCTT GCCAACAAGG AATTGAGCCG TCGGGAGAAA ACTGTCGACT CGCTGGTTCA TCAAACCGCG GAAGGGATCG CCATCAAGCC GCTGTATACC GAAGCCGATC TCGATAATCT GGAGGTGACA GGTACCCTTC CTGGTTTGCC GCCCTACGTT CGTGGCCCGC GTGCCACTAT GTATACCGCC CAACCGTGGA CCATCCGTCA GTATGCTGGT TTTTCAACAG CAAAAGAGTC CAACGCTTTT TATCGCCGTA ACCTGGCCGC CGGGCAAAAA GGTCTTTCCG TTGCGTTTGA CCTTGCCACC CACCGTGGCT ACGACTCCGA TAACCCGCGC GTGGCGGGCG ACGTCGGCAA AGCGGGCGTC GCTATCGACA CCGTGGAAGA TATGAAAGTC CTGTTCGACC AGATCCCGCT GGATAAAATG TCGGTTTCGA TGACCATGAA TGGCGCAGTG CTACCAGTAC TGGCGTTTTA TATCGTCGCC GCAGAAGAGC AAGGTGTTAC ACCTGATAAA CTGACCGGCA CCATTCAAAA CGATATTCTC AAAGAGTACC TCTGCCGCAA CACCTATATT TACCCACCAA AACCGTCAAT GCGCATTATC GCCGACATCA TCGCCTGGTG TTCCGGCAAC ATGCCGCGAT TTAATACCAT CAGTATCAGC GGTTACCACA TGGGGGAAGC GGGTGCCAAC TGCGTGCAGC AGGTAGCATT TACGCTCGCT GATGGGATTG AGTACATCAA AGCAGCAATC TCTGCCGGAC TGAAAATTGA TGACTTCGCT CCTCGCCTGT CGTTCTTCTT CGGTATCGGC ATGGATCTGT TTATGAACGT CGCCATGTTG CGTGCGGCAC GTTATTTATG GAGCGAAGCG GTCAGTGGAT TTGGCGCACA GGATCCGAAA TCACTGGCGC TGCGTACCCA CTGCCAGACC TCAGGCTGGA GCCTGACTGA ACAGGATCCG TATAACAACG TTATCCGCAC CACCATTGAA GCACTGGCTG CGACGCTGGG CGGTACTCAG TCACTGCATA CCAACGCCTT TGACGAAGCG CTTGGTTTGC CTACCGATTT CTCAGCACGC ATTGCCCGCA ACACCCAGAT CATCATCCAG GAAGAATCAG AACTCTGCCG CACCGTCGAT CCACTGGCCG GATCCTATTA CATTGAGTCG CTGACCGATC AAATCGTCAA ACAAGCCAGA GCTATTATCC AACAGATCGA CGAAGCCGGT GGCATGGCGA AAGCGATCGA AGCAGGTCTG CCAAAACGAA TGATCGAAGA GGCCTCAGCG CGCGAACAGT CGCTGATCGA CCAGGGCAAG CGTGTCATCG TTGGTGTCAA CAAGTACAAA CTGGATCACG AAGACGAAAC CGATGTACTT GAGATCGACA ACGTGATGGT GCGTAACGAG CAAATTGCTT CGCTGGAACG CATTCGCGCC ACCCGTGATG ATGCCGCCGT AACCGCCGCG TTGAACGCCC TGACTCACGC CGCACAGCAT AACGAAAACC TGCTGGCTGC CGCTGTTAAT GCCGCTCGCG TTCGCGCCAC CCTGGGTGAA ATTTCCGATG CGCTGGAAGT CGCTTTCGAC CGTTATCTGG TGCCAAGCCA GTGTGTTACC GGCGTGATTG CGCAAAGCTA TCATCAGTCT GAGAAATCGG CCTCCGAGTT CGATGCCATT GTTGCGCAAA CGGAGCAGTT CCTTGCCGAC AATGGTCGTC GCCCGCGCAT TCTGATCGCT AAGATGGGCC AGGATGGACA CGATCGCGGC GCGAAAGTGA TCGCCAGCGC CTATTCCGAT CTCGGTTTCG ACGTAGATTT AAGCCCGATG TTCTCTACAC CTGAAGAGAT CGCCCGCCTG GCCGTAGAAA ACGACGTTCA CGTAGTGGGC GCATCCTCAC TGGCTGCCGG TCATAAAACG CTGATCCCGG AACTGGTCGA AGCGCTGAAA AAATGGGGAC GCGAAGATAT CTGCGTGGTC GTGGGTGGCG TCATTCCGCC GCAGGATTAC GCCTTCCTGC AAGAGCGCGG CGTGGCGGCG ATTTATGGTC CAGGTACACC TATGCTCGAC AGTGTGCGCG ACGTACTGAA TCTGATAAGC CAGCATCATG ATTAA
|
Protein sequence | MSNVQEWQQL ANKELSRREK TVDSLVHQTA EGIAIKPLYT EADLDNLEVT GTLPGLPPYV RGPRATMYTA QPWTIRQYAG FSTAKESNAF YRRNLAAGQK GLSVAFDLAT HRGYDSDNPR VAGDVGKAGV AIDTVEDMKV LFDQIPLDKM SVSMTMNGAV LPVLAFYIVA AEEQGVTPDK LTGTIQNDIL KEYLCRNTYI YPPKPSMRII ADIIAWCSGN MPRFNTISIS GYHMGEAGAN CVQQVAFTLA DGIEYIKAAI SAGLKIDDFA PRLSFFFGIG MDLFMNVAML RAARYLWSEA VSGFGAQDPK SLALRTHCQT SGWSLTEQDP YNNVIRTTIE ALAATLGGTQ SLHTNAFDEA LGLPTDFSAR IARNTQIIIQ EESELCRTVD PLAGSYYIES LTDQIVKQAR AIIQQIDEAG GMAKAIEAGL PKRMIEEASA REQSLIDQGK RVIVGVNKYK LDHEDETDVL EIDNVMVRNE QIASLERIRA TRDDAAVTAA LNALTHAAQH NENLLAAAVN AARVRATLGE ISDALEVAFD RYLVPSQCVT GVIAQSYHQS EKSASEFDAI VAQTEQFLAD NGRRPRILIA KMGQDGHDRG AKVIASAYSD LGFDVDLSPM FSTPEEIARL AVENDVHVVG ASSLAAGHKT LIPELVEALK KWGREDICVV VGGVIPPQDY AFLQERGVAA IYGPGTPMLD SVRDVLNLIS QHHD
|
| |