Gene EcHS_A3075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3075 
Symbolmsm 
ID5594128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3088361 
End bp3090505 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content55% 
IMG OID640922194 
Productmethylmalonyl-CoA mutase 
Protein accessionYP_001459694 
Protein GI157162376 
COG category[I] Lipid transport and metabolism 
COG ID[COG1884] Methylmalonyl-CoA mutase, N-terminal domain/subunit
[COG2185] Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) 
TIGRFAM ID[TIGR00640] methylmalonyl-CoA mutase C-terminal domain
[TIGR00641] methylmalonyl-CoA mutase N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAACG TGCAGGAGTG GCAACAGCTT GCCAACAAGG AATTGAGCCG TCGGGAGAAA 
ACTGTCGACT CGCTGGTTCA TCAAACCGCG GAAGGGATCG CCATCAAGCC GCTGTATACC
GAAGCCGATC TCGATAATCT GGAGGTGACA GGTACCCTTC CTGGTTTGCC GCCCTACGTT
CGTGGCCCGC GTGCCACTAT GTATACCGCC CAACCGTGGA CCATCCGTCA GTATGCTGGT
TTTTCAACAG CAAAAGAGTC CAACGCTTTT TATCGCCGTA ACCTGGCCGC CGGGCAAAAA
GGTCTTTCCG TTGCGTTTGA CCTTGCCACC CACCGTGGCT ACGACTCCGA TAACCCGCGC
GTGGCGGGCG ACGTCGGCAA AGCGGGCGTC GCTATCGACA CCGTGGAAGA TATGAAAGTC
CTGTTCGACC AGATCCCGCT GGATAAAATG TCGGTTTCGA TGACCATGAA TGGCGCAGTG
CTACCAGTAC TGGCGTTTTA TATCGTCGCC GCAGAAGAGC AAGGTGTTAC ACCTGATAAA
CTGACCGGCA CCATTCAAAA CGATATTCTC AAAGAGTACC TCTGCCGCAA CACCTATATT
TACCCACCAA AACCGTCAAT GCGCATTATC GCCGACATCA TCGCCTGGTG TTCCGGCAAC
ATGCCGCGAT TTAATACCAT CAGTATCAGC GGTTACCACA TGGGGGAAGC GGGTGCCAAC
TGCGTGCAGC AGGTAGCATT TACGCTCGCT GATGGGATTG AGTACATCAA AGCAGCAATC
TCTGCCGGAC TGAAAATTGA TGACTTCGCT CCTCGCCTGT CGTTCTTCTT CGGTATCGGC
ATGGATCTGT TTATGAACGT CGCCATGTTG CGTGCGGCAC GTTATTTATG GAGCGAAGCG
GTCAGTGGAT TTGGCGCACA GGATCCGAAA TCACTGGCGC TGCGTACCCA CTGCCAGACC
TCAGGCTGGA GCCTGACTGA ACAGGATCCG TATAACAACG TTATCCGCAC CACCATTGAA
GCACTGGCTG CGACGCTGGG CGGTACTCAG TCACTGCATA CCAACGCCTT TGACGAAGCG
CTTGGTTTGC CTACCGATTT CTCAGCACGC ATTGCCCGCA ACACCCAGAT CATCATCCAG
GAAGAATCAG AACTCTGCCG CACCGTCGAT CCACTGGCCG GATCCTATTA CATTGAGTCG
CTGACCGATC AAATCGTCAA ACAAGCCAGA GCTATTATCC AACAGATCGA CGAAGCCGGT
GGCATGGCGA AAGCGATCGA AGCAGGTCTG CCAAAACGAA TGATCGAAGA GGCCTCAGCG
CGCGAACAGT CGCTGATCGA CCAGGGCAAG CGTGTCATCG TTGGTGTCAA CAAGTACAAA
CTGGATCACG AAGACGAAAC CGATGTACTT GAGATCGACA ACGTGATGGT GCGTAACGAG
CAAATTGCTT CGCTGGAACG CATTCGCGCC ACCCGTGATG ATGCCGCCGT AACCGCCGCG
TTGAACGCCC TGACTCACGC CGCACAGCAT AACGAAAACC TGCTGGCTGC CGCTGTTAAT
GCCGCTCGCG TTCGCGCCAC CCTGGGTGAA ATTTCCGATG CGCTGGAAGT CGCTTTCGAC
CGTTATCTGG TGCCAAGCCA GTGTGTTACC GGCGTGATTG CGCAAAGCTA TCATCAGTCT
GAGAAATCGG CCTCCGAGTT CGATGCCATT GTTGCGCAAA CGGAGCAGTT CCTTGCCGAC
AATGGTCGTC GCCCGCGCAT TCTGATCGCT AAGATGGGCC AGGATGGACA CGATCGCGGC
GCGAAAGTGA TCGCCAGCGC CTATTCCGAT CTCGGTTTCG ACGTAGATTT AAGCCCGATG
TTCTCTACAC CTGAAGAGAT CGCCCGCCTG GCCGTAGAAA ACGACGTTCA CGTAGTGGGC
GCATCCTCAC TGGCTGCCGG TCATAAAACG CTGATCCCGG AACTGGTCGA AGCGCTGAAA
AAATGGGGAC GCGAAGATAT CTGCGTGGTC GTGGGTGGCG TCATTCCGCC GCAGGATTAC
GCCTTCCTGC AAGAGCGCGG CGTGGCGGCG ATTTATGGTC CAGGTACACC TATGCTCGAC
AGTGTGCGCG ACGTACTGAA TCTGATAAGC CAGCATCATG ATTAA
 
Protein sequence
MSNVQEWQQL ANKELSRREK TVDSLVHQTA EGIAIKPLYT EADLDNLEVT GTLPGLPPYV 
RGPRATMYTA QPWTIRQYAG FSTAKESNAF YRRNLAAGQK GLSVAFDLAT HRGYDSDNPR
VAGDVGKAGV AIDTVEDMKV LFDQIPLDKM SVSMTMNGAV LPVLAFYIVA AEEQGVTPDK
LTGTIQNDIL KEYLCRNTYI YPPKPSMRII ADIIAWCSGN MPRFNTISIS GYHMGEAGAN
CVQQVAFTLA DGIEYIKAAI SAGLKIDDFA PRLSFFFGIG MDLFMNVAML RAARYLWSEA
VSGFGAQDPK SLALRTHCQT SGWSLTEQDP YNNVIRTTIE ALAATLGGTQ SLHTNAFDEA
LGLPTDFSAR IARNTQIIIQ EESELCRTVD PLAGSYYIES LTDQIVKQAR AIIQQIDEAG
GMAKAIEAGL PKRMIEEASA REQSLIDQGK RVIVGVNKYK LDHEDETDVL EIDNVMVRNE
QIASLERIRA TRDDAAVTAA LNALTHAAQH NENLLAAAVN AARVRATLGE ISDALEVAFD
RYLVPSQCVT GVIAQSYHQS EKSASEFDAI VAQTEQFLAD NGRRPRILIA KMGQDGHDRG
AKVIASAYSD LGFDVDLSPM FSTPEEIARL AVENDVHVVG ASSLAAGHKT LIPELVEALK
KWGREDICVV VGGVIPPQDY AFLQERGVAA IYGPGTPMLD SVRDVLNLIS QHHD