Gene ECH74115_4213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4213 
SymbolscpA 
ID6970508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3902723 
End bp3904867 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content55% 
IMG OID643387953 
Productmethylmalonyl-CoA mutase 
Protein accessionYP_002272392 
Protein GI209400202 
COG category[I] Lipid transport and metabolism 
COG ID[COG1884] Methylmalonyl-CoA mutase, N-terminal domain/subunit
[COG2185] Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) 
TIGRFAM ID[TIGR00640] methylmalonyl-CoA mutase C-terminal domain
[TIGR00641] methylmalonyl-CoA mutase N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAACG AGCAGGAGTG GCAACAGCTT GCCAACAAGG AATTGAGCCG TCGGGAGAAA 
ACTGTCGACT CGCTGGTTCA GCAAACCGCG GAAGGGATCG CCATCAAGCC GCTGTATACC
GAAGCCGATC TCGATAATCT GGAGGTGACA GGTACCCTTC CTGGTTTGCC GCCCTACGTT
CGTGGCCCGC GTGCCACTAT GTATACCGCC CAACCGTGGA CCATCCGTCA GTATGCTGGT
TTTTCAACAG CAAAAGAGTC CAACGCTTTT TATCGCCGTA ACCTGGCCGC CGGGCAAAAA
GGTCTTTCCG TTGCGTTTGA CCTTGCCACC CACCGTGGCT ACGACTCCGA TAACCCACGC
GTGGCGGGTG ACGTCGGCAA AACGGGCGTC GCTATCGACA CCGTGGAAGA TATGAAAGTC
CTGTTCGACC AGATCCCGCT GGATAAAATG TCGGTTTCGA TGACCATGAA TGGCGCAGTG
CTACCAGTAC TGGCGTTTTA TATCATCGCC GCGGAAGAGC AAGGTGTTAC ACCTGATAAA
CTGACCGGCA CCATTCAAAA CGATATTCTC AAAGAGTACC TCTGCCGCAA CACCTATATT
TACCCGCCAA AACCGTCAAT GCGCATTATC GCCGACATCA TCGCCTGGTG TTCCGGCAAC
ATGCCGCGAT TTAATACCAT CAGTATCAGC GGTTACCACA TGGGTGAAGC GGGTGCCAAC
TGCGTGCAGC AGGTAGCATT TACGCTCGCT GATGGGATTG AGTACATCAA AGCAGCAATC
TCTGCCGGAC TGAAAATTGA TGACTTCGCT CCTCGCCTGT CGTTCTTCTT CGGCATCGGC
ATGGATCTGT TTATGAACGT CGCCATGTTG CGTGCGGCAC GTTATTTATG GAGCGAAGCG
GTCAGTGGAT TTGGCGCACA GGACCCGAAA TCACTGGCGC TGCGTACCCA CTGCCAGACC
TCAGGCTGGA GCCTGACTGA ACAGGATCCG TATAACAACG TTATCCGCAC CACCATTGAA
GCGCTGGCTG CGACGCTGGG CGGTACTCAG TCACTGCATA CCAACGCCTT TGACGAAGCG
CTTGGTTTGC CTACCGATTT CTCAGCACGC ATTGCCCGCA ACACCCAGAT CATCATCCAG
GAAGAATCAG AACTCTGCCG CACCGTCGAT CCACTGGCCG GATCCTATTA CATTGAGTCG
CTGACCGATC AAATCGTCAA ACAAGCCAGA GCTATTATCC AACAGATCGA CGAAGCCGGT
GGCATGGCGA AAGCGATCGA AGCAGGTCTG CCAAAACGAA TGATCGAAGA GGCCTCAGCG
CGCGAACAGT CGCTGATCGA CCAGGGCAAG CGTGTCATCG TTGGTGTCAA CAAGTACAAA
CTGGATCACG AAGACGAAAC CGATGTACTT GAGATCGACA ACGTGATGGT GCGTAACGAG
CAAATTGCTT CGCTGGAACG CATTCGCGCC ACCCGTGATG ATGCCGCCGT AACCGCCGCG
TTGAACGCCC TGACTCACGC CGCACAGCAT AACGAAAACC TGCTGGCTGC CGCTGTTAAT
GCCGCTCGCG TTCGCGCCAC CCTGGGTGAA ATTTCCGATG CGCTGGAAGC CGCTTTCGAC
CGTTATCTGG TGCCAAGCCA GTGTGTTACC GGCGTGATTG CGCAAAGCTA TCATCAATCT
GAGAAATCGG CCTCCGAGTT CGATGCCATT GTTGCGCAAA CGGAGCAGTT CCTTGCCGAC
AATGGTCGTC GCCCGCGCAT TCTGATCGCC AAAATGGGCC TGGATGGACA CGATCGCGGC
GCGAAAGTGA TCGCCAGCGC CTATTCCGAT CTCGGTTTCG ACGTAGATTT AAGCCCGATG
TTCTCTACAC CTGAAGAGAT CGCCCGCCTG GCAGTAGAAA ATGACGTTCA CGTAGTGGGC
GCATCCTCAC TGGCTGCCGG TCATAAAACG CTGATCCCGG AACTGGTCGA AGCGCTGAAA
AAATGGGGAC GCGAAGATAT CTGCGTAGTC GCCGGTGGCG TCATTCCACC GCAGGATTAC
GCCTTCCTGC AAGAGCGCGG CGTGGCGGCG ATTTATGGTC CAGGTACACC TATGCTCGAC
AGTGTGCGCG ACGTACTGAA TCTGATAAGC CAGCATCATG ATTAA
 
Protein sequence
MSNEQEWQQL ANKELSRREK TVDSLVQQTA EGIAIKPLYT EADLDNLEVT GTLPGLPPYV 
RGPRATMYTA QPWTIRQYAG FSTAKESNAF YRRNLAAGQK GLSVAFDLAT HRGYDSDNPR
VAGDVGKTGV AIDTVEDMKV LFDQIPLDKM SVSMTMNGAV LPVLAFYIIA AEEQGVTPDK
LTGTIQNDIL KEYLCRNTYI YPPKPSMRII ADIIAWCSGN MPRFNTISIS GYHMGEAGAN
CVQQVAFTLA DGIEYIKAAI SAGLKIDDFA PRLSFFFGIG MDLFMNVAML RAARYLWSEA
VSGFGAQDPK SLALRTHCQT SGWSLTEQDP YNNVIRTTIE ALAATLGGTQ SLHTNAFDEA
LGLPTDFSAR IARNTQIIIQ EESELCRTVD PLAGSYYIES LTDQIVKQAR AIIQQIDEAG
GMAKAIEAGL PKRMIEEASA REQSLIDQGK RVIVGVNKYK LDHEDETDVL EIDNVMVRNE
QIASLERIRA TRDDAAVTAA LNALTHAAQH NENLLAAAVN AARVRATLGE ISDALEAAFD
RYLVPSQCVT GVIAQSYHQS EKSASEFDAI VAQTEQFLAD NGRRPRILIA KMGLDGHDRG
AKVIASAYSD LGFDVDLSPM FSTPEEIARL AVENDVHVVG ASSLAAGHKT LIPELVEALK
KWGREDICVV AGGVIPPQDY AFLQERGVAA IYGPGTPMLD SVRDVLNLIS QHHD