Gene GM21_0572 details


Gene Information

Locus tag: GM21_0572
Symbol:
ID: 8135887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 697112
End bp: 698764
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 62%
IMG OID: 644868189
Product: methylmalonyl-CoA mutase, large subunit
Protein accession: YP_003020404
Protein GI: 253699215
COG category: [I] Lipid transport and metabolism
COG ID: [COG1884] Methylmalonyl-CoA mutase, N-terminal domain/subunit
TIGRFAM ID: [TIGR00641] methylmalonyl-CoA mutase N-terminal domain
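
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length_bp(697112, 698764)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields reported in the record.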


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0000000000000115409
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCATTG AAGAGAAAAA AAAGGCATGG CAGGCAGCAG CTGAGAAGAG CATCGCCAAG 
GCGCCGGAGC GCAAAGGCTC CTTCCGCAAC AGCTCCGATA TCGAGCTGGA TCGTTGCTTC
GCCCCCGAGT TCGACTATCC GGGGTATGAG GAGAACCTCG GTTTCCCCGG GCAGTACCCC
TTCACCCGCG GCGTGCAGCC GACCATGTAC CGCGGCAGGT TCTGGACCAT GCGCCAGTAC
GCCGGTTTCG GCAATGCGGC CGAGTCGAAC GAGCGCTACA AGTACCTGCT CTCCGCCGGT
CAGACCGGCC TCTCCATCGC CTTCGACCTC CCCACCCAGA TGGGATACGA TTCCGACGCT
TCGATGAGCC GCGGGGAGGT GGGCAAAGTC GGGGTAGCCA TCGACTCCCT GGCCGACATG
GAGATCCTGT TCGACGGCAT CCCGCTGGAC AAGGTCTCGA CCTCGATGAC CATCAACTCC
ACCGCGTCCA TCCTGCTTGC CATGTACATC GCGGTCGCCG AGAAGCAGGG GGTTTCCGCC
GACAAGATCT CCGGCACCAT CCAAAACGAC ATACTTAAGG AGTACATGGC GCGCGGCACC
TACATCTATC CGCCCAAGGA GTCGATGAGG ATCATCACCG ACATCTTTGC CTACTGCAAG
GACAACGTCC CCAAGTGGAA CACCATCAGC ATCTCCGGCT ACCACATCCG CGAGGCGGGT
TCCTCCGCGG TCCAGGAAGT CGCCTTCACC CTGGCAGACG GCATCGCCTA CGTCGAGGCG
GCGGTCAAGG CCGGACTCGA CGTCGACGAG TTCGCGCCGC GCTTGGCCTT CTTCTTCAAC
GCGCACAACG ACCTGCTCGA GGAGGTGGCC AAGTTCCGCG CCGCCCGCCG CATGTGGTCC
CGCATCATGC GTGACCGTTT CAAGGCGAAA GACCCGCGCT CTCAGATGCT GCGCTTCCAC
ACCCAGACCG CCGGCTGCAC CCTCACCGCA CAGCAGCCCG ACAACAACAT CATGCGCGTC
ACCCTGCAGG CGCTCGCCGC CGTGTTGGGC GGCACCCAGT CGCTGCACAC CAACTCGCGC
GACGAGGCGC TCGCGCTTCC CACCGAGGAA TCGGTCAGGA TCGCGCTGCG CACCCAGCAG
GTCATCGCCT ACGAATCGGG TGTTGCCGAC TCCATCGACC CCTTGGCCGG CTCCTTCATG
GTCGAGGCGC TGACCGACAA GATCGAGGCG CAGGCGCTTG CCTACATCGA GAAGATCGAC
GCACTCGGCG GCGCGGTCGA AGCGATCTCC CGCGGGTTCC AGCAGAAGGA GATCCAGGAT
TCCGCCTACG CCTACCAGCG CGCCATCGAG AAGAACGAGA CCATCATCGT CGGCGTGAAC
AAGTTCACCG TCCAGGAAGG CGCGCCCCAG GGGCTGCTGA AGGTGACCGA CGAGGTCGAG
GTGAAGCAGA AGGCTTCTCT TGGCAAGCTG AAAGAGGGCC GCGACAACGC CAAGGTGGAG
GCTTCCCTGA AGGCGCTGGA GCAGGCGGCA CGCGGGACCC AGAACCTGAT GCCGTTCATC
CTAGATGCAG TAAAGACTTA CGCTACCTTG GGCGAGATCG CCAACGTAAT GAGGGAAGTC
TTCGGCATTC ACAGGGAGAC GGTTGTCCTT TAA
 
Protein sequence
MSIEEKKKAW QAAAEKSIAK APERKGSFRN SSDIELDRCF APEFDYPGYE ENLGFPGQYP 
FTRGVQPTMY RGRFWTMRQY AGFGNAAESN ERYKYLLSAG QTGLSIAFDL PTQMGYDSDA
SMSRGEVGKV GVAIDSLADM EILFDGIPLD KVSTSMTINS TASILLAMYI AVAEKQGVSA
DKISGTIQND ILKEYMARGT YIYPPKESMR IITDIFAYCK DNVPKWNTIS ISGYHIREAG
SSAVQEVAFT LADGIAYVEA AVKAGLDVDE FAPRLAFFFN AHNDLLEEVA KFRAARRMWS
RIMRDRFKAK DPRSQMLRFH TQTAGCTLTA QQPDNNIMRV TLQALAAVLG GTQSLHTNSR
DEALALPTEE SVRIALRTQQ VIAYESGVAD SIDPLAGSFM VEALTDKIEA QALAYIEKID
ALGGAVEAIS RGFQQKEIQD SAYAYQRAIE KNETIIVGVN KFTVQEGAPQ GLLKVTDEVE
VKQKASLGKL KEGRDNAKVE ASLKALEQAA RGTQNLMPFI LDAVKTYATL GEIANVMREV
FGIHRETVVL
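
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the gene sequence, and compute its GC fraction for comparison with the reported protein sequence and GC content. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch does not handle); all names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = strip_blocks(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = strip_blocks(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence from the record above.
print(translate("ATGAGCATTG AAGAGAAAAA AAAGGCATGG CAGGCAGCAG CTGAGAAGAG CATCGCCAAG"))
print(translate("TTCGGCATTC ACAGGGAGAC GGTTGTCCTT TAA"))
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a direct comparison with the protein sequence and the GC content listed in the record.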