Gene Mpe_A0159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0159 
Symbol 
ID4784122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp166878 
End bp168464 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content66% 
IMG OID640088707 
Productmalate synthase 
Protein accessionYP_001019356 
Protein GI124265352 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCC TCCAACTGCC CGCCGGCCTG CAGATCAACG CGGCGATCCT GCCGGGCTTC 
GAGACCATCC TCACGCCGGC GGCGCTCGAG CTCGTGGCCA AGCTGCACCG CACCTTCGAG
CCCCGCCGCC AGGAACTGCT GGCCGCGCGC GTGGCACGCG TGAAGCGGCT CGATGCCGGC
GAGCGCCCCG ATTTCCTGCC GGAGACGAAG GCGATCCGCG AAGGCGACTG GAAGATCGCG
CCGATCCCGA AGGCGCTGGA GTGCCGTCGC GTCGAGATCA CCGGGCCGGT CGAGGCCAAG
ATGGTCATCA ACGCCTTCAA TTCGGGCGCT GATTCGTACA TGACCGACTT CGAGGACAGC
AACTCGCCGG TCTGGGCGAA CCAGATCCAG GGGCAGATCA ACCTGTACAA GGCGATCCGC
CGCACGCTGA CGCTGGGCCA GGCGGGCAAG ACCTACAAGC TCAACGACAA GATCGCGACG
CTGCAGGTGC GCCCGCGCGG CTGGCACCTC GACGAGAAGC ACGTGCTGGT CGACGGCCAG
CGCGTCTCCG GCGGCATCTT CGACTTCGCG CTATTCCTGT TCCACAACGC CAAGGAGCAG
ATCGAGCGCG GCGCCGGCCC GTTCTTCTAC CTGCCGAAGA TGGAGTCGCA CCTCGAGGCG
CGGCTGTGGA ACGACATCTT CATCGCCGCG CAGAAGGAGA TCGGCCTGCC GCAGGGCACG
ATCAAGGCGA CGGTGCTGAT CGAGACCATC CTCGCCGCGT TCGAGATGGA CGAGATCCTG
TACGAGCTGC GCGAGCACAG CGCCGGGCTG AATGCCGGCC GCTGGGACTA CATCTTCTCG
TGCATCAAGA AGTTCAAGAA CGACCGCGAC TTCTGCCTGG CCGACCGCGC CAAGGTCACG
ATGACGGCGC CCTTCATGCG CGCCTACGCG CTGCTGCTGC TCAAGACCTG CCACAAGCGC
GGCGCGCCCG CGATCGGCGG CATGAGCGCG CTGATCCCGA TCAAGAACGA CCCCGAGAAG
AACGCCATCG CGATGGCCGG GATCATCAAG GACAAGCGCC GCGACGCGAA CGACGGCTAC
GACGGTGGCT GGGTCGCGCA CCCGGGCCTG GTCGAGTCGG CGATGAAGGA GTTCGTCGAC
GTGCTGGGCG ATCGCCCGAA CCAGATCGAG CGCCAGAAGC CCTACACCCA CATCACCGCC
GCGCAACTGC TGGAGTTCGC GCCCGAGGCC CCGATCACCG AGGCCGGGCT GCGCATGAAC
ATCAACGTCG GCATCTATTA CCTGGCGTCC TGGCTGGCCG GCAACGGCTG CGTGCCGATC
TACAACCTGA TGGAGGACGC CGCCACCGCC GAGATCTCGC GCTCGCAGGT GTGGCAGTGG
ATCCGCAGTC CCAAGGGCGT GCTCGCCGAC GGCCGCAAGG TCACCGCCGA CATGGTGCGC
AAGATGGTGG CCGAGGAGCT CAAGGGCATC AAGGACGCCG GCTACGAAGG CCAGACGGTC
GACCGCGCGG CCGAGATCTT CGAGCAGATG AGCACCCAGG ATGCGTTCGC CGAGTTCCTG
ACGCTGCCGC TGTACGAAGA GATCTGA
 
Protein sequence
MTTLQLPAGL QINAAILPGF ETILTPAALE LVAKLHRTFE PRRQELLAAR VARVKRLDAG 
ERPDFLPETK AIREGDWKIA PIPKALECRR VEITGPVEAK MVINAFNSGA DSYMTDFEDS
NSPVWANQIQ GQINLYKAIR RTLTLGQAGK TYKLNDKIAT LQVRPRGWHL DEKHVLVDGQ
RVSGGIFDFA LFLFHNAKEQ IERGAGPFFY LPKMESHLEA RLWNDIFIAA QKEIGLPQGT
IKATVLIETI LAAFEMDEIL YELREHSAGL NAGRWDYIFS CIKKFKNDRD FCLADRAKVT
MTAPFMRAYA LLLLKTCHKR GAPAIGGMSA LIPIKNDPEK NAIAMAGIIK DKRRDANDGY
DGGWVAHPGL VESAMKEFVD VLGDRPNQIE RQKPYTHITA AQLLEFAPEA PITEAGLRMN
INVGIYYLAS WLAGNGCVPI YNLMEDAATA EISRSQVWQW IRSPKGVLAD GRKVTADMVR
KMVAEELKGI KDAGYEGQTV DRAAEIFEQM STQDAFAEFL TLPLYEEI