Gene Mpe_A3742 details


Gene Information

Locus tag: Mpe_A3742
Symbol:
ID: 4786031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3961276
End bp: 3962922
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 73%
IMG OID: 640092325
Product: putative phospholipase D protein
Protein accession: YP_001022930
Protein GI: 124268926
COG category: [I] Lipid transport and metabolism
COG ID: [COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes
TIGRFAM ID:
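
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon, which is consistent with the values in this record. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 3961276
END_BP = 3962922


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1647
print(protein_length(length))  # 548
```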


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.135454
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGCGG CATGGCACCA TCGGCGCGAA GCCGCCGCCA TGCCCTCGTT CGCGTCCGAC 
CTGCCGACCC CGCGCCCTCG CCACCGCGTT GTCCCGCTGA CGCTGTGGCG GGTGGCGATG
CTGCTGGCCT GGCTCGGGCT GGCGGGCTGC TCCTCGCTGC CCTCGCGGCC ACCGTTGCCC
GACGAGGCCG CGCTGCCCGC GGCGACCGCC GGCGCGCTGG CCGCGCAGGT CGCGCCGCTG
GTGCAGGCGC ACCCCGGCCG CTCCGGCTTC GTGCCGCTCG ACAGCGGGCT CGACGCCTTC
GCCGCACGCG TCTGGCTGGT CGACCGCGCC GGCAGCAGCC TGGACATCCA GACCTACATC
TGGCGCAGCG ACCGCACCGG CCGTTGGCTG CTGACGCGCC TGCAGGCCGC GGCCGAGCGC
GGCGTGCGGG TGCGCCTGCT GCTCGACGAC GGCAACGGCA GCCCCACGCT CGACGGGCTG
CTGTCGCAGC TCGATGCCCA CCCGGGCGCC GAGGTGCGCT TCTTCAACCC CTATCCGCAC
CGCGGCCTGG GCCGCGCGTG GGACCTGGCC ACCGACTTCT CGCGCCTGCA CCGGCGCATG
CACAACAAGA CGTTCAATGC CGACGGCGTG GTCACCATCC TCGGCGGACG CAATGTCGGC
GACGAGTACT TCGGCGCGGC GGCCGACATG GAGTTCGCCG ACCTCGATGT GCTCGCGGTC
GGGCCGATCG TCGGTGAGGT GTCGGCGTCC TTCGACGCCT ACTGGAACAG CGCCTCGGCC
TACCCGCGGC GGTCGCTGCT GTTGCCCGGC AACGACCGGC CGCTCACGCC CGAGCAGCAG
GCGGCGCTCG ACGAGGCCAG CGCCTACGCG AAGGACCTGC GCGAGCGTCC GCGCGTCGAG
CGCTGGCGCG AGCGCGGGCC GCAGGCGTCC GACTTCGTCT GGGGCCGCTC CACGCTGTTC
GTCGACCCGC CCGACAAGGT GCTCAAGCAG GCCGGCGAGA GCGAGCTGAT GTTGCCGCGG
CTGGCCCGCA CGCTGGGCAA TGCCGACCGC AGCATCGACA TCGTGTCGCC CTACTTCGTG
CCCACCGACG ACGGCGTGGC CGCGTTCACC GCGCTGCACG ACCGCGGCGT GCGGCTGCGC
GTGCTGACCA ACTCGCTGGC CGCGACGGAT GTCTCGGCGG TGCACGCCGG CTATGCACCA
TATCGCCCGG CGCTGCTCGA CGGCGGTGTG GAGCTCTACG AACTGCGGCC GGTCCCGGCA
CCCGAAGGTC GCGCGCGCGG GCGGCTGCTG GGCCTGGGTT CGTCGCGCGC GAGCCTGCAC
GCCAAGACCT TCGCCGTCGA CGGCGAGCGC GTGTTCATCG GCTCGTTCAA CTTCGATCCG
CGCTCGGCCT GGCTCAACAC CGAGATCGGC GTATTGCTCG AGCACCCAGG CCTCGCGCAG
CGCATCGGAC AAGCGTTCGA CACCGAGGTG CCGCGCCAGG CCTGGCAGGT GACGCGCGAC
CCTGACCGCG GCCCCGACGC GCTGCGCTGG CGCGGCCAGA CGCCCGAAGG CCAGCCGATC
GAGCTGACCG AGGAGCCCGA GGCCGGCTGG CTCACGCGGC TGTGGGTGTG GCTGCTGTCG
TGGCTGCCGA TCGACGGGCT GCTGTAG
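
The GC content field in the gene information can be recomputed from a sequence laid out like the one above. This is a minimal sketch: `gc_content` is a hypothetical helper name, and the whitespace stripping assumes the 10-base block layout used on this page. It is demonstrated on the first block only, not on the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and newlines."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above.
first_block = "ATGGGTGCGG"
print(gc_content(first_block))  # 70.0
```

Passing the complete gene sequence, with or without the block spacing, gives the whole-gene value to compare against the listed GC content.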
 
Protein sequence
MGAAWHHRRE AAAMPSFASD LPTPRPRHRV VPLTLWRVAM LLAWLGLAGC SSLPSRPPLP 
DEAALPAATA GALAAQVAPL VQAHPGRSGF VPLDSGLDAF AARVWLVDRA GSSLDIQTYI
WRSDRTGRWL LTRLQAAAER GVRVRLLLDD GNGSPTLDGL LSQLDAHPGA EVRFFNPYPH
RGLGRAWDLA TDFSRLHRRM HNKTFNADGV VTILGGRNVG DEYFGAAADM EFADLDVLAV
GPIVGEVSAS FDAYWNSASA YPRRSLLLPG NDRPLTPEQQ AALDEASAYA KDLRERPRVE
RWRERGPQAS DFVWGRSTLF VDPPDKVLKQ AGESELMLPR LARTLGNADR SIDIVSPYFV
PTDDGVAAFT ALHDRGVRLR VLTNSLAATD VSAVHAGYAP YRPALLDGGV ELYELRPVPA
PEGRARGRLL GLGSSRASLH AKTFAVDGER VFIGSFNFDP RSAWLNTEIG VLLEHPGLAQ
RIGQAFDTEV PRQAWQVTRD PDRGPDALRW RGQTPEGQPI ELTEEPEAGW LTRLWVWLLS
WLPIDGLL
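
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table from the standard compact layout (bases in T, C, A, G order). Table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which this sketch does not model. `translate` is an illustrative name, and ambiguous bases such as N are not handled (they raise `KeyError`).

```python
from itertools import product

# Codons enumerated TTT, TTC, TTA, TTG, TCT, ... to match the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGGTGCGG CATGGCACCA TCGGCGCGAA"))  # MGAAWHHRRE
```

The output matches the first block of the protein sequence, and the final codons of the gene (`GAC GGG CTG CTG TAG`) translate to the closing `DGLL` followed by the stop codon.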