Gene Mpe_A1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1971 
Symbol 
ID4784757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2109686 
End bp2112031 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content66% 
IMG OID640090541 
Productputative outer membrane signal peptide protein 
Protein accessionYP_001021164 
Protein GI124267160 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4775] Outer membrane protein/protective antigen OMA87 
TIGRFAM ID[TIGR03303] outer membrane protein assembly complex, YaeT protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.208049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.130084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTTCT CCCGCGCGCT CCTCCTGTCG ACGGTCCGTC CGTCGCTCTC CTTGGCCACC 
CCTTCCGCGT TGCGCCCCTC GATTCTCGCC GCCGCCCTGG CGGTGGCAAT GCACGCGGCG
CCGGTGTGGG CGGTCGAGCC CTTCGTGCTC AAGGACATCC GTGTCGAAGG CCTCCAGCGG
GCCGACGCCG GTACCATTTT CGGCGCGCTG CCGTTCCGTA TCGGCGACAC CTACAGCGAC
GAGAAGGGCG CCGCCGCGCT GCGCGCGCTG TTCGCCACCG GCCTGTTCAA GGACGTGCGA
CTGGACGTCG ACAATGACGT GGTCGTCGTC ATCGTCGAGG AGCGGCCGAT CATCTCGTCG
GTGACCTTCG TCGGGCTCAA GGAGTTCGAC AAGGACGCGC TCACCAAGTC GCTGAAGGAT
GTGGGCATCG GCGAAGGCCA GCCGTTCGAC CGCGCGCTGG CCGACCGTGC CGAACAGGAA
CTCAAGCGCC AGTACCTCAC GCGCAGCCTC TACGGCGCCG AGGTCGTGAC CACCATCACT
CCGGTGGAGC GCAACCGCGT CAACGTCACC TTCACCGTCA GCGAAGGTGA GGTCGCGAAG
ATCAGCGACA TCCGCATCAC CGGCAACAAG GTGTTCTCGG AAAGCACGCT GCTCGGGCAG
TTCGAGCTGA CCACCGGCGG TTGGCTCACT TGGTACACGA AGACGGATCG CTACTCGCGT
ACCAAGCTCA ATGCCGACCT CGAAACGCTG CGCGCCTATT ACCTGAACCG CGGCTATCTC
GAGTTCACGG TCGACTCCAC GCAGGTGGCC ATCTCACCGG ACAAGCAGAA CATTTCGATC
ACGATCAACG TGACCGAAGG TCAGCCCTAC ACCGTGACAG CGGTCCGCCT CGAGGGTGAG
TACCTCGGCA AGGAAGACGA CTTCAAGGCG CTGGTGGCCA TCAAGCCGGG CGAGGCGTAC
CGCGCCGAGA CAGTCGCCGA GACGACCCGC CGCTTCACCG AGATGTACGG CGCCTTCGGC
TACGCCTTTG CACGCGTCGA GCCGCGTACC GAGATCGACC GTGCCACCGG GCGCGTCGAG
GTGGTGCTGG TCGGCGAGCC GAGCCGCCGG GTCTATGTGC GCCGCGTGAA CGTGGCAGGC
AATACCCGCA CCCGCGACGA GGTGGTGCGG CGCGAATTCC GGCAGTTCGA GTCGTCCTGG
TACGACGGCC GGCGCATCAA GCTGTCGCGC GACCGCGTGG ACCGGCTCGG CTACTTCAGC
GAGGTCACGA TCGACACGGC CGAGGTACCG GGTGCGCAGG ACCAGGTCGA CATCACCGCC
CAGGTGGTCG AGAAGCCGAC CGGCAATCTG CAGCTGGGCG CCGGCTTCTC CAGCGCGGAG
AAGGTCTCGC TGACTTTCGG CATCCGACAG GACAACATCT TCGGCAGCGG CAACTACCTG
GGCTTCGAGG TCAACACCAG CCGCTACAAC CGCAACATCG TGGTCAGCAC GGTCGATCCG
TACTTCACGG TCGACGGCAT CTCGCGCGCC ATCGACGTGT TCTACCGAAC GGCCAGGCCG
ATCAACAGCC AGGGCGAGGA CTACAAGCTG GTGACGCAGG GCGGGGCGAT CCGTTTCGGC
GTGCCGTTCA GCGAGTTCGA CACCGTGTTC TTCGGTGCGG GCTGGGAGCG GACCCAGATC
GAGGCCGGTG TCTCGATCCC GAACAGCTAT TTCCTGTACC GCGAGGCCTT CGGTGACACG
ACCGACAGCC TGCCGCTGAC GCTCGGTTGG CAGCGCGACG GCCGCGACAG CGCGCTGGTC
CCCACCTCCG GTCTCTACCA GCGGCTGAAC GCCGAGTGGA GCGTGGCGCT CGACACCCGG
TATCTGCGCA CCAACTACCA GATTCAGCAG TGGATTCCGC TGTCGAAGAA GTACACGCTG
GGCCTCAATG CCGAAGCCGG CTGGGGCAAG GGCTTCGGCG GCCGTCCGTA TCCGATCTTC
AAGAACTTCT ACGGCGGTGG CCTCGGGTCG GTGCGGGGCT TCGACCAGAG CTCGCTGGGC
CCGATCGACG TGACCGGCGC CTACATCGGC GGCAACCGCA AGCTCAACCT GAATGCCGAG
TTCTACGTCC CGGTGCCGGG CTCGGGCAAC GACCGCACGC TGCGCCTGTT CGGCTATCTC
GACGCCGGCA ACGTGTGGGG CGAGGACGAG AAGCTCAGCC TCGGCGACCT GCGTGCCGCG
GCCGGCATCG GCCTGAGTTG GGTCTCGCCG GTCGGGCCGC TCAAGATCAG CTACGGCCAG
CCGGTGCGCA AGTTTGCGCA GGATAGAATT CAAAAGTTGC AATTCCAGAT CGGGACCGCG
TTCTGA
 
Protein sequence
MAFSRALLLS TVRPSLSLAT PSALRPSILA AALAVAMHAA PVWAVEPFVL KDIRVEGLQR 
ADAGTIFGAL PFRIGDTYSD EKGAAALRAL FATGLFKDVR LDVDNDVVVV IVEERPIISS
VTFVGLKEFD KDALTKSLKD VGIGEGQPFD RALADRAEQE LKRQYLTRSL YGAEVVTTIT
PVERNRVNVT FTVSEGEVAK ISDIRITGNK VFSESTLLGQ FELTTGGWLT WYTKTDRYSR
TKLNADLETL RAYYLNRGYL EFTVDSTQVA ISPDKQNISI TINVTEGQPY TVTAVRLEGE
YLGKEDDFKA LVAIKPGEAY RAETVAETTR RFTEMYGAFG YAFARVEPRT EIDRATGRVE
VVLVGEPSRR VYVRRVNVAG NTRTRDEVVR REFRQFESSW YDGRRIKLSR DRVDRLGYFS
EVTIDTAEVP GAQDQVDITA QVVEKPTGNL QLGAGFSSAE KVSLTFGIRQ DNIFGSGNYL
GFEVNTSRYN RNIVVSTVDP YFTVDGISRA IDVFYRTARP INSQGEDYKL VTQGGAIRFG
VPFSEFDTVF FGAGWERTQI EAGVSIPNSY FLYREAFGDT TDSLPLTLGW QRDGRDSALV
PTSGLYQRLN AEWSVALDTR YLRTNYQIQQ WIPLSKKYTL GLNAEAGWGK GFGGRPYPIF
KNFYGGGLGS VRGFDQSSLG PIDVTGAYIG GNRKLNLNAE FYVPVPGSGN DRTLRLFGYL
DAGNVWGEDE KLSLGDLRAA AGIGLSWVSP VGPLKISYGQ PVRKFAQDRI QKLQFQIGTA
F