Gene Mpe_A1217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1217 
Symbol 
ID4787064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1312063 
End bp1315056 
Gene Length2994 bp 
Protein Length997 aa 
Translation table11 
GC content70% 
IMG OID640089782 
Producthypothetical protein 
Protein accessionYP_001020414 
Protein GI124266410 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACC AGCTGGGAAG ACAACAGGCA CTGCTGCTGG CCTGCAGCGC CGCGCTGCTG 
GGCCTCGGGC TCACGGCGTG CGGCAGCGGC GGTGGCGGCT CGGACACGGT CACCGTCCAG
GGCGACGTGC CGATCGCCTA CGTGAAGCGC GCCAACACGA TCCGCATGAA CCCGACCAAC
GGCGCGCCGA CCGCGCCGGG CGGCGACCTG ATGATCCGCG AGAAGTCCTC GCCGAGCGCG
CCCGAGCACA ACATCACCAC CCAGTTCACG CAGGGGCAGG GCGACGCGTC CGACCCCGAG
GTCTCGTACG ACGGCAAGAA GATCGTTTTC GCGATGCGCT GCCCGACCAC GAACACTGCG
CAGATCGATG GCGGGCCCGC CTGCACCGGC CGCTGGAACA TCTGGGAATA CGACATGACC
ACCGGCGGCT ACACCGGCGG CAGCTTCCGG CGCCTGACCA GTTCGACGCA GGACGATGAC
GTGGACCCGG CCTACCTGCC GGCCGACCGC GGCTTCGTGT TCTCGTCGAA TCGCCAGACC
AAGTCGAAGA CGACGCAGGC GCTCGGCCAG ACCTACTACG CACTCGACGA GTACGAGCGC
GAGCGCGTCT TCAACCTGCA CACGATGACC GCCAACGGCG TGAACATCCA GCAGATCTCG
TTCAACCAGA GCCACGACCG CAACCCGGTG GTGCGCCCGA ACGGCGACAT CCTGTTCTCG
CGCTGGGAGC ATGTGGGCGA CCGCAACCGC TTCGCGATCT TCCGCACCAA GCCCGACGGC
ACCGACATGT TCGTGCTGTA CGGCGCGCAC AGCCCGGGCA ACAGCTTCCT GCACCCGCGC
GACATGGATC CGGCCGGCGC CTACAGCGGC TTCCTGACCT CTTCGCTGAT GTCGCTGTCG
GGCACCCACG AGGGCGGCTC GCTGATGCTG GTCGACGCCG CGAACTACTC CGAATACAAC
ACGCCCGCCA ACCGCAACGT ACAGGCGCTG GGCGGGCAGG CGCAGATCAC CGCGCAATCG
CTCAACGACG GGCGAGGCCT GTCGCGCTAC GGCCGCGTCA CCTCGCCGTT CCCGCTGTGG
GACGGTACCG ACCGCGTGCT GGTGGGCTAC CGGCCCTGCG AGGTCACGCG CGACGGCGAC
GTGGTGTCGT GCGCGACACT CAGCAGCGCC GAGATTGCGC GGCTCAACGA CGAGGAGCGC
ACCGAGGCCG AGGTCGCTGC CGACCCGGTG CAGGACAACG TGCCGCCGTC GTACGCGATC
TACATGTACG ACCCGTCCAA GCAGACCTGG CTGAACGTGG CCGCTCCGCC TTCGGGCTTC
ATGTACACCG ACCCGGTCGC GCTGCAGCAG CGCCCCGAGC CGAACGCCGC CGACCCGACC
AACGTGGACC CCACGCTCGC GGCGCAGAAT CTGGCGCTGA TCGAGGTGCG CAGCGTCTAC
GACACCGACG GCCTCGACCG CATGGGCACC TCGATGCTCG CCGCCGCCGA CCTGCCGAGC
GGCTGCACCA CCGCGATCGA GAAGACCGCA CCGACCGATC CGCTGGACAC CCGCAACCTG
GTCGCCGACC TGCTGCGCAT CAAGGACCCG GCCGACCCGG CCTACAACTG TGCGCCGGCG
CGCTTCGTTC GCGCGGTGCG GGCGGTGGCG CCGCAGGCCA ACATGATGGG CATGCGCGAG
GCGATCGGCG AGACCGACTT CGAGCCGCAG CAGATCCTCG GCTACGCGCC GGTGGAGCCC
GACGGTTCCT TCAAGCTGCA GGTGCCGGCC GACACCCCGC TGGCACTGGC GATCGTCGAC
GCCAAGGGCC GTGGTATCCA GACCCACCTG AACTGGATCC AGGTGCGGCC CGGCGAACGC
CGCACGTGCG ACGGTTGCCA CAGTCCGCGG CGCGGTGCTG CGCTCAACTC GGGCTCGATC
GTCAACACGC TGGCGACGGC GCTGCTGCCG TCGATGTCGG GCGCGCACCA GTCCGGCGAG
ACCATGGCCT CGCTGCGCAC GCGGCTGGAT CCGACGGCGC TGAGCCTCGG CGCCGACATG
GTCTACACCG ACGTGTGGGC CGACACCAGC CGCGGCGGCG TGGCGCGCGC GCCGATCACG
GTGCGCTACA CCGGCAACAC CAATCCCGCC GACGACCTGG CGACTGCGGT GCCGGTCAGC
GGCATCATCA ACTACGCCGA GCACATCCAG CCTCTGTGGA CGCGCAACCG TGGAGGCAAC
ACCTGCACGG GCTGCCACAA CGACCCGGCC AAGCTCTCGC TGCAGGGCAC GACCAGCGGT
ACGGGCCGAC TGCTGTCCTA CGACGAACTG CTGATCGGCG ATCCGGTGAT CGACGCCGGC
ACCGGCCTGC CGGTGACGCG CATCGAGGAC GGCGTGCCGG TGATCGTGCG CGGCGCCGCA
GTGGTGGAGA CCATGAGCGG CAATGCCGGC GGTCTGGCGC GCATGAGCCG GCTCACCGAG
ATCCTGTTCG GCGAGGAGCT GATGGCCGGC GCCGCGGCGC GCACCGCGCA TCCCAACCCG
CCCGGCACCG CACCGAACCA CGCGACCATC CTCAATGCGG CGGAACGCCG TCTGGTGACC
GAGTGGATGG ACCTGGGCGG CCAGTACTTC AACGACCTGA CCAGCAGCCC GAGCGTCGTC
AACGTCGCCG CGGCGCTGAC CCAGGCCTCG TTCGAGGCCC AGGTGCAGCC GGTGCTGCGT
GCCAGCTGCT CGGCGGGCTG CCATCAGCCG GGCGGCAATG CCGGCGCGTC GCAGACGACG
CCTTCCTATG CGCGCAACCG CTTCATCCTC ACCGGCGACC CGGGCGGTGA CTACAACGTC
ACGCTGACGA TGATCTCCGA CACCTGCAAC GCGGCGGCGA ACTACCTGCT GAGCCGTCCG
TCCACGGTGC CGCACCCGGC CGGGGCCGCC GGCCAGAGCG CCGCCGTGCT GCCGGTCGGC
AGCGCGGGCT ACACGGCGAT CGCCAACTGG ATCACCAGCG GATGCACGCC ATGA
 
Protein sequence
MKNQLGRQQA LLLACSAALL GLGLTACGSG GGGSDTVTVQ GDVPIAYVKR ANTIRMNPTN 
GAPTAPGGDL MIREKSSPSA PEHNITTQFT QGQGDASDPE VSYDGKKIVF AMRCPTTNTA
QIDGGPACTG RWNIWEYDMT TGGYTGGSFR RLTSSTQDDD VDPAYLPADR GFVFSSNRQT
KSKTTQALGQ TYYALDEYER ERVFNLHTMT ANGVNIQQIS FNQSHDRNPV VRPNGDILFS
RWEHVGDRNR FAIFRTKPDG TDMFVLYGAH SPGNSFLHPR DMDPAGAYSG FLTSSLMSLS
GTHEGGSLML VDAANYSEYN TPANRNVQAL GGQAQITAQS LNDGRGLSRY GRVTSPFPLW
DGTDRVLVGY RPCEVTRDGD VVSCATLSSA EIARLNDEER TEAEVAADPV QDNVPPSYAI
YMYDPSKQTW LNVAAPPSGF MYTDPVALQQ RPEPNAADPT NVDPTLAAQN LALIEVRSVY
DTDGLDRMGT SMLAAADLPS GCTTAIEKTA PTDPLDTRNL VADLLRIKDP ADPAYNCAPA
RFVRAVRAVA PQANMMGMRE AIGETDFEPQ QILGYAPVEP DGSFKLQVPA DTPLALAIVD
AKGRGIQTHL NWIQVRPGER RTCDGCHSPR RGAALNSGSI VNTLATALLP SMSGAHQSGE
TMASLRTRLD PTALSLGADM VYTDVWADTS RGGVARAPIT VRYTGNTNPA DDLATAVPVS
GIINYAEHIQ PLWTRNRGGN TCTGCHNDPA KLSLQGTTSG TGRLLSYDEL LIGDPVIDAG
TGLPVTRIED GVPVIVRGAA VVETMSGNAG GLARMSRLTE ILFGEELMAG AAARTAHPNP
PGTAPNHATI LNAAERRLVT EWMDLGGQYF NDLTSSPSVV NVAAALTQAS FEAQVQPVLR
ASCSAGCHQP GGNAGASQTT PSYARNRFIL TGDPGGDYNV TLTMISDTCN AAANYLLSRP
STVPHPAGAA GQSAAVLPVG SAGYTAIANW ITSGCTP