Gene Mpe_A1181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1181 
Symbol 
ID4785580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1265095 
End bp1268427 
Gene Length3333 bp 
Protein Length1110 aa 
Translation table11 
GC content64% 
IMG OID640089744 
Productputative ATP-binding protein 
Protein accessionYP_001020377 
Protein GI124266373 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.562353 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.115598 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGATG AACAGCTAGT TCCACGGCCA CTGTCGGAGG TCGTCTCGAT ATCTCGGCAG 
TTCCTGCGCT CCATCCGCCT CGACGCCGAT TTCGGTCGGG AAGACGCGTT GTCCGGCTAC
GTCTGCCAAG GGACGGCTCG ATCACTCCTG GAGAGCATGG CCCGGCAACT GGTGTCGACG
CGGCAACGGG CCTTCACGTG GACCGGACCG TACGGCGGCG GAAAGTCGTC CCTGGCGCTG
ATGCTGTGCT CTCTGGTCGG CCCGAACCCG CGACTGCGTG CGCGTGCTCG CGACATCCTC
GACCTCCCGG CCGACAGCCC GGTGAGCAAA GCCTTTGAGG CGCGAGGCGA GGGCTGGCTC
GTGGTCCCGG TGGTCGGCAA GCGTGCAAGC GTCGTTCAAG AACTGCACAC TGCCCTGAAC
AAGGCGCGAG GAGGCGGCCG TCGAAAGCAA GGTGAGCTTG TCACCGAACT GGTCGCGGCC
GCGGAAGCGC ACCGCCAAGG CGTGCTGGTC GTGGTCGACG AACTCGGCAA GTTCCTCGAG
GCTTCTGCAC AGGGCGCCGG CGACGACATC TACTTCTTTC AAGAACTTGC CGAGGCGGCA
AGCCGCACCG CCGGCAAGCT CGTGGTGGTG GGCATCCTCC ATCAAGCCTT CGATGCGTAC
GCGACGCGAC TGGGCCGCGA CGCGCGAGAC GACTGGGCAA AGGTGCAAGG GCGCTTCGTC
GACATCCCAC TGGTTGCTGC CTCCGACGAG GTGATCGAAC TGATCGGGCA GGCGATCGAC
GTCAATCCGT CAGTCGATAG ATCCCTGCTG ATGCATTGCG TCGAACCCGT AGCTTCGGCC
ATCCGGACAC GGCGCCCCGG CACGCCGGCA AGCCTCGCTA AGAGCCTGGC CAAGTGCTGG
CCGCTTCATC CTGTGACGGC ATCGCTGCTG GGTCCGATCT CGAAGCGCAA GTTCGGCCAA
AACGAGCGCA GCACCTTCGG CTTCCTCGCG TCGCGCGAGC CCAATGGCTT TGCAGAGTTC
ATCGAGGGCC AGCCAGCGGA CTGGAGGTCC ATCTACGAGC CGGCGCGCTA CTGGGACTAC
CTGCGTGCCA ACCTGGAACC GGCCATCCTT GCGTCACCGG ACGGCCACCG TTGGGCTGCG
GCGGCCGACG CCGTCGAACG CGCCGAAGCC AAGGGCGAGG CCATCCATGT TGCGGCAACC
AAGACCGTCG CGTTGATCGA GATGTTCCGC AATGGCTCCG GTCTCGTCGC GGACGAGTCA
GTCCTCCGAG TCTCGGTTCA CGCCCGCAAC TACGGGGAAG TGAGCAAGGC GCTGCACGAC
CTGGTGCAGT GGAAGATCCT GATCGAGCGT CGCCACCTCG GGGCGTACGG TGTGTTCGCC
GGAAGCGACT TCGACATCGA GGGGGCCATC TCGCATGCGA GAGGTGAGAT CGGCGGTCCG
TCGTTGGAGC ACGTGTCCGC GCTGAGCGAT CTGCAACCGG TCTTGGCCAA ACGCTTCTAT
GCGCAGACAG GTACCATGCA TTGGTTCACT CGGCGCATCG TGCGACTGGC CGACCTGCCT
CACACGCTGG AGCACTTCCG GCAGGACAAG GGCAGCGCCG GCGCGTTCCT TCTCTGCCTT
CCCGATGTCG ACACGAGCGA ACGCACCGCC GAGCGTCAAG TGAGATCGGC CAGCGCCGAG
AACCCGACCG TGCAGGCGTT GCTAGGAACA CCGGCGAATG CCGCTCGGAT ATCGGAGCTC
GCGCTCGAGC TGGCGGCAGT CGAGCGCGTG ATGAAGACGC GGCCCGAGCT CGAAGGGGAT
GCGGTGGCGC GACGGGAGCT TGGTGGGCGT CTCGCAGCCG TCCGTGGCGC TCTGGAGGAC
GAACTCGCCG ATGCGTTCGT CCTGTCGAAG TGGTACTGGA AGGGAGAGCG GACCGGCGCC
GACAAGCACA GCTCGCTCTC GTCGATCGCA TCAGACGTCG CCAAGGACGT CTACCGCAAG
GCGCCGACGA TCTTCAGCGA ACTGCTGAAT CGCGAGGATC CATCGAGCAA CTCGAACAAA
GCTCGTAAGG ATCTGATGTA CCAGATGATC CGCGGCGGTG CCCGCGAGAA CCTCGGCTAC
ACAGGCTATC CCGCGGACGC TGGCCTCTAC TACACGATCT TGCAGGCCAC CGGCGTGCAC
CGCAGAGACG AGAAACGCGG CTGGGGCTTC TACGAGCCCT ATGTGACGAA CCCTCGGCTC
GAGGGCATGT GGCAAATGTG GACGGCTGCT CGTGAGCGCG TTGCGCAGCC GGCCCACGAG
ACGACGGCTT CCGATCTGTA TGCCTACTGG GGTGCACCAC CCTACGGCGT GCGTGCGGGG
GTCATGCCGG TGCTGGCGCT TGCCTTTTAC CTGGCATACC GGTCCGAACT CGCGATGTAC
GTTGACGGTG TGTTCACGCC GGACCTCAGC GAGGCGGTGA TCGACGACTG GCTGCAGGAT
CCACAACGGA TTCGCTTCCA GTACGTTGCG GCGTCCAAGG ACCAGGAAAG CCTGGTCAAG
GCGATCGCAG CCAGTGTCTC GGAACAGTCC AACGTCACCA TTCCGGCGGC ACCACTGGAC
GCGGCGCGCG CGCTCGTTGG TCTGGTGACG TCCCTGCCAG GCTGGACTAA GCGAACGCTC
ACGGTATCGC CAGCCGCGCA AGATGTCCGC GCGATGCTTC TCAAGGCGAG CGACCCGCAC
AAGGTTCTGT TCGCCGATCT ACCGACGCTG CTCAAGGCCG ACTCACCGAG CGAGTTGGTG
CTGCGCCTTC GCGCTGTCAC CAATGAGCTG TCGACTGCCT ATCAATCGAT GTTGGCGCGC
GTCCGTGAGC ACGTGTTGAA GGCGCTCGAC CATCAGGACC GGCCTCTTGA ATCGCTCAAC
CATCGCGCGA TCAACGTCAA GGGCATCACG GGTGAGTTCC GACTGGATGC GTTTGCAGCC
CGCCTGGAAG TGTTCGACGA CACCGACCAG GCTGTGGAGG GGATCATCAG CCTCGCGGTC
AGCAAGCCGT CGGCGCAGTG GGTCGACCGC GACATCGATG CTGCGCTGCT TCAACTCGGC
TCCTGGGCCA TCGATTTCCG TCGGGCCGAA GCCATGGCGC CGTTGCGAGG CCGCCCATCG
ACGCGGCGCG TCATCGGTGT CGTGTTCGGC GCCGGCAAGG GTCAGGACGC AACCGGCTCT
GTCGACGTCG CGGAGAGCGA CATTCCCGCG ATAGACCAGC TCGTGAAAGA GCTGCTAGCC
ACCGTCCAAC GTGAGCGACG GGAGATTGTT CTGGCCGCTC TGGCAGAAGC GGGAGCAATG
TTGGTCAAGC TAGGAATGAA GGAGAAGGCG TGA
 
Protein sequence
MSDEQLVPRP LSEVVSISRQ FLRSIRLDAD FGREDALSGY VCQGTARSLL ESMARQLVST 
RQRAFTWTGP YGGGKSSLAL MLCSLVGPNP RLRARARDIL DLPADSPVSK AFEARGEGWL
VVPVVGKRAS VVQELHTALN KARGGGRRKQ GELVTELVAA AEAHRQGVLV VVDELGKFLE
ASAQGAGDDI YFFQELAEAA SRTAGKLVVV GILHQAFDAY ATRLGRDARD DWAKVQGRFV
DIPLVAASDE VIELIGQAID VNPSVDRSLL MHCVEPVASA IRTRRPGTPA SLAKSLAKCW
PLHPVTASLL GPISKRKFGQ NERSTFGFLA SREPNGFAEF IEGQPADWRS IYEPARYWDY
LRANLEPAIL ASPDGHRWAA AADAVERAEA KGEAIHVAAT KTVALIEMFR NGSGLVADES
VLRVSVHARN YGEVSKALHD LVQWKILIER RHLGAYGVFA GSDFDIEGAI SHARGEIGGP
SLEHVSALSD LQPVLAKRFY AQTGTMHWFT RRIVRLADLP HTLEHFRQDK GSAGAFLLCL
PDVDTSERTA ERQVRSASAE NPTVQALLGT PANAARISEL ALELAAVERV MKTRPELEGD
AVARRELGGR LAAVRGALED ELADAFVLSK WYWKGERTGA DKHSSLSSIA SDVAKDVYRK
APTIFSELLN REDPSSNSNK ARKDLMYQMI RGGARENLGY TGYPADAGLY YTILQATGVH
RRDEKRGWGF YEPYVTNPRL EGMWQMWTAA RERVAQPAHE TTASDLYAYW GAPPYGVRAG
VMPVLALAFY LAYRSELAMY VDGVFTPDLS EAVIDDWLQD PQRIRFQYVA ASKDQESLVK
AIAASVSEQS NVTIPAAPLD AARALVGLVT SLPGWTKRTL TVSPAAQDVR AMLLKASDPH
KVLFADLPTL LKADSPSELV LRLRAVTNEL STAYQSMLAR VREHVLKALD HQDRPLESLN
HRAINVKGIT GEFRLDAFAA RLEVFDDTDQ AVEGIISLAV SKPSAQWVDR DIDAALLQLG
SWAIDFRRAE AMAPLRGRPS TRRVIGVVFG AGKGQDATGS VDVAESDIPA IDQLVKELLA
TVQRERREIV LAALAEAGAM LVKLGMKEKA