Gene Mpe_A1672 details


Gene Information

Locus tag: Mpe_A1672
Symbol:
ID: 4785753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1795144
End bp: 1798641
Gene Length: 3498 bp
Protein Length: 1165 aa
Translation table: 11
GC content: 66%
IMG OID: 640090241
Product: superfamily II helicase-like protein
Protein accession: YP_001020869
Protein GI: 124266865
COG category: [R] General function prediction only
COG ID: [COG1204] Superfamily II helicase
TIGRFAM ID:
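
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1795144
END_BP = 1798641
GENE_LENGTH_BP = 3498
PROTEIN_LENGTH_AA = 1165


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, ASSUMING the CDS includes exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

# Compare the derived values with the ones listed in the record.
print("gene length matches:", computed_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length agree with the start and end coordinates.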


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.406776
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCGA CGCTCGACGA ACTCCGGATA AACATCGAAG CAGCCGTGGT GCCCGGCTAC 
CGTGCGCGCC TGCTCGCCCG CGGCCAGGCG CGCGGCATGA TCTGGCGCGA AGGCGTTCTG
CCAGAGGAAT CTCCGAACTT CGGCGCGGAG TTGTCGGACG AGCTGCTGTC CTACGGTTAC
TCGCTCCTGC TGCACGGGCT TCGCTACACC GACCTGGGTG GCGATGCCGC AACGGCGCGC
AAGGCCTTTG AGATTGCCGC CGAGGCACTC GAAGCTGTCG TCGCACGCGG ACCGGCCGGC
GCTGAACGAG GCTTTCATCG GCTCGTCGCT GCCGCCGCCT ACCACTTGGG CCGCTTCTCG
GCTCGGGCCT ACTCGCTCCT CTACAAAGGA CTCGGCGAGG CGAACCTGTC GACGATGGAG
ACAGGGCTGG CGAAACTCAT GCTCCGCGAC TTGGACGGCC TTGCCCGAGA TCTCGCCGAT
TGGTTCGCTG CGGACCGAGG CAGCGACGAT GCACTTGTTG CTGGCCTAAG CCAACGCGAC
GCCAAGGTGA CCAGCGCCGG TGCTGACGAT GAAGAGACTT CGGACGACCG AGTTGACGCC
GTGCTCATTG CAGTTGAAGA CAACTTCATG GCTGCGCTCG CGGTTGCAAT GCTCGGGCTG
GAGCGCGGCG ATGAGACCTT GCTGCAGTCG GCACGCGAAC GAATACAACG CGGCCTCGAC
GTCGCCGCAG ACCTCGACCT CGTCGCCGCT TGGTGGTGCC ACCGCCTGGC GATCCACCTC
CTGGGCGGCC TATGGGAGAC AAGCTTCCAT CAGGTGCTAC CGCTCGGTGG ACCGCCCGGG
GCCGACGTCG GCGACTGGGT GCGACTGCGA AAGATCTTCA TCGCTTCGCT CTACCGCAGA
GGCCGGTCCG AGATCGAGCT GTGGCCATCC CAATTGCAGG CAGCACGTCG AGTCCTCGAT
GCCGAGGTGA ACCTCGTGCT GTCGCTCCCG ACGAGCGCAG GCAAAACGCG CATCGCGGAG
CTGTGCATCC TGGCCAGCCT GGCGCGAGGC CGGCGCGTCG TCTTCGTCAC GCCGTTGCGC
GCGCTGTCAG CGCAGACTGA GGTCGGGCTG CGGCGCACCT TCGCGCCAGT AGGCAAGTCT
GTCTCGAGCC TGTACGGCAG CATCGGCGCC AGTGGCGCCG ACATCGACGC GCTGCGTTCG
ACAGACATCG TTGTCGCCAC GCCCGAAAAG CTCGACTTCG CGCTGCGTAG TGATCCGACG
TTGCTGGATG ACGTCGGCCT TATCGTGCTC GACGAAGGTC ACATGATCGG CCTGGGCGAA
CGTGAGGTCC GCTACGAGGC CCAGATCCAG CGCCTGCTGC GTCGGCCCGA TGCTGCACAG
CGGCGCATCG TGTGCCTGTC TGCGATTCTT CCCGATGGGG ACCAGTTGGA GGACTTCGCG
GGCTGGCTGA CACGTGATCG CCCCGACGGG CTGATCAAGG ACGATTGGCG TCCGACCCGG
CTGCGCTTCG GCGAGGTGGA TTGGAAGGGT CAGCACGCAC AGCTCAACGT CAACGTCGGC
GACGAGAAGC CTTTCATCCC GAGATTCGTG GTCGCGAAGA AGCCAACGCG TGGCCGCGCC
AGGAAGCTGT TCCCGGCCGA TCAGGCGCAG CTGTGCATCG CGACGGCCTG GCGCCTGGTG
GAAGAGGGCC AGACCGTCCT TGTCTTCTGC CCGATGCGCC GCTCGGTGCT GCCGTTGGCA
GCGACCATTA TCGAGATGCA CAAGCGCGGC CACATCGACT CGGTACTGGA GCAAGCGCCG
TCCGCGCTGA CGAGGGCGCT AGCTGTCGGT ACCGAGTGGT TCGGACCCGA TCATGACATC
CTCGCCTGCC TGAAGCTCGG GGTTGCCGTG CACCACGGCG CCCTACCCAC GCCGTACCGC
AAGGAGGTCG AGCGCTTGCT GCGCGAGGGG GTGCTGCGCG TGACGGTCTC CTCGCCGACG
CTGGCCCAGG GACTCAACCT TGCCGCGACC TCCCTGGTCT TCCGCGGCGT CAGACGCGGC
CGCGACCTCC TCGACGTAGC GGAGTTCCGC AATGTCGTCG GCCGGGCTGG GCGCGCCTAC
ATCGACGTCG AAGGCCTCGT GGTGTATCCG ATGTTCGACA ACCATCGGCA GCGGCGCAAC
GACTGGCAGG ACTTGGTTGC GAACCACAGC GGCCGGGAGA TGGAGAGCGG CCTGTTGCGT
CTGATCTTTA CGTTACTCAG TCGGATGGCC AACAAGCTGG GCACCCGCGA CGTCGGAGCC
CTTCTCGAGT ACGTCGCCGG GCAGGCTGCC TGGGAATTCC CGGTGCTCCC GTCCGAGACG
ACCGAGGAAG CAACCGAAGC GCGGCACCGG TGGGACCAGC ATCTGGCAAG CCTGGACACC
GCGATCTTCA GCCTGCTGGG CGACACCATC GTGCTTGATG CCGAAGTCGA AACCAAGCTG
GATGAGATCC TCGTAGCCTC TCTGTTCCAG CGGCGCCTCG CCCGCCACAA AGAGGGGATA
CGCCAAGCGC TGCCAAAGGG ACTGCACGCT CGCGCGCGAT ACGTCTGGCG CAACAGTACG
CCCACGCAAC GGCGGGGATA CTTTCTAGCG GGTGTGGGGC TGGCCGCGGG CAAGGCCCTC
GACAAGCACG CGCCAGAACT CGAGCAACTG CTGCTGACGG CGAACGTCAG CATCGACCTC
GGCCAGCACA ATGAGGCGGT CGAGGCCATC GTGGCGTTTG CCGAGATCGC CTTCGACATC
GCGCCGTTCA AGCCGGACAA GCTGATCGAC GGGTGGGAAT CTTTGCTCAG GCGATGGCTT
CTCGGTCAGC CCGTGGCCGA CGCATTGTCC GAGGACAGCG ACGACGAGAT TCAGTTCATC
GAGCAGGCGT TCGTCTACAA CCTTCCCTGG GCTATGGAGG CGGTACGGGT CCGTGCTGAG
GCACACGAGG ACCTTTTTTC GGACGAGATC AAGCTGTCGG ATTACCCGCG GGCGCACGCG
GTCGCAGCAC TGGAGACCGG CACGCTGTCG ATCGCCGCCG CGACGCTGAT CCAGGCGGGT
TTCGGTTCGC GCCTCGGGGC AATTCGCGCC GTCGCGGAGA CGGGCGCGAC CTTTGACTCG
ATGAGCGGTC TCATGGGCTG GCTCGCATCA GACGAGGTAG GGGCCTTGTC CGCGGCGCCG
AACTGGCCAA CGCCGGAATC GAACCCACTG TGGGTTGACT TCAACGGACC GGGCGGCGCG
CAGGCGACCC AGCCTTGGGC TGCTACCGAG TACACGAGCG GGATCACGTG GCGCGGTGCA
CCAATGCCGC CCGGGACGGC TCTGCGGCTG GGCGGCGGCC CCGGGAAGGA GCGCAGCGTC
TTCACGGCTG ACTTCCGGGA AGTCGGAGCG ATCGGCTGGA CGCCAAGTGC GCAGGGCCTG
ATCGTCGCCG AAGCTACGGG CGACACCGGC AAGTTCACGT TGGAGCACAT CGGCCCTGGC
AAGGTGAACC GCGACTGA
 
Protein sequence
MAATLDELRI NIEAAVVPGY RARLLARGQA RGMIWREGVL PEESPNFGAE LSDELLSYGY 
SLLLHGLRYT DLGGDAATAR KAFEIAAEAL EAVVARGPAG AERGFHRLVA AAAYHLGRFS
ARAYSLLYKG LGEANLSTME TGLAKLMLRD LDGLARDLAD WFAADRGSDD ALVAGLSQRD
AKVTSAGADD EETSDDRVDA VLIAVEDNFM AALAVAMLGL ERGDETLLQS ARERIQRGLD
VAADLDLVAA WWCHRLAIHL LGGLWETSFH QVLPLGGPPG ADVGDWVRLR KIFIASLYRR
GRSEIELWPS QLQAARRVLD AEVNLVLSLP TSAGKTRIAE LCILASLARG RRVVFVTPLR
ALSAQTEVGL RRTFAPVGKS VSSLYGSIGA SGADIDALRS TDIVVATPEK LDFALRSDPT
LLDDVGLIVL DEGHMIGLGE REVRYEAQIQ RLLRRPDAAQ RRIVCLSAIL PDGDQLEDFA
GWLTRDRPDG LIKDDWRPTR LRFGEVDWKG QHAQLNVNVG DEKPFIPRFV VAKKPTRGRA
RKLFPADQAQ LCIATAWRLV EEGQTVLVFC PMRRSVLPLA ATIIEMHKRG HIDSVLEQAP
SALTRALAVG TEWFGPDHDI LACLKLGVAV HHGALPTPYR KEVERLLREG VLRVTVSSPT
LAQGLNLAAT SLVFRGVRRG RDLLDVAEFR NVVGRAGRAY IDVEGLVVYP MFDNHRQRRN
DWQDLVANHS GREMESGLLR LIFTLLSRMA NKLGTRDVGA LLEYVAGQAA WEFPVLPSET
TEEATEARHR WDQHLASLDT AIFSLLGDTI VLDAEVETKL DEILVASLFQ RRLARHKEGI
RQALPKGLHA RARYVWRNST PTQRRGYFLA GVGLAAGKAL DKHAPELEQL LLTANVSIDL
GQHNEAVEAI VAFAEIAFDI APFKPDKLID GWESLLRRWL LGQPVADALS EDSDDEIQFI
EQAFVYNLPW AMEAVRVRAE AHEDLFSDEI KLSDYPRAHA VAALETGTLS IAAATLIQAG
FGSRLGAIRA VAETGATFDS MSGLMGWLAS DEVGALSAAP NWPTPESNPL WVDFNGPGGA
QATQPWAATE YTSGITWRGA PMPPGTALRL GGGPGKERSV FTADFREVGA IGWTPSAQGL
IVAEATGDTG KFTLEHIGPG KVNRD