Gene Mpe_A1148 details


Gene Information

Locus tag: Mpe_A1148
Symbol:
ID: 4785723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1227099
End bp: 1229912
Gene Length: 2814 bp
Protein Length: 937 aa
Translation table: 11
GC content: 70%
IMG OID: 640089711
Product: DNA-directed DNA polymerase
Protein accession: YP_001020344
Protein GI: 124266340
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
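
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1227099
end_bp = 1229912

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 2814 0 937
```

Under these assumptions the computed values agree with the reported Gene Length (2814 bp) and Protein Length (937 aa).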


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.725762
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.433626
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCTC ACGACGATTC CTGCCTGTTG CTGGTGGACG GTTCCAGCTA CCTCTACCGC 
GCCTACCACG CGCTGCCTGA CCTGCGCAAT CCGGCCGGCG AGCCGACCGG CGCGGTGCGC
GGCATGGTGG CCATGCTGAA GAAGCTGCGC GAGGAGTTCC CGTCGGCGCA CGCGGCCTGC
GTGTTCGACG CCAAGGGCAA GACCTTCCGC GACGACTGGT ACCCGGAGTA CAAGGCGAAC
CGCGCCTCGA TGCCGGAGGA TCTGGCGAGG CAGATCGCGC CGATCCACGC TGTGGTGACG
CTGCTGGGCT GGCCGGTGCT CGAGATCCCC GGCATCGAGG CCGACGACGT GATCGGCACG
CTGGCGCGGG CGGCCGCCGC GCGCGGCCAG CGGGTCATCG TCTCGACAGG CGACAAGGAC
TTGGCGCAGC TCGTCGACGC GCACGTGACG CTGATCAACA CGATGAGCGG CGAGCGGCTC
GACGTGGCCG GCGTGACCGA GAAGTTCGGT GTGCCGCCCG AGCGCATCGT CGACTACCTG
ACGCTGGTCG GCGACGCGGT CGACAACGTG CCGGGAGTCG AGAAGGTGGG GCCGAAGACC
GCCGCCAAGT GGATCGCGGA GCACGGCTCG CTGGACGGCG TGATGGCCGC GGCCGACGCC
ATCAAGGGCG TTGCCGGCGA GAACCTGCGC AAGGCGCTGG ATTGGCTGCC GCTGGGTCGC
CGGCTGGTGA CCGTGAAGAC CGACTGTGAT CTGTCGGAGG CACTGCCGGG CTGGAACGGC
ACGGCCGCCT GGGACACGCT GACCTGGCGC GAGACCGACC GCGCGGCCCT GTTGGCCTTC
TACACGCACA ACGACTTCCG CGCGTGGCGC AATGAGCTCG AGTCGGCCCG TGCCGCCGCC
GCGCCGGCCC CGCAAGTGGC GGCGGCGGAG GCCGGGGAAG GGCAGAGCGC GCTGTTCGCC
GACCCGGCGG GTACCGGGCC GGCCGATGGT GGCGTCGCGC CTGCGGTCGA CAAGCGCTAC
GAGACCGTGC TGGCGCGCGA GGCCTTCGAG GCGTGGCGCG CGCGCATCGA GGCGGCCGAC
CTGGTGGCTC TGGACACGGA GACCGACTCG CTCGACGGCA TGCGGGCACG CATCGTCGGC
CTGAGCTTCA GCGTGCAGCC CTACGAGGCC TGCTACATCC CGCTCGCCCA CACCTACCCG
GGCGCGCCCG ACCAGCTGCC GCTCGATGAG GTGCTGGCGG CGCTGAAGTC CTGGCTCGAG
GACGGCTCGC GCGCCAAGCT CGGCCAGAAC GTCAAGTACG ACACCCATGT GTTCGCCAAC
CATGGCATCG CGGTGCGTGG CTATGTGCAT GACACCCTGC TGCAGAGCTA CGTGCTGGAG
GCGCACAAGC CGCACAGCCT GGAAAGCCTC GCCAGCCGCC ACCTCGACCG CAAGGGCCTG
AGCTACGAGG ACGTGGCCGG CAAGGGCGCG CAGCAGATCC CGTTCGCGCA GGTCGAGCTG
ACGCGCGCCA CCGAGTATTC GGGCGAGGAC AGCGACATGA CGCTGGACGT GCACAGGGTC
TTGTGGCCGC AGCTCGAGGC CGCGCCGCGC CTGCGCGAGG TCTACGAGCG CATCGAGATG
CCGACCTCGG TGATCCTCGG CCGCATCGAG CGTCACGGCG TGCTGATCGA CAGCGCGCTG
CTCGCGCGCC AGAGCGCCGA TCTGGCGCAG CGCATGGTGG CACTGGAGCA GGAGGCGCAT
GCGCTGGCCG GCCAGCCCTT CAACCTGGGC AGCCCCAAGC AGATCGGCGA GATCCTGTTC
AACAAGCTGG GCATCCCGGC GAAGAAGAAG ACCGCCAGCG GCGCGCCGAG CACCGACGAG
GAGGTCCTGG CCGAGCTGGC CGCCGACTAC CCACTGCCGG CCAAGCTGCT GGAGCACCGC
TCGCTCGCCA AGCTCAAGGG CACCTACACG GACAAGCTGC CGCTGATGGT GAACGCGGCC
ACCGGCCGCG TGCACACCAA CTACGCGCAG GCAGTCGCGG TGACCGGCCG GCTGGCCAGC
AACGACCCCA ACCTGCAGAA CATCCCGATC CGCACGCCCG AGGGCCGGCG CGTGCGCGAG
GCCTTCATCG CGCCGCCCGG CCACGTGATC CTGAGCGCCG ACTACTCGCA GATCGAGCTG
CGCATCATGG CCCACATCTC CGAGGATCCG GGCCTGCTGA AGGCCTTTGC CGAGGGCCTG
GACGTGCACC GCGCCACCGC GAGCGAGGTG TTCAACGTGC CGGTGGCCGA GGTCAGCAGC
GAGCAGCGGC GCTATGCCAA GGTCATCAAC TTCGGGCTGA TCTACGGCAT GGGCGCCTTC
GGTCTGGCGA GCAACCTCGG CATCGAGCAG AAGGCCGCCA AGGACTACAT CGATCGCTAC
TTCGCGCGCT TCGCCGGCGT GAAGCGCTAC ATGGACGAGA CCCGCGCGCG GGCCAAGGAG
CTGGGCTACG TGGAGACCTT GTTCGGGCGC CGCATCTACC TGCCCGAGAT CAACGGCGGC
AACGGTCCGC GCCGCACCGG CGCCGAGCGC CAGGCGATCA ACGCGCCGAT GCAGGGCACC
GCGGCCGACC TGATCAAGCT CGCGATGATC GCGGTGCAGG CGGCGCTCGA TGCCCAGCAG
CGCGCCACGT GCATGGTGAT GCAGGTGCAC GACGAGCTGG TGTTCGAAGT GCCCGAGGCC
GAGCTCGACT GGGCCCGGAC CGCCGTGCCG GAACTGATGG CCGGCGTGGC CGAGCTGAAG
GTGCCGCTGC TGGCCGAGGT GGGCGTGGGC GCGAACTGGG ACCTCGCCCA CTGA
 
Protein sequence
MSAHDDSCLL LVDGSSYLYR AYHALPDLRN PAGEPTGAVR GMVAMLKKLR EEFPSAHAAC 
VFDAKGKTFR DDWYPEYKAN RASMPEDLAR QIAPIHAVVT LLGWPVLEIP GIEADDVIGT
LARAAAARGQ RVIVSTGDKD LAQLVDAHVT LINTMSGERL DVAGVTEKFG VPPERIVDYL
TLVGDAVDNV PGVEKVGPKT AAKWIAEHGS LDGVMAAADA IKGVAGENLR KALDWLPLGR
RLVTVKTDCD LSEALPGWNG TAAWDTLTWR ETDRAALLAF YTHNDFRAWR NELESARAAA
APAPQVAAAE AGEGQSALFA DPAGTGPADG GVAPAVDKRY ETVLAREAFE AWRARIEAAD
LVALDTETDS LDGMRARIVG LSFSVQPYEA CYIPLAHTYP GAPDQLPLDE VLAALKSWLE
DGSRAKLGQN VKYDTHVFAN HGIAVRGYVH DTLLQSYVLE AHKPHSLESL ASRHLDRKGL
SYEDVAGKGA QQIPFAQVEL TRATEYSGED SDMTLDVHRV LWPQLEAAPR LREVYERIEM
PTSVILGRIE RHGVLIDSAL LARQSADLAQ RMVALEQEAH ALAGQPFNLG SPKQIGEILF
NKLGIPAKKK TASGAPSTDE EVLAELAADY PLPAKLLEHR SLAKLKGTYT DKLPLMVNAA
TGRVHTNYAQ AVAVTGRLAS NDPNLQNIPI RTPEGRRVRE AFIAPPGHVI LSADYSQIEL
RIMAHISEDP GLLKAFAEGL DVHRATASEV FNVPVAEVSS EQRRYAKVIN FGLIYGMGAF
GLASNLGIEQ KAAKDYIDRY FARFAGVKRY MDETRARAKE LGYVETLFGR RIYLPEINGG
NGPRRTGAER QAINAPMQGT AADLIKLAMI AVQAALDAQQ RATCMVMQVH DELVFEVPEA
ELDWARTAVP ELMAGVAELK VPLLAEVGVG ANWDLAH
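
The sequences above are printed in space-separated blocks of ten residues. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence into protein. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons; this gene starts with ATG, so that difference is not exercised here). The helper names `clean`, `gc_content`, and `translate` are hypothetical and chosen only for illustration.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence listing above.
first_row = "ATGAGCGCTC ACGACGATTC CTGCCTGTTG CTGGTGGACG GTTCCAGCTA CCTCTACCGC"
last_row = "GTGCCGCTGC TGGCCGAGGT GGGCGTGGGC GCGAACTGGG ACCTCGCCCA CTGA"

print(translate(first_row))  # MSAHDDSCLLLVDGSSYLYR
print(translate(last_row))   # VPLLAEVGVGANWDLAH
```

The two translated rows match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give the value to compare against the reported GC content field.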