Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1148 |
Symbol | |
ID | 4785723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1227099 |
End bp | 1229912 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640089711 |
Product | DNA-directed DNA polymerase |
Protein accession | YP_001020344 |
Protein GI | 124266340 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.725762 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.433626 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCTC ACGACGATTC CTGCCTGTTG CTGGTGGACG GTTCCAGCTA CCTCTACCGC GCCTACCACG CGCTGCCTGA CCTGCGCAAT CCGGCCGGCG AGCCGACCGG CGCGGTGCGC GGCATGGTGG CCATGCTGAA GAAGCTGCGC GAGGAGTTCC CGTCGGCGCA CGCGGCCTGC GTGTTCGACG CCAAGGGCAA GACCTTCCGC GACGACTGGT ACCCGGAGTA CAAGGCGAAC CGCGCCTCGA TGCCGGAGGA TCTGGCGAGG CAGATCGCGC CGATCCACGC TGTGGTGACG CTGCTGGGCT GGCCGGTGCT CGAGATCCCC GGCATCGAGG CCGACGACGT GATCGGCACG CTGGCGCGGG CGGCCGCCGC GCGCGGCCAG CGGGTCATCG TCTCGACAGG CGACAAGGAC TTGGCGCAGC TCGTCGACGC GCACGTGACG CTGATCAACA CGATGAGCGG CGAGCGGCTC GACGTGGCCG GCGTGACCGA GAAGTTCGGT GTGCCGCCCG AGCGCATCGT CGACTACCTG ACGCTGGTCG GCGACGCGGT CGACAACGTG CCGGGAGTCG AGAAGGTGGG GCCGAAGACC GCCGCCAAGT GGATCGCGGA GCACGGCTCG CTGGACGGCG TGATGGCCGC GGCCGACGCC ATCAAGGGCG TTGCCGGCGA GAACCTGCGC AAGGCGCTGG ATTGGCTGCC GCTGGGTCGC CGGCTGGTGA CCGTGAAGAC CGACTGTGAT CTGTCGGAGG CACTGCCGGG CTGGAACGGC ACGGCCGCCT GGGACACGCT GACCTGGCGC GAGACCGACC GCGCGGCCCT GTTGGCCTTC TACACGCACA ACGACTTCCG CGCGTGGCGC AATGAGCTCG AGTCGGCCCG TGCCGCCGCC GCGCCGGCCC CGCAAGTGGC GGCGGCGGAG GCCGGGGAAG GGCAGAGCGC GCTGTTCGCC GACCCGGCGG GTACCGGGCC GGCCGATGGT GGCGTCGCGC CTGCGGTCGA CAAGCGCTAC GAGACCGTGC TGGCGCGCGA GGCCTTCGAG GCGTGGCGCG CGCGCATCGA GGCGGCCGAC CTGGTGGCTC TGGACACGGA GACCGACTCG CTCGACGGCA TGCGGGCACG CATCGTCGGC CTGAGCTTCA GCGTGCAGCC CTACGAGGCC TGCTACATCC CGCTCGCCCA CACCTACCCG GGCGCGCCCG ACCAGCTGCC GCTCGATGAG GTGCTGGCGG CGCTGAAGTC CTGGCTCGAG GACGGCTCGC GCGCCAAGCT CGGCCAGAAC GTCAAGTACG ACACCCATGT GTTCGCCAAC CATGGCATCG CGGTGCGTGG CTATGTGCAT GACACCCTGC TGCAGAGCTA CGTGCTGGAG GCGCACAAGC CGCACAGCCT GGAAAGCCTC GCCAGCCGCC ACCTCGACCG CAAGGGCCTG AGCTACGAGG ACGTGGCCGG CAAGGGCGCG CAGCAGATCC CGTTCGCGCA GGTCGAGCTG ACGCGCGCCA CCGAGTATTC GGGCGAGGAC AGCGACATGA CGCTGGACGT GCACAGGGTC TTGTGGCCGC AGCTCGAGGC CGCGCCGCGC CTGCGCGAGG TCTACGAGCG CATCGAGATG CCGACCTCGG TGATCCTCGG CCGCATCGAG CGTCACGGCG TGCTGATCGA CAGCGCGCTG CTCGCGCGCC AGAGCGCCGA TCTGGCGCAG CGCATGGTGG CACTGGAGCA GGAGGCGCAT GCGCTGGCCG GCCAGCCCTT CAACCTGGGC AGCCCCAAGC AGATCGGCGA GATCCTGTTC AACAAGCTGG GCATCCCGGC GAAGAAGAAG ACCGCCAGCG GCGCGCCGAG CACCGACGAG GAGGTCCTGG CCGAGCTGGC CGCCGACTAC CCACTGCCGG CCAAGCTGCT GGAGCACCGC TCGCTCGCCA AGCTCAAGGG CACCTACACG GACAAGCTGC CGCTGATGGT GAACGCGGCC ACCGGCCGCG TGCACACCAA CTACGCGCAG GCAGTCGCGG TGACCGGCCG GCTGGCCAGC AACGACCCCA ACCTGCAGAA CATCCCGATC CGCACGCCCG AGGGCCGGCG CGTGCGCGAG GCCTTCATCG CGCCGCCCGG CCACGTGATC CTGAGCGCCG ACTACTCGCA GATCGAGCTG CGCATCATGG CCCACATCTC CGAGGATCCG GGCCTGCTGA AGGCCTTTGC CGAGGGCCTG GACGTGCACC GCGCCACCGC GAGCGAGGTG TTCAACGTGC CGGTGGCCGA GGTCAGCAGC GAGCAGCGGC GCTATGCCAA GGTCATCAAC TTCGGGCTGA TCTACGGCAT GGGCGCCTTC GGTCTGGCGA GCAACCTCGG CATCGAGCAG AAGGCCGCCA AGGACTACAT CGATCGCTAC TTCGCGCGCT TCGCCGGCGT GAAGCGCTAC ATGGACGAGA CCCGCGCGCG GGCCAAGGAG CTGGGCTACG TGGAGACCTT GTTCGGGCGC CGCATCTACC TGCCCGAGAT CAACGGCGGC AACGGTCCGC GCCGCACCGG CGCCGAGCGC CAGGCGATCA ACGCGCCGAT GCAGGGCACC GCGGCCGACC TGATCAAGCT CGCGATGATC GCGGTGCAGG CGGCGCTCGA TGCCCAGCAG CGCGCCACGT GCATGGTGAT GCAGGTGCAC GACGAGCTGG TGTTCGAAGT GCCCGAGGCC GAGCTCGACT GGGCCCGGAC CGCCGTGCCG GAACTGATGG CCGGCGTGGC CGAGCTGAAG GTGCCGCTGC TGGCCGAGGT GGGCGTGGGC GCGAACTGGG ACCTCGCCCA CTGA
|
Protein sequence | MSAHDDSCLL LVDGSSYLYR AYHALPDLRN PAGEPTGAVR GMVAMLKKLR EEFPSAHAAC VFDAKGKTFR DDWYPEYKAN RASMPEDLAR QIAPIHAVVT LLGWPVLEIP GIEADDVIGT LARAAAARGQ RVIVSTGDKD LAQLVDAHVT LINTMSGERL DVAGVTEKFG VPPERIVDYL TLVGDAVDNV PGVEKVGPKT AAKWIAEHGS LDGVMAAADA IKGVAGENLR KALDWLPLGR RLVTVKTDCD LSEALPGWNG TAAWDTLTWR ETDRAALLAF YTHNDFRAWR NELESARAAA APAPQVAAAE AGEGQSALFA DPAGTGPADG GVAPAVDKRY ETVLAREAFE AWRARIEAAD LVALDTETDS LDGMRARIVG LSFSVQPYEA CYIPLAHTYP GAPDQLPLDE VLAALKSWLE DGSRAKLGQN VKYDTHVFAN HGIAVRGYVH DTLLQSYVLE AHKPHSLESL ASRHLDRKGL SYEDVAGKGA QQIPFAQVEL TRATEYSGED SDMTLDVHRV LWPQLEAAPR LREVYERIEM PTSVILGRIE RHGVLIDSAL LARQSADLAQ RMVALEQEAH ALAGQPFNLG SPKQIGEILF NKLGIPAKKK TASGAPSTDE EVLAELAADY PLPAKLLEHR SLAKLKGTYT DKLPLMVNAA TGRVHTNYAQ AVAVTGRLAS NDPNLQNIPI RTPEGRRVRE AFIAPPGHVI LSADYSQIEL RIMAHISEDP GLLKAFAEGL DVHRATASEV FNVPVAEVSS EQRRYAKVIN FGLIYGMGAF GLASNLGIEQ KAAKDYIDRY FARFAGVKRY MDETRARAKE LGYVETLFGR RIYLPEINGG NGPRRTGAER QAINAPMQGT AADLIKLAMI AVQAALDAQQ RATCMVMQVH DELVFEVPEA ELDWARTAVP ELMAGVAELK VPLLAEVGVG ANWDLAH
|
| |