Gene Information |
Locus tag | Mext_2471 |
Symbol | |
ID | 5835636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2771870 |
End bp | 2775013 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641368273 |
Product | DNA polymerase I |
Protein accession | YP_001639937 |
Protein GI | 163851894 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
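
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 2771870
end_bp = 2775013
gene_length_bp = 3144
protein_length_aa = 1047


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of three.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks agree with the table under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 2775013 - 2771870 + 1 gives 3144 bp, and 3144 / 3 - 1 gives 1047 aa, matching the Gene Length and Protein Length rows.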
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.195384 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCGAAG AGACGCAACC GCAACCGGCC AAGCCCGTCT GCGCCGATGA CCGGGTGATC CTCGTTGACG GCTCGTCCTT CATCTTCCGA GCGTACTTCC AGTCGATCAA TCAGGACCAG AAGTACAACA CGCGGCCCTC CGACGGCCTG CCGACCGGTG CGGTGCGCCT GTTCTGCACC AAGATCGCGC AGTTCCTTCA GGAGGGTGCG GCGGGTGTGA AGCCGACCCA TCTTGGTATC GTCTTCGACA AATCCGAGGG CTCGTTCCGC AAGGAGCTGT TCCCCGACTA CAAGGGCCAC CGCCCTGACG CGCCGGACGA CCTCAAGCGC CAGATGCCGC TGATGCGCGA TGCGGTGCGA GCCTTCGGGC TGCATGCGGT CGAGCTGGTG CGGTATGAGG CCGACGACCT GATCGCGACC TACGCGCGCC AAGCGGAGGC GCGCGGCGCC GAGGTCATCA TCGTGTCCTC CGACAAGGAC CTGATGCAGC TCGTCTCGGA CAAGATCCGG TTCTACGATT TCGAGTCAGG CGCCAAGGGC AAGCCCGGCT ACCGCCCCGA GCGCAACCTC GACCGCGAAG CGATCATCGC CAAGTGGGAG GGCCTCGCGC CCGAGCAGAT CGGCGACGCC CTGGCACTGA TCGGCGACAC CTCCGACAAC GTGCCGGGTG TGCCCGGCAT CGGCCTGAAG ACGGCGGCTG CGCTCATCAA GGAATACGGC AGCCTGGAGC AGCTCTTGGA GCGGGCCAGC GAGATCAAGC AGCCCAAGCG CCGGGAGATG CTGCTCGCCA ACATCGATCA GGCCAAGCTC TCGCGCCGCC TCGTCGCGCT CGAGGAGAGC GTGCCGGTGC CGGTGCCACT CGACGAACTC GGCGTGCCGC AGCCCGATCC GCAGAAGCTC GTCGGCTTCC TGAAGGCCAT GGAGTTCAAC ACTCTGACGC GGCGCATCGC GCAGATGCTC CATGTCGATC CCGAAGCGGT GAAGCCCGAC CCGGCTTTGC TGCCGGGCGC ACAGCCTCAC GCCTATTCCA ACGCGGCCGG CGGCAGTGAC GCCGTGCCGT TCTTCGGCGA CGCCGTGCCG GTCGATCCCG AGACGGCCGC CGCACCCGAG CGGACGGATG GGGAGGCGCC GCCCGAAGCG GGCGAGGCCG ATCCCTTCGC CGATCTCGAC CTGCCGGATC AGGCTCCGAA GAAGAAGCGC GGCTCCAACG AGCCGACGCC CGCGACCCTC GTGGCCGCCC GCGCGGCGGA ATCGGTCAAA CCGTTCGATA CGGCCGCCTA CGAAACCATC GCCACCGTGG CGCAGCTCGA GGCCTGGATC GCCGAGAGCT ACGAGGCCGG CGTGATCGCG GTCGATACCG AGACCGACGC GCTCGACGCG GCCAAGGCGG GACTCGTCGG CGTCTCGCTC GCCACCGCGC CGGGGCGGGC CGCCTATATC CCGCTCGCCC ACGTGAAGCC CGAGGTGAAG GGCGGCGACC TGTTCGGCGA GAGCGGGGCA GGAGCCGATG CGTCCGACAG GCAGCCGGGC CAGATCGATT TCGACACCGC CCTCAAGCTG CTCAAGCCGC TGCTCGAGGA TGCCGGCACG CTCAAGGTCG GACAGAATCT GAAATACGAC CTCTCGGTGC TGCACCGCTA CGGCATCGAC GTGAGGCCCT TCGACGACAC GATGCTGATC TCCTACGTGC TCGATGCCGG CAAGGGCGGG CACGGCATGG ACGAGCTCGC CCGCCGCCAT CTTGGCCATC AGCCGATCAC CTTCGCCGAC GTCGCCGGCA CCGGCCGCAA CAAGGTCACC 
TTCGACCGCG TGGCGATCGA CAAGGCGACC GCCTACGCGG CGGAAGACGC CGACGTCACC TTACGCCTGT GGCGGATGAT GAAGCCGCGT CTCGTCGCCG AGCACCGGGT GACGGTCTAC GAGACGCTGG AGCGTCCGCT GGTGCCAGTG CTGGCGCGGA TGGAGCGGGC CGGCATCGCC ATCGACCGCA ACATGCTGAG CCGTCTCTCC GGCGACTTCT CGCAGATCCT GGCGCGGCTC GAGGAGGAGA TTCAGGAGGA CGCGGGCGAG CGGTTTCAGG TCTCCTCACC GAAGCAGATC GGCGACGTGC TGTTCGGCAA GATGGGCCTG CCCGGCGCGA AAAAGACGCC GTCCGGTCAG TGGGCCACGC CCGCGACGCT TCTCGAAGAG CTGGCCCAGG CCGGCCACGA CCTGCCGAAG AAGATTCTCA ACTATCGCCA GCTCTCCAAG CTGAAATCGA CCTACACCGA CTCGCTGCAA CAGCACGCCG ACCGGGGGAC CAACCGCGTC CACACCTCGT TCGCCCTCGC GGCGACGACG ACCGGCCGGC TCTCTTCGTC GGATCCGAAC CTGCAGAACA TCCCGATCCG CACGGAGGAG GGGCGGCGCA TCCGCCGCGC CTTCGTCGCG CCCGAGGGTA AGAAGCTGAT CTCAGCCGAT TACAGCCAGA TCGAGCTGCG CCTGCTCGCC CACATCGCCG ACATCCCGCA ATTGCGCGAA GCGTTCGAGC AGGGAATCGA CATCCACGCG GCGACGGCGT CGGCCATGTT CGGCGTCGCC CTCGACCAGA TGACCGGCGA CCTGCGGCGC CGGGCCAAGA CGATCAATTT CGGCATCATC TACGGCATCT CGGCCTTCGG GCTGGCCGAC CGCCTCGGCA TCGGCCGCGA GGAGGCATCG GCCTTCATCA AGCAGTATTT CGAGCGGTTT CCCGGCATTC GCGACTACAT CGACACCACC AAGCGCTCGT GCCGCGAGAA GGGCTACGTC ACGACCCTGT TCGGCCGCGT CTGCCACTAC CCGCAGATCC GCTCGAACAA CCCGTCCGAA CGGGCGAGCG TGGAGCGGCA AGCCATCAAC GCCCCGATCC AGGGCACCGC CGCCGACATC ATCCGCCGCG CCATGACGCG GATGGAGGAT GCGCTGGAGG CCAAGAAGCT CACTGCGCGG ATGCTGCTGC AAGTGCACGA CGAACTCGTG TTCGAGGTGC CCGACGACGA GGTCGAGGCG ACGATCCCCG TGATCGCCGG GGTGATGGAA GAGGCGCCCG CGCCGGCCCT GACGCTGAGG GTGCCGCTGG TGGTCGAGGC CCGGGCGGCG GGCAACTGGG AAGAGGCGCA CTGA
Protein sequence | MTEETQPQPA KPVCADDRVI LVDGSSFIFR AYFQSINQDQ KYNTRPSDGL PTGAVRLFCT KIAQFLQEGA AGVKPTHLGI VFDKSEGSFR KELFPDYKGH RPDAPDDLKR QMPLMRDAVR AFGLHAVELV RYEADDLIAT YARQAEARGA EVIIVSSDKD LMQLVSDKIR FYDFESGAKG KPGYRPERNL DREAIIAKWE GLAPEQIGDA LALIGDTSDN VPGVPGIGLK TAAALIKEYG SLEQLLERAS EIKQPKRREM LLANIDQAKL SRRLVALEES VPVPVPLDEL GVPQPDPQKL VGFLKAMEFN TLTRRIAQML HVDPEAVKPD PALLPGAQPH AYSNAAGGSD AVPFFGDAVP VDPETAAAPE RTDGEAPPEA GEADPFADLD LPDQAPKKKR GSNEPTPATL VAARAAESVK PFDTAAYETI ATVAQLEAWI AESYEAGVIA VDTETDALDA AKAGLVGVSL ATAPGRAAYI PLAHVKPEVK GGDLFGESGA GADASDRQPG QIDFDTALKL LKPLLEDAGT LKVGQNLKYD LSVLHRYGID VRPFDDTMLI SYVLDAGKGG HGMDELARRH LGHQPITFAD VAGTGRNKVT FDRVAIDKAT AYAAEDADVT LRLWRMMKPR LVAEHRVTVY ETLERPLVPV LARMERAGIA IDRNMLSRLS GDFSQILARL EEEIQEDAGE RFQVSSPKQI GDVLFGKMGL PGAKKTPSGQ WATPATLLEE LAQAGHDLPK KILNYRQLSK LKSTYTDSLQ QHADRGTNRV HTSFALAATT TGRLSSSDPN LQNIPIRTEE GRRIRRAFVA PEGKKLISAD YSQIELRLLA HIADIPQLRE AFEQGIDIHA ATASAMFGVA LDQMTGDLRR RAKTINFGII YGISAFGLAD RLGIGREEAS AFIKQYFERF PGIRDYIDTT KRSCREKGYV TTLFGRVCHY PQIRSNNPSE RASVERQAIN APIQGTAADI IRRAMTRMED ALEAKKLTAR MLLQVHDELV FEVPDDEVEA TIPVIAGVME EAPAPALTLR VPLVVEARAA GNWEEAH
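
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method, of how the blocks could be joined, the GC fraction computed, and the DNA translated. It uses only the first three blocks of the gene sequence and the first block of the protein sequence as sample input. The helper names are hypothetical. The codon table is the standard one; translation table 11 assigns the same amino acids to codons and differs in its permitted start codons, which this sketch does not handle.

```python
# Standard genetic code, laid out in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; a stop codon is rendered as '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Sample input: the first 30 bases and first 10 residues from the record.
gene_prefix = clean_sequence("ATGACCGAAG AGACGCAACC GCAACCGGCC")
protein_prefix = clean_sequence("MTEETQPQPA")

print(translate(gene_prefix) == protein_prefix)
print(round(gc_fraction(gene_prefix), 3))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content row; the figure is not reproduced here because only a prefix is used.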