Gene Mext_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2471 
Symbol 
ID5835636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2771870 
End bp2775013 
Gene Length3144 bp 
Protein Length1047 aa 
Translation table11 
GC content68% 
IMG OID641368273 
ProductDNA polymerase I 
Protein accessionYP_001639937 
Protein GI163851894 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.195384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAG AGACGCAACC GCAACCGGCC AAGCCCGTCT GCGCCGATGA CCGGGTGATC 
CTCGTTGACG GCTCGTCCTT CATCTTCCGA GCGTACTTCC AGTCGATCAA TCAGGACCAG
AAGTACAACA CGCGGCCCTC CGACGGCCTG CCGACCGGTG CGGTGCGCCT GTTCTGCACC
AAGATCGCGC AGTTCCTTCA GGAGGGTGCG GCGGGTGTGA AGCCGACCCA TCTTGGTATC
GTCTTCGACA AATCCGAGGG CTCGTTCCGC AAGGAGCTGT TCCCCGACTA CAAGGGCCAC
CGCCCTGACG CGCCGGACGA CCTCAAGCGC CAGATGCCGC TGATGCGCGA TGCGGTGCGA
GCCTTCGGGC TGCATGCGGT CGAGCTGGTG CGGTATGAGG CCGACGACCT GATCGCGACC
TACGCGCGCC AAGCGGAGGC GCGCGGCGCC GAGGTCATCA TCGTGTCCTC CGACAAGGAC
CTGATGCAGC TCGTCTCGGA CAAGATCCGG TTCTACGATT TCGAGTCAGG CGCCAAGGGC
AAGCCCGGCT ACCGCCCCGA GCGCAACCTC GACCGCGAAG CGATCATCGC CAAGTGGGAG
GGCCTCGCGC CCGAGCAGAT CGGCGACGCC CTGGCACTGA TCGGCGACAC CTCCGACAAC
GTGCCGGGTG TGCCCGGCAT CGGCCTGAAG ACGGCGGCTG CGCTCATCAA GGAATACGGC
AGCCTGGAGC AGCTCTTGGA GCGGGCCAGC GAGATCAAGC AGCCCAAGCG CCGGGAGATG
CTGCTCGCCA ACATCGATCA GGCCAAGCTC TCGCGCCGCC TCGTCGCGCT CGAGGAGAGC
GTGCCGGTGC CGGTGCCACT CGACGAACTC GGCGTGCCGC AGCCCGATCC GCAGAAGCTC
GTCGGCTTCC TGAAGGCCAT GGAGTTCAAC ACTCTGACGC GGCGCATCGC GCAGATGCTC
CATGTCGATC CCGAAGCGGT GAAGCCCGAC CCGGCTTTGC TGCCGGGCGC ACAGCCTCAC
GCCTATTCCA ACGCGGCCGG CGGCAGTGAC GCCGTGCCGT TCTTCGGCGA CGCCGTGCCG
GTCGATCCCG AGACGGCCGC CGCACCCGAG CGGACGGATG GGGAGGCGCC GCCCGAAGCG
GGCGAGGCCG ATCCCTTCGC CGATCTCGAC CTGCCGGATC AGGCTCCGAA GAAGAAGCGC
GGCTCCAACG AGCCGACGCC CGCGACCCTC GTGGCCGCCC GCGCGGCGGA ATCGGTCAAA
CCGTTCGATA CGGCCGCCTA CGAAACCATC GCCACCGTGG CGCAGCTCGA GGCCTGGATC
GCCGAGAGCT ACGAGGCCGG CGTGATCGCG GTCGATACCG AGACCGACGC GCTCGACGCG
GCCAAGGCGG GACTCGTCGG CGTCTCGCTC GCCACCGCGC CGGGGCGGGC CGCCTATATC
CCGCTCGCCC ACGTGAAGCC CGAGGTGAAG GGCGGCGACC TGTTCGGCGA GAGCGGGGCA
GGAGCCGATG CGTCCGACAG GCAGCCGGGC CAGATCGATT TCGACACCGC CCTCAAGCTG
CTCAAGCCGC TGCTCGAGGA TGCCGGCACG CTCAAGGTCG GACAGAATCT GAAATACGAC
CTCTCGGTGC TGCACCGCTA CGGCATCGAC GTGAGGCCCT TCGACGACAC GATGCTGATC
TCCTACGTGC TCGATGCCGG CAAGGGCGGG CACGGCATGG ACGAGCTCGC CCGCCGCCAT
CTTGGCCATC AGCCGATCAC CTTCGCCGAC GTCGCCGGCA CCGGCCGCAA CAAGGTCACC
TTCGACCGCG TGGCGATCGA CAAGGCGACC GCCTACGCGG CGGAAGACGC CGACGTCACC
TTACGCCTGT GGCGGATGAT GAAGCCGCGT CTCGTCGCCG AGCACCGGGT GACGGTCTAC
GAGACGCTGG AGCGTCCGCT GGTGCCAGTG CTGGCGCGGA TGGAGCGGGC CGGCATCGCC
ATCGACCGCA ACATGCTGAG CCGTCTCTCC GGCGACTTCT CGCAGATCCT GGCGCGGCTC
GAGGAGGAGA TTCAGGAGGA CGCGGGCGAG CGGTTTCAGG TCTCCTCACC GAAGCAGATC
GGCGACGTGC TGTTCGGCAA GATGGGCCTG CCCGGCGCGA AAAAGACGCC GTCCGGTCAG
TGGGCCACGC CCGCGACGCT TCTCGAAGAG CTGGCCCAGG CCGGCCACGA CCTGCCGAAG
AAGATTCTCA ACTATCGCCA GCTCTCCAAG CTGAAATCGA CCTACACCGA CTCGCTGCAA
CAGCACGCCG ACCGGGGGAC CAACCGCGTC CACACCTCGT TCGCCCTCGC GGCGACGACG
ACCGGCCGGC TCTCTTCGTC GGATCCGAAC CTGCAGAACA TCCCGATCCG CACGGAGGAG
GGGCGGCGCA TCCGCCGCGC CTTCGTCGCG CCCGAGGGTA AGAAGCTGAT CTCAGCCGAT
TACAGCCAGA TCGAGCTGCG CCTGCTCGCC CACATCGCCG ACATCCCGCA ATTGCGCGAA
GCGTTCGAGC AGGGAATCGA CATCCACGCG GCGACGGCGT CGGCCATGTT CGGCGTCGCC
CTCGACCAGA TGACCGGCGA CCTGCGGCGC CGGGCCAAGA CGATCAATTT CGGCATCATC
TACGGCATCT CGGCCTTCGG GCTGGCCGAC CGCCTCGGCA TCGGCCGCGA GGAGGCATCG
GCCTTCATCA AGCAGTATTT CGAGCGGTTT CCCGGCATTC GCGACTACAT CGACACCACC
AAGCGCTCGT GCCGCGAGAA GGGCTACGTC ACGACCCTGT TCGGCCGCGT CTGCCACTAC
CCGCAGATCC GCTCGAACAA CCCGTCCGAA CGGGCGAGCG TGGAGCGGCA AGCCATCAAC
GCCCCGATCC AGGGCACCGC CGCCGACATC ATCCGCCGCG CCATGACGCG GATGGAGGAT
GCGCTGGAGG CCAAGAAGCT CACTGCGCGG ATGCTGCTGC AAGTGCACGA CGAACTCGTG
TTCGAGGTGC CCGACGACGA GGTCGAGGCG ACGATCCCCG TGATCGCCGG GGTGATGGAA
GAGGCGCCCG CGCCGGCCCT GACGCTGAGG GTGCCGCTGG TGGTCGAGGC CCGGGCGGCG
GGCAACTGGG AAGAGGCGCA CTGA
 
Protein sequence
MTEETQPQPA KPVCADDRVI LVDGSSFIFR AYFQSINQDQ KYNTRPSDGL PTGAVRLFCT 
KIAQFLQEGA AGVKPTHLGI VFDKSEGSFR KELFPDYKGH RPDAPDDLKR QMPLMRDAVR
AFGLHAVELV RYEADDLIAT YARQAEARGA EVIIVSSDKD LMQLVSDKIR FYDFESGAKG
KPGYRPERNL DREAIIAKWE GLAPEQIGDA LALIGDTSDN VPGVPGIGLK TAAALIKEYG
SLEQLLERAS EIKQPKRREM LLANIDQAKL SRRLVALEES VPVPVPLDEL GVPQPDPQKL
VGFLKAMEFN TLTRRIAQML HVDPEAVKPD PALLPGAQPH AYSNAAGGSD AVPFFGDAVP
VDPETAAAPE RTDGEAPPEA GEADPFADLD LPDQAPKKKR GSNEPTPATL VAARAAESVK
PFDTAAYETI ATVAQLEAWI AESYEAGVIA VDTETDALDA AKAGLVGVSL ATAPGRAAYI
PLAHVKPEVK GGDLFGESGA GADASDRQPG QIDFDTALKL LKPLLEDAGT LKVGQNLKYD
LSVLHRYGID VRPFDDTMLI SYVLDAGKGG HGMDELARRH LGHQPITFAD VAGTGRNKVT
FDRVAIDKAT AYAAEDADVT LRLWRMMKPR LVAEHRVTVY ETLERPLVPV LARMERAGIA
IDRNMLSRLS GDFSQILARL EEEIQEDAGE RFQVSSPKQI GDVLFGKMGL PGAKKTPSGQ
WATPATLLEE LAQAGHDLPK KILNYRQLSK LKSTYTDSLQ QHADRGTNRV HTSFALAATT
TGRLSSSDPN LQNIPIRTEE GRRIRRAFVA PEGKKLISAD YSQIELRLLA HIADIPQLRE
AFEQGIDIHA ATASAMFGVA LDQMTGDLRR RAKTINFGII YGISAFGLAD RLGIGREEAS
AFIKQYFERF PGIRDYIDTT KRSCREKGYV TTLFGRVCHY PQIRSNNPSE RASVERQAIN
APIQGTAADI IRRAMTRMED ALEAKKLTAR MLLQVHDELV FEVPDDEVEA TIPVIAGVME
EAPAPALTLR VPLVVEARAA GNWEEAH