Gene Mext_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1048 
Symbol 
ID5831150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1142585 
End bp1146106 
Gene Length3522 bp 
Protein Length1173 aa 
Translation table11 
GC content69% 
IMG OID641366843 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_001638524 
Protein GI163850481 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.195384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.00537835 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCGCGCA CACTCAAAGA GGTCGGCTTC GTCCACCTCC ATGTCCACTC GTCCTACTCG 
CTGCTCGAAG GCGGGGTGAA GGCGGGTGAT CTCGTCAAGG CGGCCGCGGC CGACCGGCAG
CCCGCGCTCG CGCTCACCGA CACCAACAAC CTGTTCGGGG CGCTGGAATT CTCCGAATAC
GCCGCCAAGG CGGGGATCCA GCCGATCGCG GGCCTGCAGC TCACGATCTG CTTCGAGGCG
CCGGACCCGA TGGCGCGCCT GCCCCAGGCC GGGTGCGCCA ACATCGTGCT GCTCGCCCAG
GACGAGACCG GCTACGGCAA CCTTCTGCGT CTGGGCAGCC GGGCGCATTT CGATGGTCCG
CTCGGCGCCG CGCCCAACCT CGCCGTCTCG GCACTGGAGG GCAACGTCGA GGGGCTGATC
GGCCTCACCG GCGGGCTCTC GGGGCCGCTC GACACCAGCC TGCGTGCCGG CCGCGTCGAT
CAGGCCGCGC GCCGCCTGGA GATCCTGAAA GGGGCGTTCG GCGAGGAGCA TCTCTACGTC
GAGATCCAGC GCCACGGCCT CGACGATGAG CGCGCCGTCG AGCGCGAATT GCTGCGGCTC
GCGGACACGA ACGGTCTCGG CATCGTCGCC ACCAACGAGC CGTTCTTCGC CAAGCCCGGC
GATTACGACG CGCACGACGC GCTGCTCGCC ATCGCTGAGG GCCGCCTCGT CTCCGACGAG
CGCCGCCGCC GGCTCACCCC GCGCCACGCC TTCACCACGC GCGCCGCGAT GATGGAGCTG
TTCCGCGATC TGCCGGACGC GCTCTCGGCC TCGGTCGAGA TCGCCATGCG CTGCTCTTAC
CGGGCGCGCA CCCGCAAACC CATCCTGCCG AACTTCGCCA GCGTCGCCGC GGGCGAGGCG
CCGCCCATGG CCGACGCACT GGCCGAGGCC GCCGAGACGC CGGTTCAGGC AGTGGCCGCC
GACGAGCCGA CCGAGCTGTG CCGGCAAGCC GAGGCCGGGC TGGAGCTGCG CTTGAAGCAG
CACGGCACCG CGCCGGGCTT CACGGAGGAG GATTACCGCG CGCGGCTGAA GTTCGAGCTC
GACGTCATCG TCAAGATGAA GTTCCCGGGC TACTTCCTGA TCGTCTCGGA CTTCATCAAG
TGGGCCAAGG ACCACGACAT CCCCGTGGGA CCGGGCCGCG GCTCGGGCGC GGGCTCGCTG
GTGGCGTGGT CGCTCCTGAT CACCGACCTC GACCCGCTGC GCTTCGGCCT GCTGTTCGAG
CGCTTCCTCA ACCCCGAACG CGTCTCGATG CCGGACTTCG ACATCGACTT CTGCGTCGAG
GGCCGCGAAC GGGTAATCCG CTACGTGCAG CAGCGCTACG GCGAGCGGCA GGTCGGGCAG
ATCATCACCT TCGGTACGTT GCTCGCCCGC GGCGTGCTGC GCGACGTCGG CCGCGTGCTG
GAAATGCCCT ACGGGCAGGT CGACAAGCTG ACCAAGCTCG TACCGCAGAA CCCGGCCAAC
CCCGTGACGC TGGCGCAAGC CATCGAGGGC GAGCCCAAGC TCCAGCAGGC GATCGAGGAG
GAGCCGATCG TCGCCCGTCT CATCGAGATC TCGAAGAAGC TGGAGGGCCT GCACCGCCAC
GCCTCGACCC ACGCCGCCGG CGTGGTGATC GGCGACCGGC CGCTCGAAGA GCTGGTGCCG
CTCTACCGAG ACCCGAAGAC GGGCATGCGG GTGACCCAGT TCAACATGAA GTGGGTCGAG
CAGGCCGGCC TCGTGAAGTT CGACTTCCTC GGCCTCAAGA CGCTCACCAT GCTGCGGTGC
TGCACGGATC TGCTCAAGCA GCGCGGCATC GAGGTCGATC TCGCCCAGAT CCCGCTCGAC
GACAAGAAGA CCTACGAGCC GATGGGCCGC GGCGAGACCG TCGCGGTGTT CCAGGTGGAA
TCCGCCGGCA TGCGCAAGGC GCTCTGCGAG ATGCAGGCCG ACCGCCTGGA GGACATCATC
GCGCTGGTGG CGCTCTACCG GCCGGGCCCG ATGGCCAACA TCCCGGTCTA TTGCGAGCGC
AAGCTCGGGC GCGACGCCGG CAACGAGGCG AGCTGGTACT CGCACCCGAT GCTGGAGCCG
ATCCTGAAGG AGACCTTCGG GATCATCGTC TACCAAGAGC AGGTGATGGA GGTCGCCAAG
GTTCTCGCCG GCTACACGCT GGGCGAGGCC GACATGCTCC GCCGCGCCAT GGGTAAGAAG
ATCAAGGCGG AGATGGACGC CCAGCGCGAC CGCTTCGTGA AGGGCTGCAC CGAGAACGGC
CTGACCAAGG CCAAGGCCGA CGAGATCTTC GACCTGCTCG CCAAGTTCGC CGACTACGGC
TTCAACAAGT CCCACGCGGC CGCCTACGCG CTGGTGACCT ACCAGACTGC CTATCTGAAG
GCGAACCATC CGGTCGAGTT CCTGGCCGCG GCCATGACCC TCGACATCGA CAACACCGAC
AAGCTCGCCG AATTCCGACA GGACGCGCAG CGCCTGAAGA TCACGGTCGA GCCCCCGTCG
GTGAACACCT CGGGCGTGGT GTTCGACGTG CGGGAGGGAC GCATCCTCTA CGCGCTTGCC
GCGATCAAGG GCGTCGGCCG CTCGGCGGTG GAGTCGATCG TGGCCGCGCG CGGCGACAAG
CCGTTCAAGG ACCTCGCCTG CTTCGCCCGG CGCCTGAACC CGCGCCACGT CAACAAGCGC
ACGCTGGAAA ACCTGATCGC AGCTGGGGCG CTGGACTGCA TCGAGCCCGA TCGCGCGCGG
GCCTGGGCGG CGGTCGAGCC GATGATGAAG ATGGCCCAGG GTGCGGCCGA GGCCGAGACC
TCGGGCATCA CCGACATGTT CGGCGGCGTC GCCTCGGTGG ATGTGGCTTT ACGCATCCCG
CCGCACGAGT TCTGGACCCA CACCGACAAG CTCAAGCGCG AATGCGACGC CATCGGCTTC
TTCCTCTCGG GCCATCCGCT CGACGAGTAC GGGCCGATCC TCGAAAAGCT GCGGGTCCAG
TCCTGGGCCG ATTTCTGCCG GGCGGTGCGG GCGGGCACGG CCAGCGTCGG CCGGATCGCG
GCCTCGGTGC TCGACCGCTC GGAGCGCCGC ACCAAGAGCG GTAACAAGCT CGGCATCGTC
ATTCTCTCGG ACCAGACCGG CCACTTCGAG GCGATCATCT TCTCCGAGGG GCTGAACCAG
TACCGCGACA TCCTCGAGCC CGGCCGCCCG CTGGTGCTGA CGATCCAAGC CAACCTGGAG
GGCGAGGACG TGCGCGCTCG GATCACCACC GCCGAGCCCC TGGATCAGGC CGCCGCCCGC
CACCAGAAGG GCATGCGCAT CTTCCTGCGC GACGACCGCG CCTTGAGTTC GGTGCAGCAG
CGCCTGACCC TGCGCGGCGA GGGCGAGGTT TCGCTGATCC TGATCCTCGA TGGCGGCGAG
CGCGAGGTCG AGGTGCGGCT GAAGGATCGC TACCAGGCGA CCCCGCAGGT CGCGGGTGCG
CTGCGTGCCG TGCCGGGGGT GGTGCAGGTC GAGGTGAATT GA
 
Protein sequence
MARTLKEVGF VHLHVHSSYS LLEGGVKAGD LVKAAAADRQ PALALTDTNN LFGALEFSEY 
AAKAGIQPIA GLQLTICFEA PDPMARLPQA GCANIVLLAQ DETGYGNLLR LGSRAHFDGP
LGAAPNLAVS ALEGNVEGLI GLTGGLSGPL DTSLRAGRVD QAARRLEILK GAFGEEHLYV
EIQRHGLDDE RAVERELLRL ADTNGLGIVA TNEPFFAKPG DYDAHDALLA IAEGRLVSDE
RRRRLTPRHA FTTRAAMMEL FRDLPDALSA SVEIAMRCSY RARTRKPILP NFASVAAGEA
PPMADALAEA AETPVQAVAA DEPTELCRQA EAGLELRLKQ HGTAPGFTEE DYRARLKFEL
DVIVKMKFPG YFLIVSDFIK WAKDHDIPVG PGRGSGAGSL VAWSLLITDL DPLRFGLLFE
RFLNPERVSM PDFDIDFCVE GRERVIRYVQ QRYGERQVGQ IITFGTLLAR GVLRDVGRVL
EMPYGQVDKL TKLVPQNPAN PVTLAQAIEG EPKLQQAIEE EPIVARLIEI SKKLEGLHRH
ASTHAAGVVI GDRPLEELVP LYRDPKTGMR VTQFNMKWVE QAGLVKFDFL GLKTLTMLRC
CTDLLKQRGI EVDLAQIPLD DKKTYEPMGR GETVAVFQVE SAGMRKALCE MQADRLEDII
ALVALYRPGP MANIPVYCER KLGRDAGNEA SWYSHPMLEP ILKETFGIIV YQEQVMEVAK
VLAGYTLGEA DMLRRAMGKK IKAEMDAQRD RFVKGCTENG LTKAKADEIF DLLAKFADYG
FNKSHAAAYA LVTYQTAYLK ANHPVEFLAA AMTLDIDNTD KLAEFRQDAQ RLKITVEPPS
VNTSGVVFDV REGRILYALA AIKGVGRSAV ESIVAARGDK PFKDLACFAR RLNPRHVNKR
TLENLIAAGA LDCIEPDRAR AWAAVEPMMK MAQGAAEAET SGITDMFGGV ASVDVALRIP
PHEFWTHTDK LKRECDAIGF FLSGHPLDEY GPILEKLRVQ SWADFCRAVR AGTASVGRIA
ASVLDRSERR TKSGNKLGIV ILSDQTGHFE AIIFSEGLNQ YRDILEPGRP LVLTIQANLE
GEDVRARITT AEPLDQAAAR HQKGMRIFLR DDRALSSVQQ RLTLRGEGEV SLILILDGGE
REVEVRLKDR YQATPQVAGA LRAVPGVVQV EVN