Gene Mchl_5036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_5036 
Symbol 
ID7113639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp5382631 
End bp5385846 
Gene Length3216 bp 
Protein Length1071 aa 
Translation table11 
GC content71% 
IMG OID643527730 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_002423729 
Protein GI218532913 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.38358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.080697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTCA CCATGAGCAC GCCGGAGCTT CAGGTCACGA CGCATTTCTC CTTCCTGCGC 
GGCGCATCGA GCCCGGAGGA GCTGTTTTCC GCAGCGAGCT TGCTCGGCAT CCCGGCGCTC
GGCGTCACCG ATCACGGCTC GCTGGCCGGA ATGGTCCGCG CGCATCAGGC GGCGAAGGTC
ACGGGCATGC GCCTCGTCGT CGGCTGCCGG CTCGACCTCG ACGATCTGTC TTCGCCGCTG
CTCGTCTACC CCACCGACCG GGCCGCCTAC GGCCGGCTCT GCCGCCTGCT CAGCCTGGGC
AAGGCCAAGG GCGGCAAGGG CCGCTGCCGC CTGACTTGGC GCGATCTCGA AGAGTGGCAG
GAGGGGTTTT TTGCAATTTT GCTCGCCGAG CAGCCGGGAT CGGCGCTTCT GAACGACCTC
GCGCGCTTCA AGGGCCTGTT CGGCGCCCGC GCGTCCTGCG CGCTGACGCG CCGCTTCCGC
CCCGACGACG CCGACCGGCT CGACGCTTTG GCCGCCGCCG CCCGCGCCGC GCGGGTGCCC
ACCGTCGCCA CCGGCGACAT CCTCTACCAT GTCGCGGGCC GGCGCCGCCT GCAGGACGTG
GTGACCTGCA TCCGGCTCGG GCTCACCATC GACCGGGCCG GGTTTGCCAA GGAGCGCCAC
GCCGACCGCT TCCTCAAACC CCCGGCCGAG ACCGCGCGCC TGTTCGCGCG CTTCCCAGAG
GCGCTGGCGC GGGCGGGCGA GATCGCGGCA GCCTGCCGCT TCTCGCTGGA CGACCTCGCC
TACACCTACC CCACGGAAAG CCGCGAGGAC GGGCTCTCGC CGCAGGAACG CCTGGACGTC
CTGACTTGGG CCGGCGCGGC GCGGCGCTAT CCGGCCGGCG TGCCGGAGGC GGTGACGCGC
CAATTGCACC ACGAACTCGA CCTGATCGGG CGGCTGGCCT ACGCCCCCTA CTTCCTCACG
GTCGAATCGA TCGTCGCCTT CGCCAATCGC GAAAAGATCC TGTGCCAGGG GCGCGGCTCG
GCGGCCAATT CCGCGGTCTG CTTCTGTCTC GGCATCACCT CGATCGATCC GACGCGGCAG
AACCTGCTGT TCGAGCGCTT CGTCTCGGAG GCGCGCCGCG AGCCGCCCGA CATCGACGTC
GATTTCGAGC ACGAGCGCCG CGAGGAGGTG ATCCAGTGGA TCTTTCAGAC CTACGGCCGC
CACCGCTCCG CGCTCACCGC CATCGTCAGC CGCTTCCGCT CGCGGGGCGC CCTGCGCGAG
GTCGGCAAGG TGATGGGCCT GCCCGAGGAC GTGACCGGCG CCATCAACCG CATGACCTGG
TCTTGGAGCA GCGAGGGCGT CGGCGAGCGC GAACTGCGCG AACTCAACCT CAACCCCGAC
GACCGGCGCC TGCGCCTGAC GCTGGCGATC GCCCGCGAGC TGATCGGCAC GCCGCGCCAC
CTCTCCCAGC ATCCCGGCGG CTTCGTCCTG ACCCTCGACC GGCTCGACGA ACTGGTGCCG
GTCGAGCCGG CGGCGATGGC CGACCGTCAG GTCATCGAGT GGGACAAGGA TGACATCGAC
GCCCTCAAGT TCATGAAGGT CGATGTGCTC GGGCTCGGCA TGCTCGGCTG CCTGCGCCGC
GCCTTCGACC TGTTGGCCGA GGTCAAGAAC GACCCGCACG ACCTCGCCTC GATCCCCTCA
AAGGATGAGC CGACCTTCGC GATGATCCGG CGCGCCGACA CCCTCGGCGT CTTCCAGATC
GAGAGCCGCG CGCAGATGGC GATGCTACCG CGGATGAAGC CGAAAGAGTT CTACGACCTC
GTCATCGAGG TGGCGATCGT GCGCCCCGGC CCGATCCAGG GCGACATGGT CCACCCCTAT
CTGCGCCGTC GCGAGGGCCT CGAAGAGGTG ACCTACCCGA CGCCCGAGTT GAAGGCGGTG
CTGGAGAAGA CGCTCGGCGT GCCGCTGTTC CAGGAGCAGG CGATGCAGGT GGCGATGGTC
GGCGCCGGCT TCTCGGCGAC GGAGGCCGAC GAACTGCGCC GCTCCATGGC GACCTTTAAG
TTCACGGGCG GCGTCCACCG CTTCCAGAAC CGCCTCGTCG AGGGCATGGT CGCCAACGGC
TACGCCCGCG ATTTTGCCGA GCGCACCTTC AAGCAGCTCG AGGGCTTCGG CTCCTACGGC
TTCCCCGAGA GCCACGCCGC CTCCTTCGCG CTCCTGGCCT ATGCCTCGTC GTGGATGAAG
TGCCACCATC CGGATGTGTT CTGCGCCGCG CTGCTGAACG CGCAGCCGAT GGGCTTCTAC
GCCCCCGCCC AGATCGTGCG CGACGCCCGC GCCCACGGCG TCGAGATCCG CCCGCTCGAC
GTGAACTTTT CGCAGTGGGA CTGCACCCTG GAATCGCTTG GCTCGGGCCG CAAACTGTTC
GCGGTGCGCC TGGGGCTACG CCTCGCGAGC GGCTTGGCCG AACGGGACGG GCATCGTCTC
GTGGCGGCAC GCGGCGGGCG CCCCTTCGCC TCCCTGCCCG AACTGGCGGA CCGGGCCGGC
ATCCCGGCAG CCAGCCTGAC CTGCCTCGTG CGGGCCGATG CGTTCCGCTC GCTCGGCCTC
AACCGGCGCG AGGCCGCCTG GGCGGTCAAG GCCCTGCGGC CCGATCCCCT CCCCCTCTTC
GCCGCCCTGT CCGCACCGAT ACAGGCCGAG CCGGCGCAGA CGGCGGAGCC CGCCGTATCG
CTGCCGGCCA TGAGCGCGGG CGGGGAGGTC GTCGCCGATT ACTGCGCCAA CGGCCTCAGC
CTGCGCGCCC ACCCCCTCGC CTTCCTGCGC GAGACCCTTA CCACTCTGGG GGCCCGGCCC
TGCGCGGCGC TGGAGCGGGT GGGCAACGGG GGGGCGATCG TCGTCGCCGG GATCGTGCTG
ATGCGCCAGC GGCCGGGCTC TGCCAAGGGC ACGATGTTCA TGACCCTGGA GGACGAGACC
GGCATCGCCA ATCTGATCGT CCGGCCCGAG CTGTTCGACC GGCAGCGCCG GGTCGTGCTC
GGCGCCCGGC TGATGGCCTG CCGCGGACGG GTGCAGCGGG TCGGCGACGT GATCCACCTC
GTGGCCGTGG AGCTGTTCGA CCGCTCCGGC CTGCTGCGGC GGATCGGCGA GGAGCCGATC
GCCCTGCGCA CCGGCCGCGG CGACGAGACC GGCCAGAGCG TCCGGCCCGA TCCGCGTGAG
GCCGCGCTGC CGGTGCGGGC ACGGAATTTC CGGTAG
 
Protein sequence
MQLTMSTPEL QVTTHFSFLR GASSPEELFS AASLLGIPAL GVTDHGSLAG MVRAHQAAKV 
TGMRLVVGCR LDLDDLSSPL LVYPTDRAAY GRLCRLLSLG KAKGGKGRCR LTWRDLEEWQ
EGFFAILLAE QPGSALLNDL ARFKGLFGAR ASCALTRRFR PDDADRLDAL AAAARAARVP
TVATGDILYH VAGRRRLQDV VTCIRLGLTI DRAGFAKERH ADRFLKPPAE TARLFARFPE
ALARAGEIAA ACRFSLDDLA YTYPTESRED GLSPQERLDV LTWAGAARRY PAGVPEAVTR
QLHHELDLIG RLAYAPYFLT VESIVAFANR EKILCQGRGS AANSAVCFCL GITSIDPTRQ
NLLFERFVSE ARREPPDIDV DFEHERREEV IQWIFQTYGR HRSALTAIVS RFRSRGALRE
VGKVMGLPED VTGAINRMTW SWSSEGVGER ELRELNLNPD DRRLRLTLAI ARELIGTPRH
LSQHPGGFVL TLDRLDELVP VEPAAMADRQ VIEWDKDDID ALKFMKVDVL GLGMLGCLRR
AFDLLAEVKN DPHDLASIPS KDEPTFAMIR RADTLGVFQI ESRAQMAMLP RMKPKEFYDL
VIEVAIVRPG PIQGDMVHPY LRRREGLEEV TYPTPELKAV LEKTLGVPLF QEQAMQVAMV
GAGFSATEAD ELRRSMATFK FTGGVHRFQN RLVEGMVANG YARDFAERTF KQLEGFGSYG
FPESHAASFA LLAYASSWMK CHHPDVFCAA LLNAQPMGFY APAQIVRDAR AHGVEIRPLD
VNFSQWDCTL ESLGSGRKLF AVRLGLRLAS GLAERDGHRL VAARGGRPFA SLPELADRAG
IPAASLTCLV RADAFRSLGL NRREAAWAVK ALRPDPLPLF AALSAPIQAE PAQTAEPAVS
LPAMSAGGEV VADYCANGLS LRAHPLAFLR ETLTTLGARP CAALERVGNG GAIVVAGIVL
MRQRPGSAKG TMFMTLEDET GIANLIVRPE LFDRQRRVVL GARLMACRGR VQRVGDVIHL
VAVELFDRSG LLRRIGEEPI ALRTGRGDET GQSVRPDPRE AALPVRARNF R