Gene Mext_4017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4017 
SymbolrpoB 
ID5832360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4460587 
End bp4464717 
Gene Length4131 bp 
Protein Length1376 aa 
Translation table11 
GC content66% 
IMG OID641369808 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_001641458 
Protein GI163853415 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.563725 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAATA CGCTGGTCGG TCGCAAGCGC ATTCGCAAGT TCTTCGGTAA GATCCGGGAA 
GTCGCCGAGA TGCCGAACCT CATCGAGGTT CAAAAGGCGT CCTACGACCA GTTCCTGATG
GTGGACGAGC CCGAGGGCGG GCGTGCCGAC GAGGGCCTGC AGAGCGTCTT CAAGTCGGTC
TTCCCGATCT CCGACTTCGC CTCGACGGCG CTGCTCGAGT TCGTGCGGTA CACCTTCGAA
CAGCCGAAAT ACGACGTCGA CGAGTGCCGC CAGCGCGGCA TCACCTTCGC GGCCCCGCTC
AAGGTGACGC TGCGGCTCAT CGTGTTCGAT GTGGACCCCG ATACCGGCGC CAAGTCGGTC
AAGGACATCA AGGAGCAGGA TGTCTACATG GGCGACATGC CGCTCATGAC GGACAACGGC
ACCTTCATCG TCAACGGCAC CGAGCGCGTC ATCGTCTCGC AGATGCACCG CTCGCCGGGC
GTGTTCTTCG ACCACGACAA GGGCAAGACC CACTCGTCCG GCAAGCTGCT GTTCGCCGCC
CGCATCATCC CCTACCGGGG CTCCTGGCTC GACGTCGAGT TCGACGCCAA GGACATCGTG
CACGTGCGTA TCGACCGCAA GCGCAAGCTG CCGGTCACCT CGCTGCTGTT CGCGCTCGGC
CTCGACGGCG AAGAGATCCT CTCGACCTTC TACAACCGCG TCGCCTACCA GCGTGACGGG
GCGGATTGGC GCGTGCCGTT CGACGCCGAG CGCCTCAAGG GATTCAAGGC CTCGGTCGAT
CTCATCGACG CCGATTCCGG CGAGGTCGTG CTGGAGGCGG GCAAGAAGCT CAACGCCCGC
AACGCGCGTC TGATCGGCGA GAAGGGCACC AAGTTCCTGC GCGCCGCGGA CGAGGATCTG
ATCGGCCAGT ACATCGCCGA AGACCTCGTC AACATGAAGA CGGGCGAGAT CTGGGCCGAG
GCCGGCGACG AGATCTCCGA GAAGCTCCTC AAGAGCCTCG ACGACGTCGG CGTCACCGAG
CTGCCCGTGC TCGACATCGA CCACGTCAAT GTCGGCCCCT ACATCCGCAA CACGCTGGCG
GTGGACAAGA ACTCCGCCCG CGAGGGCGCG CTGTTCGACA TCTACCGCGT CATGCGCCCC
GGCGAGCCGC CGACGCTCGA CACCGCTGAG GCGATGTTCC ACTCGCTGTT CTTCGATTCC
GAGCGCTACG ACCTCTCGGC GGTCGGCCGC GTGAAGATGA ACATGCGCCT CGACCTCGAC
GCCGCCGACA CCGTGCGGAC GCTGCGTCGC GAGGACATGC TCGCGGTCGT CAAGGCGCTG
GTCGACCTGC GCGACGGCAA GGGCGAGATC GACGACATCG ACCACCTCGG CAACCGCCGT
GTCCGCTCGG TCGGCGAGCT GATGGAGAAC CAGTACCGTC TGGGCCTCCT GCGGATGGAG
CGCGCCATCA AGGAGCGCAT GTCCTCGGTC GATATCGACA CGGTGATGCC GCAGGATCTC
ATCAACGCGA AGCCCGCCGC GGCCGCGGTG CGCGAGTTCT TCGGCTCGTC GCAGCTCTCG
CAGTTCATGG ACCAGACCAA CCCGCTGTCG GAAGTGACGC ACAAGCGCCG CCTCTCGGCG
CTTGGCCCGG GCGGTCTGAC CCGCGAGCGC GCCGGCTTCG AGGTGCGCGA CGTGCACCCG
ACCCATTACG GCCGCATCTG CCCGATCGAG ACCCCGGAAG GCCCGAACAT CGGCCTGATC
AACTCGCTGG CGACCTTCGC CCGCGTGAAC AAGTACGGCT TCATCGAGAC GCCGTTCCGC
CGGGTGAAGG ACGGCGTGGT GACCGACGAG GTCGCCTACC TCTCCGCCAT GGAAGAGGCG
AAGTACTACG TCGCCCAGGC CAATGCCGGC ATGGACGAGG GCCGCAAGCT GACGGACGAC
CTCGTGGTCT GCCGCCGCGC GGGTGAGGTC ATCGTGGTCG CCCCCGACCG CGTCGATCTT
ATGGACGTGT CGCCGAAGCA GCTCGTCTCG GTCGCCGCGG CGCTGATCCC GTTCCTCGAG
AACGACGACG CCAACCGCGC GCTGATGGGC TCGAACATGC AGCGCCAGGC GGTGCCGCTG
GTTCGCGCCG ACGCGCCCTT CGTCGGCACC GGCATGGAGG CCGTGGTCGC CCGCGACTCG
GGTGCCGCCA TCGCCGCCCG TCGCTCCGGC ATCGTCGATC AGGTGGACGC CACCCGTATC
GTCATCCGCG CCTCGGAAGA GACCGACCCG ACCAAGCCCG GCGTCGACAT CTACCGCCTG
CAGAAGTTCC AGCGCTCCAA CCAGTCGACC TGCATCACGC AGAAGCCGCT GGTGCGCGTC
GGCGAGCCGG TGAAGAAGGG CGAGATCATC GCCGACGGTC CCTCGACCGA GTTCGGTGAG
CTCGCGCTCG GCCGCAACGT GCTCGTCGCG TTCATGCCGT GGAACGGCTA CAACTTCGAG
GACTCGATCC TGCTCTCCGA GCGGATCGTG AAGGATGACG TGTTCACCTC GATCCACATC
GAGGAATTCG AGGTGATGGC CCGCGACACC AAGCTGGGTC CGGAGGAGAT CACCCGCGAC
ATCCCGAACG TCTCGGAAGA GGCGCTCAAG AACCTCGACG AGGCCGGCAT CGTCTATATC
GGCGCCGAGG TGCATGCCGG CGACATCCTC GTCGGCAAGA TCACGCCGAA GGGTGAGAGC
CCGATGACGC CGGAGGAGAA GCTCCTGCGC GCCATCTTCG GCGAGAAGGC CTCGGACGTG
CGCGACACCT CGCTCCGGGT TCCCCCGGGC GTGACCGGCA CGATCGTCGA GGTGCGGGTG
TTCAACCGCC ACGGCGTCGA CAAGGACGAG CGCGCCCAGG CGATCGAGCG TGAGGAGATC
GAGCGTCTCG CCAAGGACCG CGACGACGAG CAGACCATCC TCGACCGCAA CACCTATGCC
CGCTTGGCCG AGGTGCTGAT CGGTCAGTCG CCGATCGCCG GCCCGAAGGG CTTCCGCAAG
GACACCACGC TGACCCGCGA GATCATCAGC GAGTATCCCC GCTCGCAATG GTGGCAGTTT
GCCGTCGTCG ACGACCGTAT GATGACGGAA ATCGAGGCGA TGCAGAAGCA GTACGACGAG
TCGAAGAAGC GCCTCGAGCA GCGCTTCCTC GACAAGGTCG AGAAGCTGCA GCGCGGCGAC
GAGCTTCCGC CCGGCGTCAT GAAGATGGTC AAGGTCTTCG TGGCGGTGAA GCGCAAGATC
CAGCCCGGCG ACAAGATGGC CGGCCGCCAC GGCAACAAGG GTGTCGTGTC GCGCATCGTG
CCGATCGAGG ACATGCCGTT CCTCGAAGAC GGCACGCATG CCGACATCGT GCTCAACCCG
CTCGGCGTGC CCTCGCGCAT GAATGTCGGC CAGATCCTGG AAACCCATCT GGGCTGGGCG
GCTGCGGGCT TGGGTCGCAA GGTCTCCAAG GCGGTCGATG CCTATCTGAA GAACCAGGAT
ATCGCGCCGC TGCGGGCTGA GATGGAAGCG ATCTACTCGC CGTCCGAACT CGAGGGGCTG
TCCGACGAGG CTCTGGCCGA GGCGGGCAAC AACGTGCGCC GCGGCGTGCC GATGGCCACC
CCGGTGTTCA ACGGCGCCAA GGAAGCCGAC ATCGAGACGA TGCTGGAGAT GGCCGGGCTC
GACCGCTCGG CGCAGTCGAC GCTCTACGAC GGGCGCACCG GCGAGCCCTT CGACCGCAAG
GTCACCATGG GCTACATCTA CATGCTGAAG CTGCACCACC TCGTGGACGA CAAGATCCAC
GCGCGCTCGA TCGGCCCGTA CTCGCTCGTC ACCCAGCAGC CGCTGGGCGG TAAGGCGCAG
TTCGGCGGTC AGCGCTTCGG TGAGATGGAG GTCTGGGCGC TGGAGGCCTA CGGCGCGGCC
TACACGCTGC AGGAGATGCT CACGGTGAAG TCGGACGACG TGGCCGGCCG CACCAAGGTC
TACGAGGCGA TCGTCCGCGG CGACGACACG TTCGAGGCTG GCATCCCCGA GTCCTTCAAC
GTGCTCGTCA AGGAGATGCG CTCGCTCGGC CTCAACGTCG AGCTCACCTC CTCCAAGCAG
CAGCAGGCGG CCAACGACCA GATCGAGCCG CCGGCCGACG CCGCCGAGTA A
 
Protein sequence
MANTLVGRKR IRKFFGKIRE VAEMPNLIEV QKASYDQFLM VDEPEGGRAD EGLQSVFKSV 
FPISDFASTA LLEFVRYTFE QPKYDVDECR QRGITFAAPL KVTLRLIVFD VDPDTGAKSV
KDIKEQDVYM GDMPLMTDNG TFIVNGTERV IVSQMHRSPG VFFDHDKGKT HSSGKLLFAA
RIIPYRGSWL DVEFDAKDIV HVRIDRKRKL PVTSLLFALG LDGEEILSTF YNRVAYQRDG
ADWRVPFDAE RLKGFKASVD LIDADSGEVV LEAGKKLNAR NARLIGEKGT KFLRAADEDL
IGQYIAEDLV NMKTGEIWAE AGDEISEKLL KSLDDVGVTE LPVLDIDHVN VGPYIRNTLA
VDKNSAREGA LFDIYRVMRP GEPPTLDTAE AMFHSLFFDS ERYDLSAVGR VKMNMRLDLD
AADTVRTLRR EDMLAVVKAL VDLRDGKGEI DDIDHLGNRR VRSVGELMEN QYRLGLLRME
RAIKERMSSV DIDTVMPQDL INAKPAAAAV REFFGSSQLS QFMDQTNPLS EVTHKRRLSA
LGPGGLTRER AGFEVRDVHP THYGRICPIE TPEGPNIGLI NSLATFARVN KYGFIETPFR
RVKDGVVTDE VAYLSAMEEA KYYVAQANAG MDEGRKLTDD LVVCRRAGEV IVVAPDRVDL
MDVSPKQLVS VAAALIPFLE NDDANRALMG SNMQRQAVPL VRADAPFVGT GMEAVVARDS
GAAIAARRSG IVDQVDATRI VIRASEETDP TKPGVDIYRL QKFQRSNQST CITQKPLVRV
GEPVKKGEII ADGPSTEFGE LALGRNVLVA FMPWNGYNFE DSILLSERIV KDDVFTSIHI
EEFEVMARDT KLGPEEITRD IPNVSEEALK NLDEAGIVYI GAEVHAGDIL VGKITPKGES
PMTPEEKLLR AIFGEKASDV RDTSLRVPPG VTGTIVEVRV FNRHGVDKDE RAQAIEREEI
ERLAKDRDDE QTILDRNTYA RLAEVLIGQS PIAGPKGFRK DTTLTREIIS EYPRSQWWQF
AVVDDRMMTE IEAMQKQYDE SKKRLEQRFL DKVEKLQRGD ELPPGVMKMV KVFVAVKRKI
QPGDKMAGRH GNKGVVSRIV PIEDMPFLED GTHADIVLNP LGVPSRMNVG QILETHLGWA
AAGLGRKVSK AVDAYLKNQD IAPLRAEMEA IYSPSELEGL SDEALAEAGN NVRRGVPMAT
PVFNGAKEAD IETMLEMAGL DRSAQSTLYD GRTGEPFDRK VTMGYIYMLK LHHLVDDKIH
ARSIGPYSLV TQQPLGGKAQ FGGQRFGEME VWALEAYGAA YTLQEMLTVK SDDVAGRTKV
YEAIVRGDDT FEAGIPESFN VLVKEMRSLG LNVELTSSKQ QQAANDQIEP PADAAE