Gene GM21_3335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3335 
SymbolrpoB 
ID8138697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3860662 
End bp3864777 
Gene Length4116 bp 
Protein Length1371 aa 
Translation table11 
GC content62% 
IMG OID644870948 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_003023118 
Protein GI253701929 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.000000341705 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTTATT CAATCGCGAA TAACCCCCTG TTGCGCAAGA ACTTCGCCAA GATCCACAAG 
ATCATCGACA TACCGAACCT CATCGACATC CAGAAAAATT CCTACAAGAG ATTCCTCCAA
CTCGACACAC CCGTAGACGC ACGCAAGAAT TCCGGCCTGG AGGCCGTTTT CCGCAGCGTC
TTCCCGATCA GGGACTTCAG CGACACCGCT TCGCTTGAGT ACGTTTCGTA TTCGCTCGGC
GCGCCGAAGT ACGACGTGGA GGAGTGCCAC CAAAGGGGCA TGACCTTCGC GGCCCCGATG
AAGGTGAAGG TGAGGCTGGT GGTGTGGGAT GTGGCGAAGG ACCCCGGTAC CCGATCGATC
CGCGACATCA AGGAGCAGGA GGTTTATTTC GGCGAAATCC CGCTGATGAC CGACAACGGG
ACCTTTATCA TAAACGGCAC CGAGCGCGTC ATCGTGAGCC AGCTGCACCG CTCTCCGGGC
GTTTTCTACG ACCACGACAA GGGGAAGACC CACTCCTCCG GGAAGGTGCT CTACTCCGCT
CGCGTGATTC CGTACCGCGG CTCGTGGCTT GACTTTGAAT TTGACCATAA AGACATCCTT
TATGTTCGCA TAGACAGACG GCGCAAGATG CCGGCCACCG TGCTCCTCAA GGCGCTCGGC
TACTCCAACG ACGCGCTGAT CAACTATTTC TACAAGTCCG AGGACGTCAA GGTCGGCGAC
AACGGCTCCA TGACCAAGAT CGCGGACGCG GAACTGCTCT CCAACCAGAA GGCCACCGCG
GACATCGTGG ACCCGGCTAC CGGCGAGGTG ATCCTCAAGG CGAACCGCAA GTTCACCAAG
GCCGCCATCC GCAAGATGTC CGAGCATGGG ATCAAGGAGA TCCCCATCTC CGAGGAGGAG
GTGGTAGGGA AGGTCGCCTC GCACGACATC TACGACCCGG CCACCGGCGA GATCATCGTT
GAGTGCAACG AGGAATTAAC GCAGGCCAAA CTCGAGGAGA TCATCCAGAA GGGGATCACC
ACCTTCCAGG TGCTTTTCAT CGACAACCTG CACGTCACCT CCAGCTTCCG CGACACCATC
CTGATCGACA AGATCGGCTC CACCGACGAG GCGCTGATCG AGATCTACCG CCGTCTGCGT
CCGGGCGACC CGCCGACACT CAAGAGCGCG CTGGTTCTCT TCGAGAACCT GTTCTTCAAC
GCGGAGCGCT ACGACCTCTC CGCCGTCGGC CGCCTGAAGC TCAACTACAA GCTGGGCGTC
GACGTGCCGC TTGACTGCAT GACGCTCACC CGCGAGGACA TCCTCGAGGT GGTGCGCTAC
CTGATCGAGC TGAAAAACGG CAAGGGGAAC ATCGACGACA TCGACCACCT GGGCAACCGC
CGTGTTCGCG CCGTGGGCGA GCTTCTGGAA AACCAGTACC GCATCGGCCT GGTCAGGATG
GAGCGCGCCA TCAAGGAGCG CATGAGCCTG CAGGAAGTCG AGAACCTGAT GCCGCACGAC
CTGATCAACT CCAAGCCGGT CTCCGCGGTG GTGAAGGAGT TCTTCGGCTC CTCGCAGCTC
TCCCAGTTCA TGGACCAGAC CAACCCGCTC TCCGAGGTCA CGCACAAGCG TCGTCTCTCG
GCCCTGGGAC CGGGGGGCTT GACCCGCGAG CGCGCGGGCT TCGAGGTCCG CGACGTTCAT
CCGACCCACT ACGGCCGCGT CTGCCCGATC GAGACCCCTG AGGGTCCGAA CATCGGTCTC
ATCGCGTCGC TCTCCACCTA CGCCAGGATC AACGAACACG GCTTCGTCGA GACGCCGTAC
CGCGTGGTCA AGGAAGGGCG CGTCACCGAC GAGGTCCGCT TCTTCTCCGC TTTGGAGGAG
GAAGGTCACG CCATCGCCCA GGCCAACGCC GAGATCGACG AGACCGGGCG CTTCGCCGCC
GACTACATCT CGGCCAGAAA GTCCGGCGAG TTCGTCCTCG TGGGACGCGA CGAGCTGGAA
CTCATGGACG TGGCGCCGAT GCAGCTTGTC TCCGTCGCGG CATCGCTGAT CCCGTTCCTC
GAGAACGACG ACGCGAACCG CGCACTGATG GGCTCCAACA TGCAGCGCCA GGCAGTGCCG
CTTTTGCGCG CCGACTCCCC GCTGGTAGGT ACCGGCATGG AGAGGGTCGT GGCGAGGGAC
TCCGGGGTCT CCCTGGTTGC CCGCCACAAC GGCGTGGTCG AATCGGTCGA CGCCTCCAGG
ATCGTCGTGA AGATCGACGA GGATCAGTAC GACGCGACCG GCACCGGCGT CGACATCTAC
AACCTGATCA AGTTCGCCCG TTCCAACCAG AACACCTGCA TCAACCAGAG GCCGCTCGTC
AAGGTGGGCG ACCACGTCAC CGCCGGCGAC ATCATCGCCG ACGGCCCCTC CACCGACATG
GGGGAACTGG CTCTGGGACA AAACGTGCTG ATCGCGTTCA TGCCGTGGGG CGGCTACAAC
TACGAGGACT CCATCCTCAT CTCGGAGCGC CTGGTAAAAG ACGACCGCTA CACCTCGATC
CACATCGAGG AGTTCGAGGC GGTGGCGCGC GACACCAAGC TCGGCAAGGA GGAGATCACC
TCCGACATTC CGAACCTGGG CGAAGAGACC CTGAAAGACC TGGACGAGTC CGGCATCATC
AGGATCGGCG CCGAGGTCCG TCCGGGCGAC ATCCTGGTCG GCAAGATCAC CCCGAAGGGC
GAGACCCAGC TTTCCCCTGA AGAGAAACTC TTGCGCGCCA TCTTCGGCGA GAAGGCGGGC
GACGTCCGCG ACACCTCGCT CAGGGTGCCG CCGGGGGTCG AAGGGACCGT CATCGGCGCC
AAGATCTTCT CGCGTAAAGG CGCCGACAAG GACGCCCGTA CCGAGATCAT CGAGAAGGCC
GAGGAGATGC GTCTCAGGAA AGACGAGCAG GACGAGATCC GGATCATCCG CGATTCGGCT
ATCGGAAAAC TGAAGAAGCT CTTGGTTGGG AAAACCGCCG CAGTGAAGAT AGAAGGGACC
GACGGCAAGG TGCTCATCCC GAAAGGGGCG GCCATCACCG AGGAGATGCT GAAATCGTTC
TCCATGGACC GCTGGGACGA GATCTCCATC GCCGACGACG ACACCGTGGA CGAGAAGGTA
GCGCAGACGC TCTCCACGCT GAACCAGCAG ATCGACATCA TCAAGTACGT CTTCGACGAC
AAGGTGCAGA AGCTGCGCCG CGGCGACGAT CTCCCCCCGG GCGTCATCAA GATGGTCAAG
GTGTACATCG CCATCAAGAG GAAGCTGCAG GTGGGCGACA AGATGGCAGG ACGTCACGGT
AACAAAGGTG TCGTCTCCAG GATCCTGCCG GAAGAGGATA TGCCGTACAT GGAGGACGGG
CGTCCCGTCG AGATCGTGCT GAACCCGCTG GGCGTACCTT CCCGTATGAA CGTCGGGCAG
ATCCTTGAGA TGCACCTCGG CTGGGCCGCC AAGGGGCTCG GCTGGAAGAT AGAGGAATTC
CTCGACAAGA ACGCCCCGCA CGACGAGATC AAGAGGTTCC TGAAGGGGGC TTACAACAAC
CCGGACATGG ACCGCTTCCT CGACAAGCTG GAGGGGGAGG AGCTCCTCAA CGTGGCGAAG
CGCCTCAAGC GCGGCGTCCC GATGTCGTCG CCGGTCTTCG AGGGGGCGAG CGAGGAATCG
ATCCAGTCGA TGCTGAGCCA CGCCGGCTTC AGCACCACCG GGCAGGTCAC CCTATTCGAC
GGCAAGAGCG GCGACAAGTT CATGCATCAG GTCACCGTCG GCATCATGTA CTTCCTGAAG
CTGCACCATC TGGTCGACGA CAAGATCCAC GCGAGGTCCA TCGGGCCCTA CTCGCTGGTC
ACCCAGCAGC CGCTGGGGGG CAAGGCGCAG TTCGGCGGGC AGAGGCTCGG GGAGATGGAG
GTCTGGGCGA TGGAGGCCTA CGGCGCGGCG TACGCGCTGC AGGAATTCCT CACCGTCAAG
TCCGACGACG TGGCCGGCCG CACCAGGATG TACGAGGCGA TCGTCAAAGG AAAGCACACG
CTGGAGCCCG GTCTGCCGGA ATCGTTCAAC GTCCTCATCA AGGAACTCCA GTCCCTTGGC
CTCGACGTGG AACTTCTGGA AGGCGACGAG GACTAG
 
Protein sequence
MAYSIANNPL LRKNFAKIHK IIDIPNLIDI QKNSYKRFLQ LDTPVDARKN SGLEAVFRSV 
FPIRDFSDTA SLEYVSYSLG APKYDVEECH QRGMTFAAPM KVKVRLVVWD VAKDPGTRSI
RDIKEQEVYF GEIPLMTDNG TFIINGTERV IVSQLHRSPG VFYDHDKGKT HSSGKVLYSA
RVIPYRGSWL DFEFDHKDIL YVRIDRRRKM PATVLLKALG YSNDALINYF YKSEDVKVGD
NGSMTKIADA ELLSNQKATA DIVDPATGEV ILKANRKFTK AAIRKMSEHG IKEIPISEEE
VVGKVASHDI YDPATGEIIV ECNEELTQAK LEEIIQKGIT TFQVLFIDNL HVTSSFRDTI
LIDKIGSTDE ALIEIYRRLR PGDPPTLKSA LVLFENLFFN AERYDLSAVG RLKLNYKLGV
DVPLDCMTLT REDILEVVRY LIELKNGKGN IDDIDHLGNR RVRAVGELLE NQYRIGLVRM
ERAIKERMSL QEVENLMPHD LINSKPVSAV VKEFFGSSQL SQFMDQTNPL SEVTHKRRLS
ALGPGGLTRE RAGFEVRDVH PTHYGRVCPI ETPEGPNIGL IASLSTYARI NEHGFVETPY
RVVKEGRVTD EVRFFSALEE EGHAIAQANA EIDETGRFAA DYISARKSGE FVLVGRDELE
LMDVAPMQLV SVAASLIPFL ENDDANRALM GSNMQRQAVP LLRADSPLVG TGMERVVARD
SGVSLVARHN GVVESVDASR IVVKIDEDQY DATGTGVDIY NLIKFARSNQ NTCINQRPLV
KVGDHVTAGD IIADGPSTDM GELALGQNVL IAFMPWGGYN YEDSILISER LVKDDRYTSI
HIEEFEAVAR DTKLGKEEIT SDIPNLGEET LKDLDESGII RIGAEVRPGD ILVGKITPKG
ETQLSPEEKL LRAIFGEKAG DVRDTSLRVP PGVEGTVIGA KIFSRKGADK DARTEIIEKA
EEMRLRKDEQ DEIRIIRDSA IGKLKKLLVG KTAAVKIEGT DGKVLIPKGA AITEEMLKSF
SMDRWDEISI ADDDTVDEKV AQTLSTLNQQ IDIIKYVFDD KVQKLRRGDD LPPGVIKMVK
VYIAIKRKLQ VGDKMAGRHG NKGVVSRILP EEDMPYMEDG RPVEIVLNPL GVPSRMNVGQ
ILEMHLGWAA KGLGWKIEEF LDKNAPHDEI KRFLKGAYNN PDMDRFLDKL EGEELLNVAK
RLKRGVPMSS PVFEGASEES IQSMLSHAGF STTGQVTLFD GKSGDKFMHQ VTVGIMYFLK
LHHLVDDKIH ARSIGPYSLV TQQPLGGKAQ FGGQRLGEME VWAMEAYGAA YALQEFLTVK
SDDVAGRTRM YEAIVKGKHT LEPGLPESFN VLIKELQSLG LDVELLEGDE D