Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3335 |
Symbol | rpoB |
ID | 8138697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3860662 |
End bp | 3864777 |
Gene Length | 4116 bp |
Protein Length | 1371 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644870948 |
Product | DNA-directed RNA polymerase subunit beta |
Protein accession | YP_003023118 |
Protein GI | 253701929 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR02013] DNA-directed RNA polymerase, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.000000341705 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTTATT CAATCGCGAA TAACCCCCTG TTGCGCAAGA ACTTCGCCAA GATCCACAAG ATCATCGACA TACCGAACCT CATCGACATC CAGAAAAATT CCTACAAGAG ATTCCTCCAA CTCGACACAC CCGTAGACGC ACGCAAGAAT TCCGGCCTGG AGGCCGTTTT CCGCAGCGTC TTCCCGATCA GGGACTTCAG CGACACCGCT TCGCTTGAGT ACGTTTCGTA TTCGCTCGGC GCGCCGAAGT ACGACGTGGA GGAGTGCCAC CAAAGGGGCA TGACCTTCGC GGCCCCGATG AAGGTGAAGG TGAGGCTGGT GGTGTGGGAT GTGGCGAAGG ACCCCGGTAC CCGATCGATC CGCGACATCA AGGAGCAGGA GGTTTATTTC GGCGAAATCC CGCTGATGAC CGACAACGGG ACCTTTATCA TAAACGGCAC CGAGCGCGTC ATCGTGAGCC AGCTGCACCG CTCTCCGGGC GTTTTCTACG ACCACGACAA GGGGAAGACC CACTCCTCCG GGAAGGTGCT CTACTCCGCT CGCGTGATTC CGTACCGCGG CTCGTGGCTT GACTTTGAAT TTGACCATAA AGACATCCTT TATGTTCGCA TAGACAGACG GCGCAAGATG CCGGCCACCG TGCTCCTCAA GGCGCTCGGC TACTCCAACG ACGCGCTGAT CAACTATTTC TACAAGTCCG AGGACGTCAA GGTCGGCGAC AACGGCTCCA TGACCAAGAT CGCGGACGCG GAACTGCTCT CCAACCAGAA GGCCACCGCG GACATCGTGG ACCCGGCTAC CGGCGAGGTG ATCCTCAAGG CGAACCGCAA GTTCACCAAG GCCGCCATCC GCAAGATGTC CGAGCATGGG ATCAAGGAGA TCCCCATCTC CGAGGAGGAG GTGGTAGGGA AGGTCGCCTC GCACGACATC TACGACCCGG CCACCGGCGA GATCATCGTT GAGTGCAACG AGGAATTAAC GCAGGCCAAA CTCGAGGAGA TCATCCAGAA GGGGATCACC ACCTTCCAGG TGCTTTTCAT CGACAACCTG CACGTCACCT CCAGCTTCCG CGACACCATC CTGATCGACA AGATCGGCTC CACCGACGAG GCGCTGATCG AGATCTACCG CCGTCTGCGT CCGGGCGACC CGCCGACACT CAAGAGCGCG CTGGTTCTCT TCGAGAACCT GTTCTTCAAC GCGGAGCGCT ACGACCTCTC CGCCGTCGGC CGCCTGAAGC TCAACTACAA GCTGGGCGTC GACGTGCCGC TTGACTGCAT GACGCTCACC CGCGAGGACA TCCTCGAGGT GGTGCGCTAC CTGATCGAGC TGAAAAACGG CAAGGGGAAC ATCGACGACA TCGACCACCT GGGCAACCGC CGTGTTCGCG CCGTGGGCGA GCTTCTGGAA AACCAGTACC GCATCGGCCT GGTCAGGATG GAGCGCGCCA TCAAGGAGCG CATGAGCCTG CAGGAAGTCG AGAACCTGAT GCCGCACGAC CTGATCAACT CCAAGCCGGT CTCCGCGGTG GTGAAGGAGT TCTTCGGCTC CTCGCAGCTC TCCCAGTTCA TGGACCAGAC CAACCCGCTC TCCGAGGTCA CGCACAAGCG TCGTCTCTCG GCCCTGGGAC CGGGGGGCTT GACCCGCGAG CGCGCGGGCT TCGAGGTCCG CGACGTTCAT CCGACCCACT ACGGCCGCGT CTGCCCGATC GAGACCCCTG AGGGTCCGAA CATCGGTCTC ATCGCGTCGC TCTCCACCTA CGCCAGGATC AACGAACACG GCTTCGTCGA GACGCCGTAC CGCGTGGTCA AGGAAGGGCG CGTCACCGAC GAGGTCCGCT TCTTCTCCGC TTTGGAGGAG GAAGGTCACG CCATCGCCCA GGCCAACGCC GAGATCGACG AGACCGGGCG CTTCGCCGCC GACTACATCT CGGCCAGAAA GTCCGGCGAG TTCGTCCTCG TGGGACGCGA CGAGCTGGAA CTCATGGACG TGGCGCCGAT GCAGCTTGTC TCCGTCGCGG CATCGCTGAT CCCGTTCCTC GAGAACGACG ACGCGAACCG CGCACTGATG GGCTCCAACA TGCAGCGCCA GGCAGTGCCG CTTTTGCGCG CCGACTCCCC GCTGGTAGGT ACCGGCATGG AGAGGGTCGT GGCGAGGGAC TCCGGGGTCT CCCTGGTTGC CCGCCACAAC GGCGTGGTCG AATCGGTCGA CGCCTCCAGG ATCGTCGTGA AGATCGACGA GGATCAGTAC GACGCGACCG GCACCGGCGT CGACATCTAC AACCTGATCA AGTTCGCCCG TTCCAACCAG AACACCTGCA TCAACCAGAG GCCGCTCGTC AAGGTGGGCG ACCACGTCAC CGCCGGCGAC ATCATCGCCG ACGGCCCCTC CACCGACATG GGGGAACTGG CTCTGGGACA AAACGTGCTG ATCGCGTTCA TGCCGTGGGG CGGCTACAAC TACGAGGACT CCATCCTCAT CTCGGAGCGC CTGGTAAAAG ACGACCGCTA CACCTCGATC CACATCGAGG AGTTCGAGGC GGTGGCGCGC GACACCAAGC TCGGCAAGGA GGAGATCACC TCCGACATTC CGAACCTGGG CGAAGAGACC CTGAAAGACC TGGACGAGTC CGGCATCATC AGGATCGGCG CCGAGGTCCG TCCGGGCGAC ATCCTGGTCG GCAAGATCAC CCCGAAGGGC GAGACCCAGC TTTCCCCTGA AGAGAAACTC TTGCGCGCCA TCTTCGGCGA GAAGGCGGGC GACGTCCGCG ACACCTCGCT CAGGGTGCCG CCGGGGGTCG AAGGGACCGT CATCGGCGCC AAGATCTTCT CGCGTAAAGG CGCCGACAAG GACGCCCGTA CCGAGATCAT CGAGAAGGCC GAGGAGATGC GTCTCAGGAA AGACGAGCAG GACGAGATCC GGATCATCCG CGATTCGGCT ATCGGAAAAC TGAAGAAGCT CTTGGTTGGG AAAACCGCCG CAGTGAAGAT AGAAGGGACC GACGGCAAGG TGCTCATCCC GAAAGGGGCG GCCATCACCG AGGAGATGCT GAAATCGTTC TCCATGGACC GCTGGGACGA GATCTCCATC GCCGACGACG ACACCGTGGA CGAGAAGGTA GCGCAGACGC TCTCCACGCT GAACCAGCAG ATCGACATCA TCAAGTACGT CTTCGACGAC AAGGTGCAGA AGCTGCGCCG CGGCGACGAT CTCCCCCCGG GCGTCATCAA GATGGTCAAG GTGTACATCG CCATCAAGAG GAAGCTGCAG GTGGGCGACA AGATGGCAGG ACGTCACGGT AACAAAGGTG TCGTCTCCAG GATCCTGCCG GAAGAGGATA TGCCGTACAT GGAGGACGGG CGTCCCGTCG AGATCGTGCT GAACCCGCTG GGCGTACCTT CCCGTATGAA CGTCGGGCAG ATCCTTGAGA TGCACCTCGG CTGGGCCGCC AAGGGGCTCG GCTGGAAGAT AGAGGAATTC CTCGACAAGA ACGCCCCGCA CGACGAGATC AAGAGGTTCC TGAAGGGGGC TTACAACAAC CCGGACATGG ACCGCTTCCT CGACAAGCTG GAGGGGGAGG AGCTCCTCAA CGTGGCGAAG CGCCTCAAGC GCGGCGTCCC GATGTCGTCG CCGGTCTTCG AGGGGGCGAG CGAGGAATCG ATCCAGTCGA TGCTGAGCCA CGCCGGCTTC AGCACCACCG GGCAGGTCAC CCTATTCGAC GGCAAGAGCG GCGACAAGTT CATGCATCAG GTCACCGTCG GCATCATGTA CTTCCTGAAG CTGCACCATC TGGTCGACGA CAAGATCCAC GCGAGGTCCA TCGGGCCCTA CTCGCTGGTC ACCCAGCAGC CGCTGGGGGG CAAGGCGCAG TTCGGCGGGC AGAGGCTCGG GGAGATGGAG GTCTGGGCGA TGGAGGCCTA CGGCGCGGCG TACGCGCTGC AGGAATTCCT CACCGTCAAG TCCGACGACG TGGCCGGCCG CACCAGGATG TACGAGGCGA TCGTCAAAGG AAAGCACACG CTGGAGCCCG GTCTGCCGGA ATCGTTCAAC GTCCTCATCA AGGAACTCCA GTCCCTTGGC CTCGACGTGG AACTTCTGGA AGGCGACGAG GACTAG
|
Protein sequence | MAYSIANNPL LRKNFAKIHK IIDIPNLIDI QKNSYKRFLQ LDTPVDARKN SGLEAVFRSV FPIRDFSDTA SLEYVSYSLG APKYDVEECH QRGMTFAAPM KVKVRLVVWD VAKDPGTRSI RDIKEQEVYF GEIPLMTDNG TFIINGTERV IVSQLHRSPG VFYDHDKGKT HSSGKVLYSA RVIPYRGSWL DFEFDHKDIL YVRIDRRRKM PATVLLKALG YSNDALINYF YKSEDVKVGD NGSMTKIADA ELLSNQKATA DIVDPATGEV ILKANRKFTK AAIRKMSEHG IKEIPISEEE VVGKVASHDI YDPATGEIIV ECNEELTQAK LEEIIQKGIT TFQVLFIDNL HVTSSFRDTI LIDKIGSTDE ALIEIYRRLR PGDPPTLKSA LVLFENLFFN AERYDLSAVG RLKLNYKLGV DVPLDCMTLT REDILEVVRY LIELKNGKGN IDDIDHLGNR RVRAVGELLE NQYRIGLVRM ERAIKERMSL QEVENLMPHD LINSKPVSAV VKEFFGSSQL SQFMDQTNPL SEVTHKRRLS ALGPGGLTRE RAGFEVRDVH PTHYGRVCPI ETPEGPNIGL IASLSTYARI NEHGFVETPY RVVKEGRVTD EVRFFSALEE EGHAIAQANA EIDETGRFAA DYISARKSGE FVLVGRDELE LMDVAPMQLV SVAASLIPFL ENDDANRALM GSNMQRQAVP LLRADSPLVG TGMERVVARD SGVSLVARHN GVVESVDASR IVVKIDEDQY DATGTGVDIY NLIKFARSNQ NTCINQRPLV KVGDHVTAGD IIADGPSTDM GELALGQNVL IAFMPWGGYN YEDSILISER LVKDDRYTSI HIEEFEAVAR DTKLGKEEIT SDIPNLGEET LKDLDESGII RIGAEVRPGD ILVGKITPKG ETQLSPEEKL LRAIFGEKAG DVRDTSLRVP PGVEGTVIGA KIFSRKGADK DARTEIIEKA EEMRLRKDEQ DEIRIIRDSA IGKLKKLLVG KTAAVKIEGT DGKVLIPKGA AITEEMLKSF SMDRWDEISI ADDDTVDEKV AQTLSTLNQQ IDIIKYVFDD KVQKLRRGDD LPPGVIKMVK VYIAIKRKLQ VGDKMAGRHG NKGVVSRILP EEDMPYMEDG RPVEIVLNPL GVPSRMNVGQ ILEMHLGWAA KGLGWKIEEF LDKNAPHDEI KRFLKGAYNN PDMDRFLDKL EGEELLNVAK RLKRGVPMSS PVFEGASEES IQSMLSHAGF STTGQVTLFD GKSGDKFMHQ VTVGIMYFLK LHHLVDDKIH ARSIGPYSLV TQQPLGGKAQ FGGQRLGEME VWAMEAYGAA YALQEFLTVK SDDVAGRTRM YEAIVKGKHT LEPGLPESFN VLIKELQSLG LDVELLEGDE D
|
| |