Gene Franean1_6057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6057 
SymbolrpoB 
ID5674378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7372910 
End bp7376335 
Gene Length3426 bp 
Protein Length1141 aa 
Translation table11 
GC content67% 
IMG OID641244905 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_001510307 
Protein GI158317799 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.492439 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCGCCT CGCGTTCCGC CTCCCGCATC TCGTTCGCAA AGATCATCGA GCCCCTCGAG 
GTCCCGGATC TTCTCGCCCT GCAGACGCAG TCATTCGATT GGCTGATCGG CAACGACATC
TGGGCGGAGC GGGTCCAGGA GGCCATCGAC GCCGACAGGA ACGATGTCCC CATCACCTCC
GGGCTCGAGG AGGTCTTCGA GGAGATCTCC CCGATCGAGG ACTTCTCCGG CTCGATGTCC
CTGTCCTTCC GGGATCACAG GTTTGAGCCC CCCAAGTACT CCGTCGAGGA GTGCAAGGAC
AAGGACATGA CCTACTCGGC GCCGCTGTTC GTCACCGCCG AGTTCACCAA CAACACCACC
GGTGAGATCA AGAGCCAGAC GGTGTTCATG GGCGACTTCC CGCTCATGAC CCCGAAGGGC
ACGTTCGTCA TCAACGGCAC CGAGCGCGTG GTGGTCAGCC AGCTCGTCCG GTCCCCGGGT
GTGTACTTCG AGAAGGCGAT CGACAAGGCG TCCGACAAGG ATCTGTTCTC CTGCAAGGTC
ATCCCCTCAC GGGGGGCCTG GCTCGAGTTC GAGATCGACA AGCGGGACAC CGTCGGCGTC
CGCATCGACC GCAAGCGCCG CCAGTCGGTC ACCGTCCTGC TCAAGGCCCT GGGCTGGGAT
GAGGCCCGCA TCCTGGAGCG CTTCGGCGAC TTCCCGTCGA TGCGGGTCAC CCTGGAGAAG
GACCACACCT CCAGCCAGGA CGACGCCCTG CTCGACATCT ACCGCAAGCT GCGCCCGGGC
GAGCCGCCGA CGCGGGAGTC GGCGCAGACC CTGCTGGAGA ACCTCTTCTT CAACCCGAAG
CGCTACGACC TGGCCAAGGT CGGCCGCTAC AAGGTGAACA AGAAGCTGAC CCTCGAGGTC
GCGCACGATG TCGGCGTCCT CACCGAGGAC GACGTCGTCC GCACGATCGA GTACATCGTC
AAGCTGCACG CGGGCCAGGA CCCGGACAAC TACGAGGTCG ACGACATCGA CCACTTCGGC
AACCGGCGCC TGCGCACGGT CGGCGAGCTC ATCCAGAACC AGGTCCGCCT CGGCCTGGCC
CGGATGGAGC GCGTCGTCCG CGAGCGCATG ACGACCCAGG ACGTCGAGGC GATCACGCCG
CAGACCCTGA TCAACATCCG GCCGGTCGTC GCCTCCATCA AGGAGTTCTT CGGCACCAGC
CAGCTCTCCC AGTTCATGGA CCAGACGAAC CCGCTGGCCG GCCTGACCCA CAAGCGCCGC
CTGTCGGCGC TGGGCCCCGG CGGTCTGTCT CGGGAGCGGG CCGGCTTCGA GGTCCGCGAC
GTGCACCCCA GCCACTACGG CCGGATGTGC CCGATCGAGA CGCCGGAGGG CCCGAACATC
GGCCTGATCG GCTCGCTGTC GACGTTCGCG CGGGTCAACC CGTTCGGCTT CGTCGAGACC
CCGTACCGCA AGGTGGAGAA CGGCCGGGTC ACCGGCCAGA TCGACTACCT GACCGCGGAC
GAGGAGGACC GCCACGTCAA GGCGCAGGCG AACACCCCGC TGAACGCGGA CGGCACGTTC
GCCGAGGACC GCGTCCTGGT CCGCCGCAAG GGCGGTGAGG TCGAGTTCAT CCCGCCGGAC
GAGGTCGACT ACATGGACGT CTCGCCGCGG CAGATGGTGT CCGTCGCGAC GGCGATGATC
CCCTTCCTTG AGCACGACGA TGCCAACCGT GCCCTGATGG GCTCGAACAT GCAGCGCCAG
TCGGTTCCGC TGCTGCGTTC CGAGGCTCCG CTGGTCGGCA CCGGCATGGA GGCCCGCGCG
GCCAAGGACG CCGGCGACGT GGTCGTCTGC GCCCAGTCCG GCGTGGTCGA GGATCTGTCG
GCCGACTACA TCACCGTCAT GCACGACGAC GGGACGCGGC GCACCTACCG CCTGGCGAAG
TTCCGCCGGT CGAACCAGGG CACCTGCATC AACCAGAAGC CGATCGTCTT CGAGGGCGAC
CGCGTCGAGG CCGGTCAGGT CATCGCCGAC GGCCCGTGCA CCGACAACGG CGAGATGGCG
CTGGGCAAGA ACCTCCTGGT GGCCTTCATG CCGTGGGAGG GGCACAACTA CGAGGACGCG
ATCATCCTCT CCCAGCGCCT GGTGCAGGAC GACGTCCTCT CCTCGATCCA CATCGAGGAG
CACGAGGTCG ATGCCCGGGA CACCAAGCTG GGCCCGGAGG AGATCACCCG GGACATCCCG
AACGTCGCCG AGGAGGTTCT CGCCGACCTC GACGAGCGCG GCATCATCCG GATCGGTGCC
GAGGTGTCGC CCGGTGACGT GCTGGTCGGC AAGGTCACCC CGAAGGGCGA GACCGAGCTG
ACCCCGGAGG AGCGCCTGCT GCGCGCGATC TTCGGTGAGA AGGCCCGGGA GGTCCGCGAC
ACCTCGCTGA AGGTGCCGCA CGGCGAGTCC GGCAAGGTCA TCGGTGTCCG CGTGTTCTCC
CGCGAGGACG GCGACGAGCT GCCCCCGGGC GTGAACGAGC TGGTCCGCGT CTACGTGGCC
CAGAAGCGCA AGATCACTGA CGGTGACAAG CTCGCCGGCC GGCACGGCAA CAAGGGCGTC
ATCGCCAAGA TCCTGCCGGT GGAGGACATG CCCTTCCTCC CCGACGGCAC CGCCGTCGAC
GTCGTGCTGA ACCCGCACGG TGTGCCGCGT CGTATGAACA TCGGCCAGAT CCTGGAGACG
CACCTCGGCT GGGTCGCCAA GACCGGCTGG CAGGTCGACT CCGGCGCGGA GGGCTGGAAG
GAGCGCCTGC GCGGCATCGG CGCCGACGCG GCCCCGCCCG GGACCAACGT GGCGACGCCG
GTGTTCGACG GTGCCCGCGA GGAGGAGATC ACCGGCCTGC TCGACGCCAC CCTGCCCAAC
CGGGACGGTC AGCAGATGAT CGGTTCCTCC GGCAAGGCCG AGCTGTACGA CGGCCGCACC
GGAGAGCCGT ACCCGTACCC CGTGGCGGTT GGCTACATCT ACATCCTCAA GCTCCTGCAC
CTGGTCGACG ACAAGATCCA CGCCCGGTCG ACCGGCCCCT ACTCGATGAT CACGCAGCAG
CCGCTCGGTG GCAAGGCCCA GTTCGGTGGC CAGCGGTTCG GTGAGATGGA GGTGTGGGCG
CTCGAGGCGT ACGGCGCCGC CTACGCCCTG CAGGAACTCC TCACGATCAA GTCGGACGAC
GTTGTCGGCC GAGTGAAGGT CTATGAGGCG ATCGTCAAGG GCGAGAACAT CCCGGAACCC
GGTATCCCGG AGTCGTTCAA GGTCCTCATC AAGGAGATGC AGTCGCTGTG CCTCAACGTC
GAGGTGCTCT CCAGCGACGG TGTCTCGATC GAGATGCGGG ACACCGACGA GGACGTCTTC
CGCGCGGCGG AGGAGCTCGG AATCGACCTG TCACGGCGCG AGCCCAGCAG CGTCGAAGAG
GTCTGA
 
Protein sequence
MAASRSASRI SFAKIIEPLE VPDLLALQTQ SFDWLIGNDI WAERVQEAID ADRNDVPITS 
GLEEVFEEIS PIEDFSGSMS LSFRDHRFEP PKYSVEECKD KDMTYSAPLF VTAEFTNNTT
GEIKSQTVFM GDFPLMTPKG TFVINGTERV VVSQLVRSPG VYFEKAIDKA SDKDLFSCKV
IPSRGAWLEF EIDKRDTVGV RIDRKRRQSV TVLLKALGWD EARILERFGD FPSMRVTLEK
DHTSSQDDAL LDIYRKLRPG EPPTRESAQT LLENLFFNPK RYDLAKVGRY KVNKKLTLEV
AHDVGVLTED DVVRTIEYIV KLHAGQDPDN YEVDDIDHFG NRRLRTVGEL IQNQVRLGLA
RMERVVRERM TTQDVEAITP QTLINIRPVV ASIKEFFGTS QLSQFMDQTN PLAGLTHKRR
LSALGPGGLS RERAGFEVRD VHPSHYGRMC PIETPEGPNI GLIGSLSTFA RVNPFGFVET
PYRKVENGRV TGQIDYLTAD EEDRHVKAQA NTPLNADGTF AEDRVLVRRK GGEVEFIPPD
EVDYMDVSPR QMVSVATAMI PFLEHDDANR ALMGSNMQRQ SVPLLRSEAP LVGTGMEARA
AKDAGDVVVC AQSGVVEDLS ADYITVMHDD GTRRTYRLAK FRRSNQGTCI NQKPIVFEGD
RVEAGQVIAD GPCTDNGEMA LGKNLLVAFM PWEGHNYEDA IILSQRLVQD DVLSSIHIEE
HEVDARDTKL GPEEITRDIP NVAEEVLADL DERGIIRIGA EVSPGDVLVG KVTPKGETEL
TPEERLLRAI FGEKAREVRD TSLKVPHGES GKVIGVRVFS REDGDELPPG VNELVRVYVA
QKRKITDGDK LAGRHGNKGV IAKILPVEDM PFLPDGTAVD VVLNPHGVPR RMNIGQILET
HLGWVAKTGW QVDSGAEGWK ERLRGIGADA APPGTNVATP VFDGAREEEI TGLLDATLPN
RDGQQMIGSS GKAELYDGRT GEPYPYPVAV GYIYILKLLH LVDDKIHARS TGPYSMITQQ
PLGGKAQFGG QRFGEMEVWA LEAYGAAYAL QELLTIKSDD VVGRVKVYEA IVKGENIPEP
GIPESFKVLI KEMQSLCLNV EVLSSDGVSI EMRDTDEDVF RAAEELGIDL SRREPSSVEE
V