Gene Caul_0794 details

Gene Information

Locus tag: Caul_0794
Symbol: rpoB
ID: 5898249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 838718
End bp: 842800
Gene Length: 4083 bp
Protein Length: 1360 aa
Translation table: 11
GC content: 65%
IMG OID: 641561275
Product: DNA-directed RNA polymerase subunit beta
Protein accession: YP_001682423
Protein GI: 167644760
COG category: [K] Transcription
COG ID: [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit
TIGRFAM ID: [TIGR02013] DNA-directed RNA polymerase, beta subunit
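
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship in Python. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either point, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 838718
END_BP = 842800
GENE_LENGTH_BP = 4083
PROTEIN_LENGTH_AA = 1360


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(GENE_LENGTH_BP)
print(computed_length == GENE_LENGTH_BP)      # coordinate span vs. Gene Length
print(computed_protein == PROTEIN_LENGTH_AA)  # codon count vs. Protein Length
```

Under those assumptions, 842800 - 838718 + 1 gives 4083 bp, and 4083 / 3 - 1 gives 1360 residues, which agrees with the listed values.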


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.235631
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAAT CCTTCACCGG CAAGAAGCGG ATCCGGAAGT CTTTCGGCCG CATCCCCGAA 
GCTGTGCAGA TGCCGAACCT GATCGAGGTT CAGCGCTCCT CCTACGAACA GTTCCTGCAG
CGCGAGGTCC GTCCGGGCGT GCGCAAGGAC GAGGGGATCG AAGCGGTCTT CAAGTCCGTC
TTCCCGATCA AGGATTTCAA CGAGCGCGCG GTGCTCGAAT ACGTCTCGTA CGAATTCGAA
GAGCCCAAGT ACGACGTTGA GGAATGCATC CAGCGCGACA TGACCTTCGC GGCGCCGCTG
AAGGTCAAGC TGCGGCTGAT CGTGTTCGAA ACCGAAGAAG AAACCGGCGC CCGGTCCGTC
AAGGACATCA AGGAGCAGGA CGTCTACATG GGCGATATCC CGCTCATGAC CGACAAGGGC
ACGTTCATCG TCAACGGCAC CGAGCGGGTC ATCGTCTCGC AGATGCACCG CTCGCCCGGC
GTGTTCTTCG ACCACGACAA GGGCAAGACC CACGCCTCGG GCAAGCTGCT GTTCGCCGCC
CGCGTGATCC CGTATCGCGG TTCGTGGCTC GACTTCGAGT TCGACGCCAA GGACGTGGTC
TATGTCCGCA TCGACCGTCG CCGCAAGCTG CCGGCCACCA CCTTCCTCTA TGCCCTGGGC
ATGGACGGCG AAGAGATCCT GACCACCTTC TACGACGTCG TGCCCTTCGA GAAGCGCACC
GCGGCCGGCG CCGAAGGCTG GGCCACCCCG TACAAGCCCG AGCGCTGGCG CGGCGTGAAG
CCGGAATTCG CCCTGATCGA CGCCGACAGC GGCGAAGAAG TGGCCGCGGC CGGCACCAAG
ATCACCGCGC GTCAGGCCAA GAAGCTGGCT GACAATGGTC TCAAGACCCT GCTGCTGGCC
CCCGAGGCCC TGACCGGCCG CTACCTGGCC CGAGACGCCG TCAACTTCGA GACCGGCGAG
ATCTACGCCG AGGCCGGCGA CGAGCTGGAC GTCACCTCGA TCGAGGCTCT GGCGGCCCAA
GGCTTCAGCA CCATCGACGT GCTGGACATC GACCACGTCA CGGTCGGCGC CTACATGCGC
AACACCCTGC GCGTGGACAA GAACGCCGTC CGCGAGGACG CGCTATTCGA CATCTATCGC
GTCATGCGTC CCGGCGAGCC GCCGACCGTC GAAGCCGCCG AGGCGATGTT CAAGTCGCTG
TTCTTCGACG CCGAGCGCTA CGACCTGTCG GCCGTGGGCC GCGTGAAGAT GAACATGCGC
CTGGAGCTGG ACGCTTCGGA CGAGATGCGC GTGCTCCGCA AGGAAGACGT GCTGGCCGTC
CTGAAGCTGC TAGTCGGCCT GCGCGATGGC CGCGGCGAAA TCGACGACAT CGACAACCTG
GGCAACCGCC GGGTGCGTTC GGTCGGCGAA CTGCTGGAAA ACCAGTACCG CGTCGGCCTG
CTGCGCATGG AACGCGCCAT CAAGGAACGC ATGTCGTCCG TCGACATCGA CACCGTGATG
CCGCACGACC TGATCAACGC CAAGCCGGCC GCGGCCTCGG TGCGTGAATT CTTCGGCTCC
TCGCAGCTGT CGCAGTTCAT GGACCAGACC AACCCGCTGT CGGAGATCAC CCACAAGCGT
CGTCTCTCGG CGCTTGGCCC GGGCGGTCTG ACCCGCGAGC GCGCCGGCTT CGAAGTCCGC
GACGTTCACC CGACCCACTA CGGCCGCATC TGCCCGATTG AAACGCCGGA AGGCCCGAAC
ATCGGCCTGA TCAACTCGCT GGCCACCCAC GCCCGCGTCA ACAAGTACGG CTTCATCGAG
AGCCCCTACC GTCGCGTTGC CGACGGCAAG CCGCTCGAAG AGGTCGTCTA CATGTCGGCC
ATGGAAGAGT CGAAGCACGT CATCGCCCAG TCCAACATCA AGGTGTTGAA CGGCGAGATC
GTCGACGACC TGGTCCCCGG CCGGATCAAC GGCGAACCGA CCCTGCTCCA GAAGGAGCAA
GTCGACCTGA TGGACGTGTC GCCGCGTCAG GTGGTTTCGG TGGCCGCGGC CCTGATCCCC
TTCCTGGAAA ACGATGACGC CAACCGCGCC CTCATGGGCT CGAACATGCA ACGTCAGGCC
GTGCCGCTGG TGCAGTCCGA CGCCCCGCTG GTCGGCACGG GCATGGAAGC CGTCGTCGCT
CGCGACTCCG GCGCCGTGGT CATCGCCAAG CGCACCGGCG TCGTGGAGCA GATCGACGGC
ACCCGTATCG TCGTCCGCGC CACGGAAGAA ACCGACGCCG CCCGCTCGGG CGTCGACATC
TATCGCCTGA GCAAGTTCCA GCGTTCGAAC CAGTCGACCT GCATCAACCA GCGTCCGCTG
GTGAAGGTGG GCGACAAGAT CAGCGCCGGC GACATCATCG CCGACGGTCC CTCGACCGAG
CTGGGCGAAC TGGCCCTGGG CCGCAACGCG CTCGTCGCGT TCATGCCCTG GAACGGCTAC
AACTTCGAAG ACTCGATCCT GATCTCCGAG CGCATCGTCC GCGACGACGT CTTCACCTCG
ATCCACATCG AGGAATTCGA AGTCATGGCC CGCGATACGA AGCTCGGTCC GGAAGAAATC
ACCCGCGACA TCCCAAACGT CGGCGAGGAA GCCCTGCGCA ACCTCGACGA AGCGGGCATC
GTGGCGATCG GCGCCGAAGT CCAGCCGGGC GACATCCTGG TCGGCAAGGT CACCCCGAAG
GGCGAAAGCC CGATGACGCC GGAAGAAAAG CTCCTGCGCG CCATCTTCGG CGAAAAGGCC
TCGGACGTCC GCGACACCAG CCTGCGCCTG CCCCCGGGCG TCGCCGGCAC GATCGTCGAC
GTGCGGGTGT TCAACCGCCA CGGCGTCGAC AAGGACGAAC GCGCCCTGGC CATCGAACGC
GCCGAGATCG ACCGTCTGGG CAAGGACCGC GACGACGAAT TCGCGATCCT GAACCGCAAC
ATCTCGGGCC GTCTGCGCGA ACTGCTGATC GGCAATGTCG CTCTGTCGGG TCCCAAGGGC
CTGTCGCGCG GCCAGATCAC CGCCGAGGGC CTGGCTGAAG TCCAGCCGGG TCTGTGGTGG
CAGATCGCCC TCGAGGACGA GAAGGCGATG GGCGAACTGG AAAGCCTGCG CCGTCTGTTT
GACGAGAACC GCAAGCGCCT GGACCGGCGC TTCGAGGACA AGGTCGACAA GCTGCAGCGC
GGCGACGAAC TGCCCCCGGG CGTGATGAAG ATGGTAAAGG TCTTCGTGGC CGTGAAGCGC
AAGCTTCAGC CGGGCGACAA GATGGCCGGC CGTCACGGCA ACAAGGGCGT CATCTCGCGC
ATCCTGCCGA TCGAGGACAT GCCGTTCCTC GCCGACGGGA CGCACGTGGA CGTCGTTCTG
AACCCGCTGG GCGTGCCTTC GCGCATGAAC GTCGGTCAGA TCTTCGAAAC CCACCTGGGT
TGGGCCTGCG CCGGTCTCGG CAAGCAGATC ACGACCCTGC TCGAGGACTG GCAGGCCGGC
GGCCAGAAGC AGGCCCTGAT CGATCGTCTG ACCGAAATCT ACGGCCCGGA CGAGGAACTG
CCGGAAACCG AGGAAGAGCT GGTCGAACTG GCTCGCAACC TCGGCAAGGG TGTTCCGATC
GCCACCCCGG TGTTCGACGG TGCGCGTATC GGCGACATCG AGGACCACCT GCGCATGGCG
GGTCTCGATC CGTCGGGCCA GTCGATCCTG TACGACGGCC AGACCGGCGA GCAGTTCAAG
CGTCCGGTCA CGGTCGGCTA CATCTACATG CTGAAGCTGC ACCACCTGGT CGACGACAAG
ATCCACGCCC GTTCGATCGG TCCGTACTCG CTCGTCACGC AACAGCCGCT GGGTGGTAAG
GCCCAGTTCG GCGGTCAGCG CTTCGGGGAA ATGGAAGTGT GGGCTCTGGA AGCCTACGGC
GCGGCCTACA CCCTGCAGGA AATGCTGACG GTGAAGTCCG ACGACGTGGC CGGCCGGACC
AAGGTCTACG AGTCGATCGT CCGCGGCGAC GACACGTTCG AAGCCGGTAT CCCGGAAAGC
TTCAACGTGC TGGTCAAGGA AATGCGCTCG CTCGGCCTGA ACGTCGAGCT GGAGAACAGC
TGA
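
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a minimal sketch, the Python snippet below cleans one line and computes its GC fraction. It uses only the first 60 bases of the sequence, so its result describes that fragment and not the whole gene; the helper names are illustrative.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, newlines) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record (60 bases).
FIRST_LINE = (
    "ATGGCGCAAT CCTTCACCGG CAAGAAGCGG "
    "ATCCGGAAGT CTTTCGGCCG CATCCCCGAA"
)

seq = clean_sequence(FIRST_LINE)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 4083-base sequence through the same two functions should give a value comparable to the GC content listed in the record, provided that value was computed over the gene alone.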
 
Protein sequence
MAQSFTGKKR IRKSFGRIPE AVQMPNLIEV QRSSYEQFLQ REVRPGVRKD EGIEAVFKSV 
FPIKDFNERA VLEYVSYEFE EPKYDVEECI QRDMTFAAPL KVKLRLIVFE TEEETGARSV
KDIKEQDVYM GDIPLMTDKG TFIVNGTERV IVSQMHRSPG VFFDHDKGKT HASGKLLFAA
RVIPYRGSWL DFEFDAKDVV YVRIDRRRKL PATTFLYALG MDGEEILTTF YDVVPFEKRT
AAGAEGWATP YKPERWRGVK PEFALIDADS GEEVAAAGTK ITARQAKKLA DNGLKTLLLA
PEALTGRYLA RDAVNFETGE IYAEAGDELD VTSIEALAAQ GFSTIDVLDI DHVTVGAYMR
NTLRVDKNAV REDALFDIYR VMRPGEPPTV EAAEAMFKSL FFDAERYDLS AVGRVKMNMR
LELDASDEMR VLRKEDVLAV LKLLVGLRDG RGEIDDIDNL GNRRVRSVGE LLENQYRVGL
LRMERAIKER MSSVDIDTVM PHDLINAKPA AASVREFFGS SQLSQFMDQT NPLSEITHKR
RLSALGPGGL TRERAGFEVR DVHPTHYGRI CPIETPEGPN IGLINSLATH ARVNKYGFIE
SPYRRVADGK PLEEVVYMSA MEESKHVIAQ SNIKVLNGEI VDDLVPGRIN GEPTLLQKEQ
VDLMDVSPRQ VVSVAAALIP FLENDDANRA LMGSNMQRQA VPLVQSDAPL VGTGMEAVVA
RDSGAVVIAK RTGVVEQIDG TRIVVRATEE TDAARSGVDI YRLSKFQRSN QSTCINQRPL
VKVGDKISAG DIIADGPSTE LGELALGRNA LVAFMPWNGY NFEDSILISE RIVRDDVFTS
IHIEEFEVMA RDTKLGPEEI TRDIPNVGEE ALRNLDEAGI VAIGAEVQPG DILVGKVTPK
GESPMTPEEK LLRAIFGEKA SDVRDTSLRL PPGVAGTIVD VRVFNRHGVD KDERALAIER
AEIDRLGKDR DDEFAILNRN ISGRLRELLI GNVALSGPKG LSRGQITAEG LAEVQPGLWW
QIALEDEKAM GELESLRRLF DENRKRLDRR FEDKVDKLQR GDELPPGVMK MVKVFVAVKR
KLQPGDKMAG RHGNKGVISR ILPIEDMPFL ADGTHVDVVL NPLGVPSRMN VGQIFETHLG
WACAGLGKQI TTLLEDWQAG GQKQALIDRL TEIYGPDEEL PETEEELVEL ARNLGKGVPI
ATPVFDGARI GDIEDHLRMA GLDPSGQSIL YDGQTGEQFK RPVTVGYIYM LKLHHLVDDK
IHARSIGPYS LVTQQPLGGK AQFGGQRFGE MEVWALEAYG AAYTLQEMLT VKSDDVAGRT
KVYESIVRGD DTFEAGIPES FNVLVKEMRS LGLNVELENS