Gene TM1040_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0234 
SymbolrpoB 
ID4076267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp247712 
End bp251851 
Gene Length4140 bp 
Protein Length1379 aa 
Translation table11 
GC content59% 
IMG OID638005528 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_612229 
Protein GI99080075 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAAA CGTTCCTTGG CCAGAAACGT CTTCGCAAAT ATTACGGCAA AATCCGCGAA 
GTGCTGGATA TGCCGAACCT CATTGAGGTT CAGAAATCTT CTTATGATCT TTTCCTGCGC
TCCGGTGATG CGCCTCAGCC GCTTGACGGC GAAGGCATCA AAGGTGTGTT CCAGTCGGTT
TTCCCGATCA AGGATTTCAA CGAAACCTCC GTTCTGGAGT TCGTGAACTA CGCGCTTGAG
CGTCCCAAGT ACGACGTTGA CGAATGCATG CAGCGTGACA TGACCTACAG CGCACCGCTG
AAGGTCACTC TGCGCCTGAT CGTGTTTGAT GTCGATGAGG ACACCGGCGC GAAATCCGTT
AAAGACATCA AGGAACAGGA TGTCTTTATG GGCGACATGC CCCTGATGAC CCCGAACGGG
ACGTTCATCG TGAACGGCAC CGAACGTGTG ATCGTGTCCC AGATGCACCG CTCACCGGGT
GTGTTCTTTG ACCACGATAA GGGCAAAACC CATTCCTCCG GTAAACTGCT GTTTGCCTGC
CGCATCATCC CGTATCGCGG CTCCTGGCTC GACTTCGAAT TCGACGCCAA AGACATCGTC
TTTGCTCGCA TCGACCGTCG CCGCAAACTG CCGGTGACCA CGCTGCTTTA TGCGCTGGGT
CTCGACCAGG AAGGCATCAT GGATGCCTAC TACAACACCG TGAACTTCAA ACTTGAGAAG
TCGCGCGGCT GGGTCACGCC CTTCTTCCCC GAGCGTGTGC GTGGCACCCG CCCGACCTAT
GACCTCGTGG ATGCCGCCAC TGGCGAAATC ATCTGCGAAG CAGGCAAGAA AGTCACCCCG
CGCGCGGTCA AGAAGATGAT CGACGAAGGC AACATTACCG AGCTGCTCGT TCCGTTCGAG
CATATTGTCG GTAAATATGT CGCCAAGGAT ATCATCAACG AAGAAAACGG CGCGATCTAT
GTCGAAGCGG GCGACGAGCT GACGCTTGAG TATGACAAAG ACGGCGACAT CATTGGCGGG
TCCGTCAAAG AGCTGCTCGA TGCCGGTATC ACCGACATCC CGGTTCTCGA CATCGACAAC
GTCAATGTCG GCCCCTACAT GCGCAACACC ATGGCGCAGG ACAAGAACAT GTCCCGCGAA
ACCGCGCTCA TGGACATCTA CCGCGTGATG CGTCCGGGCG AGCCGCCCAC CGTCGAAGCG
GCCTCCGCGC TGTTTGACAC CCTGTTTTTT GATTCCGAGC GCTACGACCT GTCGGCCGTT
GGTCGCGTGA AGATGAACAT GCGTCTGGCT CTGGATGCCG AAGACACTCA GCGGACCCTG
CGCAAAGAAG ACATCGTCGC CTGTATCAAG GCGCTGGTGG AGCTGCGTGA CGGCAAGGGC
GACGTGGACG ACATCGACCA CCTCGGCAAC CGCCGCGTGC GTTCCGTGGG CGAGCTCATG
GAAAACCAGT ACCGCGTTGG TCTCCTGCGC ATGGAGCGCG CGATCAAAGA GCGTATGTCC
TCTGTCGAGA TCGACACCGT GATGCCGCAG GACCTGATCA ACGCCAAGCC CGCTGCTGCG
GCTGTGCGTG AATTCTTTGG CTCCTCGCAG CTGTCGCAGT TCATGGACCA AACGAACCCG
CTGTCGGAAG TCACCCACAA ACGCCGTCTC TCGGCGCTTG GCCCCGGTGG TCTGACCCGC
GAGCGTGCCG GCTTTGAGGT GCGCGACGTT CACCCGACCC ACTATGGCCG GATGTGTCCG
ATTGAAACGC CGGAAGGTCC GAACATTGGT CTGATCAACT CGCTGGCGAC CTTTGCCCGC
GTGAACAAGT ATGGCTTCAT CGAGACCCCC TATCGCGTGG TGAAGGACGG ACAAGTGACC
GATGAAGTTC ACTACATGTC CGCCACCGAG GAAATGCGTC ACACCGTGGC GCAGGCGAAC
GCCAACCTCG ATGAAGACAA CCGCTTTGTG AATGACCTTG TGTCCACCCG TCAGTCGGGC
GACTACACGC TGGCTCCGAA TGAAAGCGTC GACCTGATCG ACGTGTCGCC AAAACAGTTG
GTATCGGTTG CGGCCTCGCT CATTCCGTTC CTTGAGAACG ACGACGCGAA CCGCGCCCTG
ATGGGTTCGA ACATGCAACG TCAGGCGGTT CCGCTGCTGC GCGCAGAAGC GCCGCTGGTC
GGTACCGGCA TCGAGGAAAT CGTGGCGCGC GACTCTGGCG CGGCGATCAT GGCGAAGCGC
GCAGGCGTGA TCGACCAGAT CGACGCTCAG CGTATCGTGA TCCGTGCAAC CTCCGATCTG
GAGCTGGGCG ACGCAGGCGT GGACATCTAC CGCATGCGCA AGTTCCAGCG CTCGAACCAG
AACACCTGCA TCAACCAGCG CCCGCTGGTG AAAGTGGGTC AGACGGTCGA GAAGGGCGAA
GTGATCGCAG ATGGTCCCTC CACCGATATG GGTGAACTGG CGCTCGGTAA AAACGTGGTT
GTCGCGTTTA TGCCGTGGAA CGGTTACAAC TATGAGGACT CCATCCTGAT CTCCGAGCGT
ATCGCGCGTG ACGACGTCTT CACCTCGATC CACATCGAGG AATTCGAAGT CGCCGCTCGT
GACACCAAGC TTGGGCCGGA AGAGATCACC CGCGATATTC CGAACGTTGG TGAAGAAGCG
CTGCGCAACC TCGACGAGGC AGGCATCGTC TACATCGGTG CCGATGTGGA GCCGGGCGAT
ATCCTCGTGG GTAAGATCAC ACCGAAGGGC GAAAGCCCGA TGACGCCGGA AGAAAAGCTT
CTGCGCGCCA TCTTTGGTGA GAAAGCCTCC GACGTGCGTG ACACCTCGCT GCGTGTGAAG
CCGGGCGACT ACGGTACTGT TGTTGAGGTT CGCGTCTTCA ACCGTCACGG CGTCGAAAAA
GACGAACGTG CCCTGCAGAT CGAGCGCGAA GAAGTCGAAC GTCTGGCCCG TGACCGGGAC
GACGAGCTCG GCATTCTGGA CCGCAACATC TACGCCCGTC TGCGTGACCT TCTGCTCGGC
AAAACTGCCG TCAAAGGCCC CAAAGGCGTG CGCGGCAACA CCGTCATCGA CGAGGATCTG
CTGGACAACC AGCTAACCCG TGGTCAGTGG TGGATGCTTG CTCTGGAAGA AGAGCAGGAC
GCTCAGATCC TTGAGGCTCT GAACGAGCAG TACGAAGCAC AAAAGCGTGC TCTGGACGCC
CGTTTCGAGG ACAAGGTCGA GAAGGTCCGC CGCGGCGACG ATCTGCCTCC GGGTGTGATG
AAGATGGTCA AAGTCTTCAT CGCCGTGAAG CGTAAGCTGC AGCCGGGCGA CAAGATGGCC
GGTCGTCACG GGAACAAAGG TGTTATCTCT CGCGTGGTGC CGATGGAGGA CATGCCGTTC
CTCGCCGATG GTACCCCGGT GGACTTCTGC CTTAACCCGC TCGGCGTGCC GTCGCGTATG
AACGTTGGTC AGATCCTTGA AACCCACATG GGTTGGGCCG CACGCGGCCT GGGTCTGAAA
ATCGACGACG CACTTCAGGA CTATCGCCGC ACTGGCGATC TGACCCCTGT GCGTGATGCG
ATGCGCGAGG CCTATGGTGA GGACGTTTAT GAAGAGGGGA TCTCTTCCAT GGACGAAACC
CAACTCATCG AGGCTGCTGG TAACGTGACC CGTGGTGTGC CGATCGCAAC GCCGGTCTTT
GACGGTGCGA AAGAAGATGA CGTCAACGAC GCGCTGGTGC GCGCGGGCTT TGACCAGTCC
GGTCAGTCGA TCCTGTTCGA TGGCCGCACC GGTGAGCAGT TCGCACGTCC AGTGACCGTG
GGCATCAAGT ATCTCTTGAA GCTGCACCAC CTCGTGGACG ACAAGATCCA CGCCCGTTCC
ACTGGTCCGT ACTCGCTGGT TACCCAGCAG CCGCTCGGTG GTAAGGCACA GTTCGGTGGT
CAGCGCTTTG GTGAGATGGA GGTCTGGGCT CTGGAAGCTT ACGGCGCCGC CTACACCCTG
CAAGAGATGC TCACCGTGAA ATCGGATGAC GTCGCTGGCC GGACCAAGGT CTATGAGTCG
ATCGTCAAGG GCGAGGACAA CTTTGAGGCG GGCGTACCGG AATCGTTCAA CGTTCTGGTC
AAAGAAGTCC GCGGCCTCGG CCTGAACATG GAACTCCTGG ATGCGGAGGT TGAGGAGTGA
 
Protein sequence
MAQTFLGQKR LRKYYGKIRE VLDMPNLIEV QKSSYDLFLR SGDAPQPLDG EGIKGVFQSV 
FPIKDFNETS VLEFVNYALE RPKYDVDECM QRDMTYSAPL KVTLRLIVFD VDEDTGAKSV
KDIKEQDVFM GDMPLMTPNG TFIVNGTERV IVSQMHRSPG VFFDHDKGKT HSSGKLLFAC
RIIPYRGSWL DFEFDAKDIV FARIDRRRKL PVTTLLYALG LDQEGIMDAY YNTVNFKLEK
SRGWVTPFFP ERVRGTRPTY DLVDAATGEI ICEAGKKVTP RAVKKMIDEG NITELLVPFE
HIVGKYVAKD IINEENGAIY VEAGDELTLE YDKDGDIIGG SVKELLDAGI TDIPVLDIDN
VNVGPYMRNT MAQDKNMSRE TALMDIYRVM RPGEPPTVEA ASALFDTLFF DSERYDLSAV
GRVKMNMRLA LDAEDTQRTL RKEDIVACIK ALVELRDGKG DVDDIDHLGN RRVRSVGELM
ENQYRVGLLR MERAIKERMS SVEIDTVMPQ DLINAKPAAA AVREFFGSSQ LSQFMDQTNP
LSEVTHKRRL SALGPGGLTR ERAGFEVRDV HPTHYGRMCP IETPEGPNIG LINSLATFAR
VNKYGFIETP YRVVKDGQVT DEVHYMSATE EMRHTVAQAN ANLDEDNRFV NDLVSTRQSG
DYTLAPNESV DLIDVSPKQL VSVAASLIPF LENDDANRAL MGSNMQRQAV PLLRAEAPLV
GTGIEEIVAR DSGAAIMAKR AGVIDQIDAQ RIVIRATSDL ELGDAGVDIY RMRKFQRSNQ
NTCINQRPLV KVGQTVEKGE VIADGPSTDM GELALGKNVV VAFMPWNGYN YEDSILISER
IARDDVFTSI HIEEFEVAAR DTKLGPEEIT RDIPNVGEEA LRNLDEAGIV YIGADVEPGD
ILVGKITPKG ESPMTPEEKL LRAIFGEKAS DVRDTSLRVK PGDYGTVVEV RVFNRHGVEK
DERALQIERE EVERLARDRD DELGILDRNI YARLRDLLLG KTAVKGPKGV RGNTVIDEDL
LDNQLTRGQW WMLALEEEQD AQILEALNEQ YEAQKRALDA RFEDKVEKVR RGDDLPPGVM
KMVKVFIAVK RKLQPGDKMA GRHGNKGVIS RVVPMEDMPF LADGTPVDFC LNPLGVPSRM
NVGQILETHM GWAARGLGLK IDDALQDYRR TGDLTPVRDA MREAYGEDVY EEGISSMDET
QLIEAAGNVT RGVPIATPVF DGAKEDDVND ALVRAGFDQS GQSILFDGRT GEQFARPVTV
GIKYLLKLHH LVDDKIHARS TGPYSLVTQQ PLGGKAQFGG QRFGEMEVWA LEAYGAAYTL
QEMLTVKSDD VAGRTKVYES IVKGEDNFEA GVPESFNVLV KEVRGLGLNM ELLDAEVEE