Gene Synpcc7942_1522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1522 
SymbolrpoB 
ID3774946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1573729 
End bp1577031 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content57% 
IMG OID637799955 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_400539 
Protein GI81300331 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.18343 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGC AAACGCAACT CGCTCCCGCT GCCTTCCATT TACCTGATCT CGTTGCCATT 
CAACGGAACA GCTTCCGTTG GTTCCTAGAA GAAGGGCTCA TTGAGGAGCT GGAAAGCTTT
TCACCAATCA CCGACTATAC CGGTAAGCTT GAGCTTCACT TCCTCGGTAA GCAGTATAAG
CTCAAGCGCC CGAAGTACGA CGTTGATGAA GCAAAACGGC GTGACGGCAC CTACTCGGTT
CAAATGTATG TGCCGACCCG TTTGATCAAC AAAGAAACGG GCGAAATCAA GGAGCAGGAA
GTCTTTATTG GCGATCTGCC CCTGATGACT GATCGCGGCA CCTTCATCAT CAACGGTGCC
GAACGGGTGA TCGTCAACCA GATCGTCCGT AGCCCCGGCG TCTACTACAA ATCAGAGCGC
GACAAGAACG GTCGCCTCAC CCACAATGCC AGCCTGATCC CCAACCGTGG CGCTTGGCTG
AAATTTGAAA CCGATAAAAA CGGTTTAGTT TGGGTGCGCA TCGACAAGAC GCGTAAGTTG
TCGGCGCAGG TACTGCTGAA AGCCTTGGGC CTGAGCGACA ACGAAATTTA CGACAAGCTC
CGTCACCCTG AGTATTACCA GAAGACCATC GATAAAGAAG GTCAGTTCAG CGAAGACGAA
GCGCTGATGG AGCTCTACCG CAAGCTCCGT CCGGGCGAAC CGCCCACGGT CTCTGGCGGT
CAGCAATTGC TGGAATCGCG GTTCTTCGAT CCCAAACGCT ACGACCTAGG TCGGGTGGGC
CGCTACAAGC TCAATAAGAA GCTGGGTCTC AACGTCGCTG ATACGGTGCG GACGCTGACA
TCCGAAGATA TTTTGGCGGC GATCGACTAC CTGATTAACC TCGAGCTTGA CTTGGGTGGC
TGTGAAGTCG ATGACATCGA CCACCTCGGC AACCGTCGGG TGCGATCGGT GGGCGAGCTG
CTGCAAAACC AAGTGCGGGT CGGCCTCAAC CGCCTAGAGC GGATCATTCG GGAACGGATG
ACGGTGTCGG ATTCCGACAG TCTTTCGCCG GCTTCCTTGG TCAACCCCAA ACCGCTGGTG
GCTGCGATCA AAGAATTCTT TGGTTCCTCG CAACTCTCGC AGTTCATGGA CCAAACCAAC
CCCTTGGCGG AGTTGACCCA TAAACGACGT CTGAGTGCCC TCGGTCCCGG TGGTTTGACA
CGGGAGCGGG CTGGCTTTGC GGTGCGGGAC ATTCACCCCA GCCACTACGG CCGGATTTGC
CCGATTGAAA CGCCGGAAGG CCCGAACGCG GGTCTGATTG GTTCCTTAGC GACCCACGCC
CGCGTCAACG ACTACGGCTT TATTGAAACG CCGTTCTGGC GCGTCGAAGA AGGACGGGTT
CGCAAGGACT TGGCGCCGGT CTACATGACT GCTGACCAGG AAGATGACCT GCGGGTGGCT
CCGGGAGACG TGGCTACGGA TGACGCGGGC TACATCCTGG GAACCACAAT TCCGGTACGT
TATCGCCAGG ACTTCACCAC CACGACGCCG GAGCGGGTGG ACTACGTTGC GCTCTCGCCG
GTGCAGATTA TCTCGGTGGC AACGTCGTTG ATTCCTTTCT TGGAACACGA TGACGCCAAC
CGTGCCCTGA TGGGCTCGAA CATGCAACGG CAGGCGGTGC CGCTGTTGCG GCCAGAGCGG
CCTTTGGTCG GGACGGGTCT GGAGCCCCAA GCGGCCCGTG ACTCAGGGAT GGTGATCACC
AGCCCGGTCG ATGGCACGAT CTCCTACGTC GATGCCACCC ACATTGAGGT GACGGCTGAC
ACAGGTGAGA AGTATGGCTA CGCCCTACAG AAGTACCAGC GCTCCAACCA AGATACTTGT
CTGAACCAAC GGCCGATCGT GTTTGAAGGC GATCGCGTTC AACGGGGTCA GGTGATTGCT
GACGGTTCTG CCACCGAAAA AGGTGAGCTG GCCCTGGGGC AAAACATCCT CGTCGCCTAC
ATGCCTTGGG AAGGCTACAA CTACGAGGAC GCGATTCTGA TCAGCGAGCG GCTGGTCTAT
GACGACGTCT ATACCTCGAT CCACATTGAA AAATTCGAGA TTGAGGCGCG TCAGACCAAG
TTAGGCCCTG AGGAGATTAC CCGCGAAATT CCCAACGTTG GCGAAGATGC TCTGCGCCAA
CTCGACGAAA ACGGGATTAT CCGCGTCGGC GCATGGGTGG AGTCCGGTGA CATCTTGGTT
GGTAAGGTGA CGCCCAAAGG CGAATCGGAT CAGCCGCCAG AAGAAAAACT GCTGCGGGCA
ATCTTCGGTG AGAAAGCGCG GGACGTCCGC GATAACTCCC TGCGGGTGCC GAATGGTGAG
AAAGGCCGCG TTGTTGATGT GCGTCTCTTC ACCCGTGAGC AGGGTGATGA GTTGCCGCCG
GGCGCCAACA TGGTTGTCCG GGTTTACGTG GCTCAGAAAC GCAAGATCCA AGTCGGCGAC
AAGATGGCGG GTCGCCACGG CAACAAGGGG ATCATTTCGC GGATTCTGCC CTGCGAAGAC
ATGCCTTACC TGCCGGATGG CACGCCGCTA GACATCGTGC TCAATCCCTT GGGTGTACCC
TCGCGGATGA ACGTCGGTCA GGTGTTTGAG TGCATGCTGG GCTGGGCGGG TCAGCTGCTA
GATGCCCGCT TCAAGGTCAC GCCCTTTGAC GAGATGTACG GGGCGGAAGC CTCTCGCTTG
ACTGTCAACG CCAAGCTGTC TGAAGCGCGG GAGCAAACAG GTCAGCCCTG GGTCTTTAGC
GATGATGAAC CCGGCAAGAT CCAGGTCTAC GACGGTCGGA CAGGGGAACC CTTCGATCGC
CCAGTGACGG TGGGTCGTGC CTACATGCTT AAGCTGGTTC ACTTGGTCGA CGACAAGATC
CACGCTCGCT CGACAGGTCC CTACTCCTTG GTGACCCAAC AACCCTTGGG TGGTAAAGCC
CAACAAGGGG GTCAGCGCTT CGGGGAGATG GAAGTTTGGG CACTGGAAGC CTACGGTGCT
GCCTACATCC TGCAAGAGTT GCTGACGGTC AAGTCGGACG ACATGCAAGG GCGGAATGAA
GCTCTCAATG CGATCGTTAA GGGCAAAGCG ATTCCTCGCC CCGGCACACC AGAGTCTTTC
AAGGTGTTGA TGCGAGAGCT GCAGTCGCTC TGTCTCGACA TTGCGGTCTA CAAGGCCTCA
ACCGAGGACT ATGAAGAAGA CAAAGAAGTG GATCTGATGG CGGATGTCAA CCAGCGTCGG
ACTCCCTCGC GCCCGACCTA CGAGTCGATG TCTGTGGGCG ACATTGATGA CGATGACGAC
TAA
 
Protein sequence
MAEQTQLAPA AFHLPDLVAI QRNSFRWFLE EGLIEELESF SPITDYTGKL ELHFLGKQYK 
LKRPKYDVDE AKRRDGTYSV QMYVPTRLIN KETGEIKEQE VFIGDLPLMT DRGTFIINGA
ERVIVNQIVR SPGVYYKSER DKNGRLTHNA SLIPNRGAWL KFETDKNGLV WVRIDKTRKL
SAQVLLKALG LSDNEIYDKL RHPEYYQKTI DKEGQFSEDE ALMELYRKLR PGEPPTVSGG
QQLLESRFFD PKRYDLGRVG RYKLNKKLGL NVADTVRTLT SEDILAAIDY LINLELDLGG
CEVDDIDHLG NRRVRSVGEL LQNQVRVGLN RLERIIRERM TVSDSDSLSP ASLVNPKPLV
AAIKEFFGSS QLSQFMDQTN PLAELTHKRR LSALGPGGLT RERAGFAVRD IHPSHYGRIC
PIETPEGPNA GLIGSLATHA RVNDYGFIET PFWRVEEGRV RKDLAPVYMT ADQEDDLRVA
PGDVATDDAG YILGTTIPVR YRQDFTTTTP ERVDYVALSP VQIISVATSL IPFLEHDDAN
RALMGSNMQR QAVPLLRPER PLVGTGLEPQ AARDSGMVIT SPVDGTISYV DATHIEVTAD
TGEKYGYALQ KYQRSNQDTC LNQRPIVFEG DRVQRGQVIA DGSATEKGEL ALGQNILVAY
MPWEGYNYED AILISERLVY DDVYTSIHIE KFEIEARQTK LGPEEITREI PNVGEDALRQ
LDENGIIRVG AWVESGDILV GKVTPKGESD QPPEEKLLRA IFGEKARDVR DNSLRVPNGE
KGRVVDVRLF TREQGDELPP GANMVVRVYV AQKRKIQVGD KMAGRHGNKG IISRILPCED
MPYLPDGTPL DIVLNPLGVP SRMNVGQVFE CMLGWAGQLL DARFKVTPFD EMYGAEASRL
TVNAKLSEAR EQTGQPWVFS DDEPGKIQVY DGRTGEPFDR PVTVGRAYML KLVHLVDDKI
HARSTGPYSL VTQQPLGGKA QQGGQRFGEM EVWALEAYGA AYILQELLTV KSDDMQGRNE
ALNAIVKGKA IPRPGTPESF KVLMRELQSL CLDIAVYKAS TEDYEEDKEV DLMADVNQRR
TPSRPTYESM SVGDIDDDDD