Gene PCC8801_1751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1751 
SymbolrpoB 
ID7101821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1836973 
End bp1840284 
Gene Length3312 bp 
Protein Length1103 aa 
Translation table11 
GC content49% 
IMG OID643474819 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_002371954 
Protein GI218246583 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAATC TAACCTATAA CTACAATCTG CTTCCCGACT TAATCGAAAT TCAACGGTCG 
AGTTTTCGCT GGTTCCTCGA AGAAGGACTC ATCGAAGAAC TCGATAGCTT CTCCCCCATT
AGCGACTATA CGGGCAAATT AGAACTGCAT TTTTTAGGAG AAAACTATAA ATTAAAACAG
CCAAAATATG ATGTAGACGA AGCAAAACGG CGTGATAGTA CCTATTCCGT GCAAATGTAT
GTGCCGACTC GTTTAATTAA CAAAGAAACC GGAGAAATTA AAGAACAAGA AGTCTTTATT
GGGGATCTAC CCCTGATGAC TGAACGGGGA ACCTTTATTA TTAATGGTGC CGAACGGGTC
ATCGTCAATC AAATTGTCCG TTCTCCAGGG GTTTACTACA AAGCCGAAAT CGACAAAAAC
GGACGGAGAA CCTATTCAGC CTCGTTAATT CCCAACCGAG GAGCCTGGTT AAAGTTTGAA
ACCGACAAAA ATGGCTTAGT TTGGGTCAGA ATCGACAAAA CCCGTAAACT ATCAGCCCAA
GTTTTACTCA AAGCCATTGG CTTAAGTGAT GCCGAAATTC TCGACGGGTT ACGTCATCCC
GACTTCTATC AGCGTACCCT CGACAAAGAA GGAAATCCCA GCGAAGAAGA GGCATTATTA
GAATTATATC GCAAATTACG CCCCGGAGAA CCCCCCACGG TAACAGGAGG ACAACAACTC
CTCGATTCGC GCTTTTTTGA CAGCAAACGC TACGATCTAG GCCGCGTTGG TCGGTATAAA
CTCAACAAAA AACTGCGCCT GAATGCCCCC GATACCATTC GGGTACTACG CCCGGCGGAT
ATTATGGCAG CGATCGACTA TTTAATCAAC CTAGAATTTG ATGTAGGTAG TACCGATGAT
ATTGACCACC TAGGAAACCG TCGCGTTCGT TCCGTCGGAG AACTCCTACA AAACCAAGTC
AGAGTCGGCT TAAACCGCCT AGAACGGATA ATCCGCGAAC GGATGACCGT TAGTGAGTCC
GATGGACTAA CCCCCGCGTC TTTGGTTAAC CCCAAACCCC TAGTAGCTGC CATTAAAGAG
TTTTTTGGCT CATCCCAACT GTCCCAATTC ATGGATCAAA CCAACCCTTT AGCGGAATTG
ACCCATAAAC GGCGGATTTC AGCCCTAGGA CCTGGCGGGT TAACGCGAGA ACGCGCAGGC
TTTGCGGTGC GAGATATCCA TCCCTCCCAC CACGGGCGCA TCTGTCCGGT AGAAACGCCT
GAAGGACCCA ACGCAGGATT AATTGGTTCT TTGGCAACCT ATGCACGGGT CAATCAATAC
GGATTCATCG AAACACCCTA CTATAAGGCA GAAAATGGAC GAGTCAGACG AGATTTAGAC
CCCGTTTACC TCACCGCCGA CGAAGAAGAC GATCTGCGGG TCTGTCCGGG GGATGCGGCC
ACTGATGCCG AGGGCAATAT CCTAGGGGAA AGCGTCCCCA TTCGCTACCG TCAGGAATTT
TCCACCACCA GTCCCGAACA AGTGGACTAT GTGGCCGTCT CTCCGGTACA GATTGTTTCG
GTAGCTACGT CCATGATTCC CTTTTTGGAA CACGACGACG CTAACCGCGC CCTGATGGGA
TCGAATATGC AGCGTCAAGC CGTTCCTTTA CTGCGGCCAG AACGTCCCTT AGTGGGAACT
GGACTAGAAG CCCAAGCCGC GAGAGATTCA GGGATGGTGA TCGTCAGCCG TACCTATGGG
ATCGTTACCT ATGTAGATGC CTCCGAAATT CGGGTACAGG TGACGGGTCC AGAAAACCAC
GACAAATTAG GAACCGAGAT CACCTATCTG TTACAGAAAT ATCAACGGTC TAACCAAGAT
ACCTGTTTAA ATCAGCGTCC TCTGGTGTAC GTTAATGAGG AGGTTGTCCC TGGTCAAGTC
TTAGCCGATG GATCAGCCAC CGAAGGCGGA GAATTAGCCC TAGGACAGAA TATTCTCGTC
GCCTATATGC CCTGGGAAGG CTATAACTAC GAGGATGCGA TCCTGATTAG TGAACGGTTG
GTCTATGACG ACGTTTACAC CAGTATTCAC GTTGAAAAGT ACGAAATTGA AGCCCGTCAG
ACTAAATTAG GACCAGAAGA GATTACCCGC GAAATTCCTA ACGTTGGGGA AGATTCCCTG
AGAAACCTCG ATGAACAGGG CATTATCCGC ATTGGGGCTT GGACAGAAGC CGGGGATATC
CTGGTCGGAA AAGTGACCCC CAAAGGAGAA TCGGATCAAC CCCCTGAAGA AAAGCTATTA
CGCGCAATTT TTGGTGAAAA AGCCCGTGAT GTACGGGATA ACTCTTTAAG GGTTCCCAAC
GGAGAAAAAG GCCGCGTCGT CGATGTCCGG GTGTTTACGC GGGAACAGGG AGATGAATTA
CCCCCTGGGG CTAACATGGT AGTACGGGTG TATGTTGCCC AAAAACGCAA AATCCAAGTC
GGGGATAAAA TGGCCGGTCG CCACGGGAAT AAGGGGATTA TTTCCCGTAT TTTGCCCATT
GAAGATATGC CCTATTTACC TGATGGCCGC CCCGTGGATA TCGTTCTCAA CCCCTTGGGT
GTGCCCTCGC GGATGAACGT CGGTCAGGTG TTTGAATGCT TATTAGGATG GGCAGGTGAA
AACTTGGGCG TTCGCTTTAA AGTGACTCCC TTTGATGAAA TGTACGGCGA AGAAACCTCT
CGTAAAACCG TTCATGGCAA ACTGCAAGAA GCCAGTCACA AACGCGGGAA AAGCTGGATT
TATCAAGAAG AAAATCCAGG GAAGATTCAG GTCTTTGATG GCCGTACTGG AGAACCCTTC
GATCGCCCCG TCACCGTAGG TCAAGCCTAT ATGCTTAAAT TAGTCCACTT GGTGGATGAT
AAGATCCACG CCCGTTCGAC GGGTCCCTAC TCCTTGGTAA CACAACAACC TTTAGGTGGA
AAGGCACAAC AAGGTGGACA ACGCTTCGGA GAAATGGAAG TTTGGGCGTT AGAAGCCTAT
GGGGCAGCCT ATACCCTGCA AGAGTTGTTA ACGGTGAAAT CCGACGATAT GCAAGGACGG
AACGAAGCCT TAAATGCGAT CGTTAAGGGT AAACCCATTC CTCAACCCGG AACGCCTGAG
TCCTTTAAGG TACTGATGCG CGAATTGCAG TCCTTGGGCT TAGATATTGC CGTCCATAAG
GTAGAAAATG CCGAAGATGG GACCAGTCGG GATGTGGAGG TTGATTTGAT GGTGGATACC
CAACGTCGTA CTCCCAGTCG TCCTACTTAT GAGTCCTTAA CTAGCGAGGA TCTCGAAGAA
GAAGAAGTGT AA
 
Protein sequence
MTNLTYNYNL LPDLIEIQRS SFRWFLEEGL IEELDSFSPI SDYTGKLELH FLGENYKLKQ 
PKYDVDEAKR RDSTYSVQMY VPTRLINKET GEIKEQEVFI GDLPLMTERG TFIINGAERV
IVNQIVRSPG VYYKAEIDKN GRRTYSASLI PNRGAWLKFE TDKNGLVWVR IDKTRKLSAQ
VLLKAIGLSD AEILDGLRHP DFYQRTLDKE GNPSEEEALL ELYRKLRPGE PPTVTGGQQL
LDSRFFDSKR YDLGRVGRYK LNKKLRLNAP DTIRVLRPAD IMAAIDYLIN LEFDVGSTDD
IDHLGNRRVR SVGELLQNQV RVGLNRLERI IRERMTVSES DGLTPASLVN PKPLVAAIKE
FFGSSQLSQF MDQTNPLAEL THKRRISALG PGGLTRERAG FAVRDIHPSH HGRICPVETP
EGPNAGLIGS LATYARVNQY GFIETPYYKA ENGRVRRDLD PVYLTADEED DLRVCPGDAA
TDAEGNILGE SVPIRYRQEF STTSPEQVDY VAVSPVQIVS VATSMIPFLE HDDANRALMG
SNMQRQAVPL LRPERPLVGT GLEAQAARDS GMVIVSRTYG IVTYVDASEI RVQVTGPENH
DKLGTEITYL LQKYQRSNQD TCLNQRPLVY VNEEVVPGQV LADGSATEGG ELALGQNILV
AYMPWEGYNY EDAILISERL VYDDVYTSIH VEKYEIEARQ TKLGPEEITR EIPNVGEDSL
RNLDEQGIIR IGAWTEAGDI LVGKVTPKGE SDQPPEEKLL RAIFGEKARD VRDNSLRVPN
GEKGRVVDVR VFTREQGDEL PPGANMVVRV YVAQKRKIQV GDKMAGRHGN KGIISRILPI
EDMPYLPDGR PVDIVLNPLG VPSRMNVGQV FECLLGWAGE NLGVRFKVTP FDEMYGEETS
RKTVHGKLQE ASHKRGKSWI YQEENPGKIQ VFDGRTGEPF DRPVTVGQAY MLKLVHLVDD
KIHARSTGPY SLVTQQPLGG KAQQGGQRFG EMEVWALEAY GAAYTLQELL TVKSDDMQGR
NEALNAIVKG KPIPQPGTPE SFKVLMRELQ SLGLDIAVHK VENAEDGTSR DVEVDLMVDT
QRRTPSRPTY ESLTSEDLEE EEV