Gene PCC8801_4132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4132 
Symbol 
ID7101915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4336454 
End bp4337707 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content45% 
IMG OID643477121 
ProductRNA polymerase, sigma 70 subunit, RpoD subfamily 
Protein accessionYP_002374220 
Protein GI218248849 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCCA CACCATTTTA TTCTGACACC GATTATTATT CTCCCTTGGC ATCTGACGGA 
AATGCTTCTT CCGTCGGGTT TTCTGAGAAG GCTTATGAAT TACGGGAGGG TTCCTTAGAA
CTGGAACTAC ATGGAATAGA CTTTACAGAA GCCGAAAAAA ATCACCACCA AGGTAGTACA
GATTTAGTGA GATTATACCT TCAAGATATT GGTCGAGTTC CTCTACTCAA CCAAGAAGAA
GAAGTGATCA AAGCGCAACA AGTCCAAAGC TACGTCGAGT TATTAGATTT GCGTCAAAAG
GCGGCTCAAG CAGGGGATAG AGCCATTCAA CAATTGATTG AAATGGCAGA AATTCACGAT
CGCTTGTTCA TTAGTCTCGG CTATCGTCCA TCTTGGGACC GGTGGGCAAA GGCTGCACAA
ATCTCCGTTG GGGAATTGAA AACGATTCTA GGACAAGGAA AAGAACGCTG GGCTCAACTG
GCCGACATTG AGCTAGATCA ACTCGAAACG ATTGAAAAAG TCGGGATTCG CGCGAAAGAA
CAGATGATCA AAGCCAATTT GCGTCTAGTG GTGTCTGTGG CCAAAAAATA TCAACATCGT
GGGTTAGAAC TCCTCGACTT GATTCAAGAA GGAACCCTAG GGTTAGAACG AGCAGTCGAA
AAGTTTGATC CCACTAAAGG ATATCGCTTT TCTACCTACG CTTATTGGTG GATTCGGCAG
GGCATTACTA GAGCAATCTC CACCCAAAGC CGTATTATTC GACTTCCGGT GCACATCACC
GAAAAACTCA ATAAAATTAA ACGAGCCCAG AGAAAGCTTT CCCAAGAAAA AGGGCGCAAA
GCGACTATCG AAGAAATGTC CAAGGAATTG GATATGAGTG CCGAGCAAAT TAGGGAAGTG
TTGCTGCGCA TTCCTCGCTC AGTCTCCTTA GAGGTGAAAG TCGGTAAAGA AAAAGATACG
GAATTAGTAG ACCTATTAGA AACCGAATGT CAATCCCCTG AAGAAAGTTT AATCAGTGAA
TCCCTACGGA AAGATTTGCA AGTCCTTCTT GAAGATCTCA CGGAACGAGA ACAGGACGTA
ATTAAGCTGC GCTACGGGTT CGAGGATGGA ACTTGTTACT CCTTAGCCGA TATTGGGCGT
GTCCTAGAAT TGTCCCGTGA GCGCGTGCGT CAAATTGAAG CTAAGGCACT ACAAAAGTTA
CGCCAACCGC GACGACGCAA TCAAATTCGG GATTATTTTG AATCCCTAAC CTAA
 
Protein sequence
MPATPFYSDT DYYSPLASDG NASSVGFSEK AYELREGSLE LELHGIDFTE AEKNHHQGST 
DLVRLYLQDI GRVPLLNQEE EVIKAQQVQS YVELLDLRQK AAQAGDRAIQ QLIEMAEIHD
RLFISLGYRP SWDRWAKAAQ ISVGELKTIL GQGKERWAQL ADIELDQLET IEKVGIRAKE
QMIKANLRLV VSVAKKYQHR GLELLDLIQE GTLGLERAVE KFDPTKGYRF STYAYWWIRQ
GITRAISTQS RIIRLPVHIT EKLNKIKRAQ RKLSQEKGRK ATIEEMSKEL DMSAEQIREV
LLRIPRSVSL EVKVGKEKDT ELVDLLETEC QSPEESLISE SLRKDLQVLL EDLTEREQDV
IKLRYGFEDG TCYSLADIGR VLELSRERVR QIEAKALQKL RQPRRRNQIR DYFESLT