Gene PCC8801_0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0228 
Symbol 
ID7105285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp220052 
End bp221002 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content47% 
IMG OID643473341 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_002370487 
Protein GI218245116 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGTGG CGCAGTTTCA AATTGAGTGT ATAGAATCTA AGACTCAGAA AAATCAAAGT 
CAATATAGTA AGTTTGTCCT AGAACCTCTA GCGAGGGGTC AGGGAACCAC CGTCGGCAAC
GCTTTACGAC GGGTGTTACT GGCTAACTTG CAAGGAGCCG CCGTCACAGC GATCCGGATT
GCAGGGGTAA ATCACGAATT TGCTACCATT CCAGGGGTCA GAGAGGATGT CTTAGAAATC
ATGTTAAACA TGAAAGAAAT CGTCCTAAAA AGCTATAGTG ATCAGGCGCA AATTGGCCGC
CTGGTTGCCA CAAGTGCGGG GACGGTCACG GCAGCCAACT TTGAGTTACC CTCAGAAGTG
GAAGTGGTTG ATCCAACCCA GTATGTGGCA ACGCTGACCG AAGGCTCGAA ATTAGAGATG
GAGTTTCGGA TCGAAACAGG AACCGGGTAT AAAGGGGTTG AGCGAGGCAA AGATGACGGT
ACATCCCTTG ACTTTCTAGA GATCGATGCC GTGTTTATGC CGGTGACTAA GGTCAATTAC
ATCGTCGAGG ACATCAGGGG AGAACACGGG GAAGCCCAAG ATCGGCTAAT TTTGGAAATT
TGGACGAATG GGAGTTTTAA TCCCAAGGAA GCCCTATCTG AAGCTGCTGA GATTGTGGTG
GATTTGTTTA GTCCCCTGAA AGACCTGAAC CAGCTCGAAA CCACCACCCC TGACTATCAA
GACGATGAGA ATCCGCAAAG TCAGATTCCC ATCGAAGAAT TACAGCTTTC GGTCAGGGCT
TACAACTGTC TAAAACGGGC ACAAATTAAT ACAGTGGCCG ATTTATTAGA TTATAGCCAA
GAAGATCTCT TGGAGATCAA AAACTTCGGT CAAAAATCGG CTGAAGAAGT GATCGAAGCT
TTGCAAAAGC GGTTAGGCAT TACCCTACCC CAAGAAAAAG CGAAATCTTA A
 
Protein sequence
MVVAQFQIEC IESKTQKNQS QYSKFVLEPL ARGQGTTVGN ALRRVLLANL QGAAVTAIRI 
AGVNHEFATI PGVREDVLEI MLNMKEIVLK SYSDQAQIGR LVATSAGTVT AANFELPSEV
EVVDPTQYVA TLTEGSKLEM EFRIETGTGY KGVERGKDDG TSLDFLEIDA VFMPVTKVNY
IVEDIRGEHG EAQDRLILEI WTNGSFNPKE ALSEAAEIVV DLFSPLKDLN QLETTTPDYQ
DDENPQSQIP IEELQLSVRA YNCLKRAQIN TVADLLDYSQ EDLLEIKNFG QKSAEEVIEA
LQKRLGITLP QEKAKS