Gene PCC8801_4452 details

Gene Information

Locus tag: PCC8801_4452
Symbol:
ID: 7095829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011721
Strand:
Start bp: 5346
End bp: 6539
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 45%
IMG OID: 643467409
Product: putative RNA polymerase, sigma 28 subunit, FliA/WhiG subfamily
Protein accession: YP_002364705
Protein GI: 218203850
COG category: [K] Transcription
COG ID: [COG1191] DNA-directed RNA polymerase specialized sigma subunit
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
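
The Start bp, End bp, Gene Length and Protein Length values in this record are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes inclusive 1-based coordinates and a CDS that ends in a single untranslated stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record: Start bp 5346, End bp 6539.
length = gene_length(5346, 6539)
print(length, protein_length(length))  # prints "1194 397"
```

Under these assumptions the computed values agree with the 1194 bp and 397 aa listed in the record.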


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAAC GTGGCAGCGT TCTGGAGATT TTTTCCACTT TTTTACAGTT TGAGAGTGAC 
TGGATCAGTC GTTGGATCGC CGATCCCAAA CTGCACCGGA GTATGCAGCA ATGTCTGTCT
CAATCCTCAC AATCTCAAGA ATCTAACCAT TTTTGGGCTC TCTACTGGCA TAAAGTTTGG
CAAACCCAAA AAAGTCCCCT AGCCTCCGCT CATCTTTGCG CCTATCTTCA GGAAGCGAGT
TATTGGACGG CTAAAAAAAT GACCATGACC TTTGGCAGCA GTCTGTCTCT CATGGACTTA
TTTCAAATTG CCCTGCTCAA AATTGACAAA ATTTTCCAAA CATTCAACCC GCAACAGGGC
ACCAATTTAG AACAATATGC CAGCCTGGTT TTTCGCAGTA TTATTAAAGA CGAATTACGC
CAACGACGGG AAATCGATCT CTGTACCAAT TGGGCACTAT TGCATAAATT GAGCCAGAAA
AAACTCCTTG AAGCCCTGCA ATTTCAAGGA CTGAACCCAG AGGCGATCGC CGAATACCTC
CTGGCCTGGA AATGTTTTCA AGCCCTTTAT GCCCCCAGCC AAGGGAGAAG CACTCGCAAG
ATTCCTGAAC CTGACGCGAC TATGTGGGGG CAAATCTGTC AAGTTTATAA CCAACAGAGC
TTTAAAAAGC CGTTAGAACC CGATATCCTC AAAAAAAGGC TAGAGACCTG TGCCAAAGCA
GCGCGAGCCT ATTTATACCC CCAAATGCTG TCTGTTGATG CCCCTAAACC AGGACAGGAA
GAGGGATCTT TCCTCGATAG CCTCTCCCTC GATCTGCAAC ATTCCTTAGA AAGCGAGATT
ATTGCCCAAG AAGAAGAAGA AATAAGAAAA CAAGAGCGAG AACAAATTAA TGGCGTTTTA
TTAAGAGCCT TAATTAAATT TGATGTTCAA TCCCAACGAT TGCTACAGAT GTATTATGGT
CAAGGTCTCA CGCAACAGGA GATCGCCCAA CAACTAGAAA TCAAACAGTA TACCGTGTCT
CGCCGTCTTG CTAGTCAGCG AAAAACCCTA ATTCTCACCT TAGGACAATG GGCGCAGAAC
ACCCTGCATT ATTCTCTCGA TGCGGACGTA CTTAATAAAA TAAACACGAT TCTCGAAGAG
TGGCTCAAAG TTCACTACCA TCACCCTGAC CTTAGTGCAA GAGAAGTAGA GTAA
 
Protein sequence
MQKRGSVLEI FSTFLQFESD WISRWIADPK LHRSMQQCLS QSSQSQESNH FWALYWHKVW 
QTQKSPLASA HLCAYLQEAS YWTAKKMTMT FGSSLSLMDL FQIALLKIDK IFQTFNPQQG
TNLEQYASLV FRSIIKDELR QRREIDLCTN WALLHKLSQK KLLEALQFQG LNPEAIAEYL
LAWKCFQALY APSQGRSTRK IPEPDATMWG QICQVYNQQS FKKPLEPDIL KKRLETCAKA
ARAYLYPQML SVDAPKPGQE EGSFLDSLSL DLQHSLESEI IAQEEEEIRK QEREQINGVL
LRALIKFDVQ SQRLLQMYYG QGLTQQEIAQ QLEIKQYTVS RRLASQRKTL ILTLGQWAQN
TLHYSLDADV LNKINTILEE WLKVHYHHPD LSAREVE
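
The gene sequence in this record is displayed in space-separated blocks of ten bases, so it needs to be cleaned before any per-base statistic is computed. The following is a minimal Python sketch of a GC-content calculation on such text. The record does not say how its GC content figure was derived; this assumes a plain count of G and C over all bases, and the function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for display blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First display line of the gene sequence from the record, used as a sample input.
first_line = "ATGCAAAAAC GTGGCAGCGT TCTGGAGATT TTTTCCACTT TTTTACAGTT TGAGAGTGAC"
print(f"{gc_content(first_line):.2f}")
```

Passing the full gene sequence (all display lines joined) instead of the single sample line gives a figure that can be compared against the GC content listed in the record.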