Gene PCC7424_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_1049 
Symbol 
ID7111637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011729 
Strand
Start bp1150654 
End bp1151724 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content46% 
IMG OID643479319 
Productphotosystem q(b) protein 
Protein accessionYP_002376371 
Protein GI218438042 
COG category 
COG ID 
TIGRFAM ID[TIGR01151] photosystem II, DI subunit (also called Q(B)) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.0115915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACTA CTTTACAGCA ACGCGAAAGC GTTTCCCTGT GGGAACAGTT TTGTCAGTGG 
ATCACCAGCA CCAACAACCG TTTATACATC GGTTGGTTCG GTGTCATCAT GATCCCCACC
CTCTTAACTG CTACTACCTG TTTCATCATT GCTTTCATCG CTGCTCCTCC TGTAGACATC
GATGGAATCC GTGAACCCGT AGCTGGTTCT TTACTCTACG GAAACAACAT CATCTCTGGT
GCAGTTGTTC CTTCTTCCAA CGCCATTGGA TTACACTTCT ACCCCATTTG GGAAGCCGCT
TCCTTAGATG AGTGGCTTTA CAACGGTGGC CCTTACCAGT TAGTAGTATT CCACTTCTTA
ATCGGAGTAT TCTGCTACAT GGGTCGTCAG TGGGAATTAA GCTACCGCTT AGGAATGCGT
CCTTGGATTT GTGTAGCTTA CTCTGCTCCT GTATCCGCAG CTACCGCAGT ATTCTTAATC
TACCCCATCG GACAAGGTTC TTTCTCTGAT GGAATGCCTT TAGGAATCAG TGGAACATTC
AACTTCATGT TCGTTTTCCA AGCAGAACAC AACATCTTAA TGCACCCCTT CCATATGTTG
GGAGTAGCTG GTGTATTCGG AGGTTCTTTA TTCTCTGCAA TGCACGGAAG CTTAGTAACC
AGTTCTTTAG TTCGTGAAAC TACCGAAGTA GAATCTCAGA ACTATGGTTA CAAGTTCGGA
CAAGAAGAAG AAACCTACAA CATCGTAGCA GCACACGGAT ACTTCGGACG TTTAATTTTC
CAATATGCGT CCTTCAACAA CAGCCGTTCA TTACACTTCT TCTTAGGAGC ATGGCCTGTA
ATCGGTATCT GGTTCACCGC AATGGGAATC TCTACCATGG CCTTCAACCT CAACGGTTTC
AACTTCAACC AGTCTATCCT TGATTCTCAA GGTCGTGTCA TCAGCACCTG GGCTGACGTA
TTAAACCGCG CTAACTTAGG ATTTGAAGTA ATGCACGAGC GCAACGCTCA CAACTTCCCC
TTAGACTTAG CGTCTGCTGA ACCTGTTGTT GCTCCTTCCA TCAATGGCTA G
 
Protein sequence
MTTTLQQRES VSLWEQFCQW ITSTNNRLYI GWFGVIMIPT LLTATTCFII AFIAAPPVDI 
DGIREPVAGS LLYGNNIISG AVVPSSNAIG LHFYPIWEAA SLDEWLYNGG PYQLVVFHFL
IGVFCYMGRQ WELSYRLGMR PWICVAYSAP VSAATAVFLI YPIGQGSFSD GMPLGISGTF
NFMFVFQAEH NILMHPFHML GVAGVFGGSL FSAMHGSLVT SSLVRETTEV ESQNYGYKFG
QEEETYNIVA AHGYFGRLIF QYASFNNSRS LHFFLGAWPV IGIWFTAMGI STMAFNLNGF
NFNQSILDSQ GRVISTWADV LNRANLGFEV MHERNAHNFP LDLASAEPVV APSING