Gene PCC8801_3850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3850 
Symbol 
ID7102138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4029557 
End bp4030627 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content45% 
IMG OID643476855 
Productphotosystem q(b) protein 
Protein accessionYP_002373956 
Protein GI218248585 
COG category 
COG ID 
TIGRFAM ID[TIGR01151] photosystem II, DI subunit (also called Q(B)) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCATG TTATCCAACG TCGCCGAGAA TGGGATATAG GTAGCAGTTG GGACAAGTTT 
TGCCAATGGG TAACGAGTAC CGATAATCGG ATTTATATCG GTTGGTTTGG CCTGTTGATG
ATTCCCACCT TAATCGCTGC TATCACTTGC TTTATTATCG CCTTTATTAC CGCTCCTGCT
GTGGATATGG AAGGCATTCG GGAACCCATT TTAGGCTCAA TTTTGAGTGG TAATAACGTC
ATTTCCGCCG CCGTCGTTCC CACTTCCGCC GCCATTGGCC TACACTTCTA TCCTATCTGG
GATGCTGCCT CGATGGATGA ATGGCTCTAC AATGGAGGCC CCTATCAACT GATCATTTTC
CATTTCTTAA TTGGAATTTG GTGTTATTTA GGTCGTTTGT GGGAATTGAG CTACCGTTTA
GGTATGCGTC CTTGGATTTC CGTTGCTTTT TCTGCCCCTG TTGCAGCAGC GACTTCTATC
TTTCTGATTT ATCCTATTGG ACAAGGAAGC TTTTCTGAAG GAATGCCCCT CGGTATTAGC
GGCACATTCC ACTTTATGTT AGCCTTCCAA GCGGCTCATA ACATCCTGAT GCACCCCTTA
CATATGTTAG CCGTTTCAGG AGTGTTTGCG GGAGCGTTAC TGGCTGCTTT ACACGGTTCT
TTAGTGACTT CTAGCCTCAT TCGGGAAACC ACCATCGAAG AATCAGTTAA TGAAGGGTAT
CACTTCGGTC AGGAGGAAAC CACCTATAAT TTAGTCGCTG GCCACGCGGG TTATTTAGGT
CGTTTGTTAA TTCCCAGTTT GGGATGGCAA AATAGCCGTT CAATTCACTT TATTTTAGGA
GCTATTCCTG TTATTGGAAT TTGGTGTGCT GCCTTGGCTA TTGGGGTGAT GGCTTTTAAC
CTCAATGGGT TTAATTTTAA TCAATCTATT CATGATAGCC AAGGTCATCC TATCCTCACC
GAAGCTGATA TGTTAAATCG CGCTAATTTA GGCATTCGTG CCATGCACGC TCCCAATACC
CATCATTTTC CCCTTACCTT AGCTAGTGGA GAAAGTATCC CCCTTAGTTA A
 
Protein sequence
MTHVIQRRRE WDIGSSWDKF CQWVTSTDNR IYIGWFGLLM IPTLIAAITC FIIAFITAPA 
VDMEGIREPI LGSILSGNNV ISAAVVPTSA AIGLHFYPIW DAASMDEWLY NGGPYQLIIF
HFLIGIWCYL GRLWELSYRL GMRPWISVAF SAPVAAATSI FLIYPIGQGS FSEGMPLGIS
GTFHFMLAFQ AAHNILMHPL HMLAVSGVFA GALLAALHGS LVTSSLIRET TIEESVNEGY
HFGQEETTYN LVAGHAGYLG RLLIPSLGWQ NSRSIHFILG AIPVIGIWCA ALAIGVMAFN
LNGFNFNQSI HDSQGHPILT EADMLNRANL GIRAMHAPNT HHFPLTLASG ESIPLS