Gene Cyan8802_3899 details

Gene Information

Locus tag: Cyan8802_3899
Symbol:
ID: 8393249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 4009553
End bp: 4010623
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 45%
IMG OID: 644981824
Product: photosystem q(b) protein
Protein accession: YP_003139538
Protein GI: 257061650
COG category:
COG ID:
TIGRFAM ID: [TIGR01151] photosystem II, DI subunit (also called Q(B))
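
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic. It assumes 1-based inclusive coordinates and assumes the gene length includes a terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4009553
END_BP = 4010623


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1071
print(protein_length(length_bp))  # 356
```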


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0661393
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.162734
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCATG TTATCCAACG TCGCCGAGAA TGGGATATAG GTAGCAGTTG GGACAAGTTT 
TGCCAATGGG TAACAAGTAC CGATAATCGC ATTTATATCG GTTGGTTTGG CCTGTTGATG
ATTCCCACCT TAATCGCTGC TATCACTTGC TTTATTATCG CCTTTATTAC CGCTCCTGCT
GTGGATATGG AAGGCATTCG GGAACCCATT TTAGGCTCAA TTTTGAGTGG TAATAACGTC
ATTTCCGCCG CCGTCGTTCC CACTTCCGCC GCCATTGGCC TACACTTCTA TCCTATCTGG
GATGCTGCGT CGATGGATGA ATGGCTCTAC AATGGAGGCC CCTATCAACT GATCATTTTC
CATTTCTTAA TTGGAATTTG GTGTTATTTA GGTCGTTTGT GGGAATTGAG CTACCGTTTA
GGTATGCGTC CTTGGATTTC CATCGCTTTT TCTGCCCCTG TTGCAGCAGC GACTTCTATC
TTTCTGATTT ATCCTATTGG ACAAGGGAGC TTTTCTGAAG GAATGCCCCT CGGTATTAGC
GGCACATTCC ACTTTATGTT AGCCTTCCAA GCTGCCCATA ATATCCTCAT GCACCCCTTA
CATATGTTAG CGGTGTCAGG CATCTTTGCG GGGGCTTTAT TAGCTTCTTT GCACGGTTCC
TTAGTCACTT CTAGCCTTAT TCGGGAAACC ACCATCGAAG AATCGATTAA TCAAGGGTAT
CACTTCGGTC AGGAGGAAAC CACCTATAAT TTAGTCGCTG GCCACGCGGG TTATTTAGGT
CGTTTGTTAA TTCCCAGTTT GGGATGGCAA AATAGCCGTT CAATTCACTT TATTTTAGGA
GCTATTCCTG TTATTGGAAT TTGGTGTGCT GCCTTGGCTA TTGGGGTTAT GGCTTTTAAC
CTCAATGGGT TTAATTTTAA TCAATCTATT CATGATAGCC AAGGTCATCC TATCCTCACC
GAAGCTGATA TGTTAAATCG CGCTAATTTA GGCATTCGTG CCATGCACGC TCCCAATACC
CATCATTTTC CTCTTACCTT AGCCAGTGGT GAAAGTGTTC CCCTTAGTTA A
 
Protein sequence
MTHVIQRRRE WDIGSSWDKF CQWVTSTDNR IYIGWFGLLM IPTLIAAITC FIIAFITAPA 
VDMEGIREPI LGSILSGNNV ISAAVVPTSA AIGLHFYPIW DAASMDEWLY NGGPYQLIIF
HFLIGIWCYL GRLWELSYRL GMRPWISIAF SAPVAAATSI FLIYPIGQGS FSEGMPLGIS
GTFHFMLAFQ AAHNILMHPL HMLAVSGIFA GALLASLHGS LVTSSLIRET TIEESINQGY
HFGQEETTYN LVAGHAGYLG RLLIPSLGWQ NSRSIHFILG AIPVIGIWCA ALAIGVMAFN
LNGFNFNQSI HDSQGHPILT EADMLNRANL GIRAMHAPNT HHFPLTLASG ESVPLS
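
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last lines of the gene sequence. It assumes the codon-to-residue assignments of NCBI translation table 11, which match the standard genetic code, and it does not handle alternative start codons. The `translate` helper is hypothetical and written only for this illustration.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses the same codon-to-residue assignments as the
# standard code; alternative start codons are not handled in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGACTCATG TTATCCAACG TCGCCGAGAA TGGGATATAG GTAGCAGTTG GGACAAGTTT"
last_line = "CATCATTTTC CTCTTACCTT AGCCAGTGGT GAAAGTGTTC CCCTTAGTTA A"
print(translate(first_line))  # MTHVIQRRREWDIGSSWDKF
print(translate(last_line))   # HHFPLTLASGESVPLS
```

Both outputs match the start and end of the protein sequence listed above. The last line can be translated on its own because the 17 preceding lines hold 60 bases each, so it begins on a codon boundary.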