Gene PCC8801_1866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1866 
Symbol 
ID7104984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1955568 
End bp1956548 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content44% 
IMG OID643474932 
Productbacteriochlorophyll/chlorophyll a synthase 
Protein accessionYP_002372065 
Protein GI218246694 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0382] 4-hydroxybenzoate polyprenyltransferase and related prenyltransferases 
TIGRFAM ID[TIGR01476] bacteriochlorophyll/chlorophyll synthetase
[TIGR02056] chlorophyll synthase, ChlG 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGACT CTTCTACGGC TAAAAGCAAC GAAGGCGGTA GCAAAACCCG TCAATTATTA 
GGGATGAAGG GGGCAACCTC TGGCGAAACG TCCCTCTGGA AATTGCGACT GCAACTGATG
AAACCCATCA CCTGGATACC GCTAATTTGG GGCGTAGTCT GTGGGGCAGC TTCATCGGGG
GGATATACCT GGACGGTGGA AAATGTCCTC AAAATCGCTG CTTGTATGCT GCTATCGGGT
CCTTTGATGG CCGGGTATAC CCAAACCCTC AATGATTTTT ATGACCGCGA AATTGATGCT
ATTAATGAGC CCTATCGTCC TATCCCATCC GGGGCGATTT CGATTCCCCA GGTAGTTACC
CAAATTTTAG TCCTCTTAGG GGCAGGATTA GCCCTAGGAT ACGGTTTAGA TGTCTGGGCA
GGTCATGAGT TCCCGATGAT GTTTTCTTTG ACGTTAGGGG GCGCATTTAT CGCTTATATC
TATTCTGCCC CTCCATTGAA GTTAAAACAA AATGGTTGGT TAGGCAATTA TGCTTTAGGA
TCGAGTTATA TTGCTTTGCC TTGGTGGGCT GGTCATGCCC TATTTGGTCA ATTAAACTGG
ACGATTGTTA TTTTGACCTT GTTTTATAGT TTGGCCGGGT TAGGTATTGC AGTGGTTAAT
GATTTTAAGA GTGTGGAAGG CGATCGTCAA TTAGGCTTAA AATCTCTTCC AGTGATGTTT
GGCATTGATA CGGCTGCTTG GATTTGTGTG ATTATGATTG ATGTGTTTCA AGCAGGAATT
GCTGGTTATT TAATCTATGT TAACCAAAAT CTTTATGCTG CTATCTTGCT GTTATTGGTG
ATTCCTCAAA TTACTTTCCA AGATATGTAT TTCCTCCGTG ATCCACTTAA AAATGATGTT
AAATATCAAG CCAGTGCTCA ACCTTTCTTA GTGTTAGGAA TGTTAGTAGC AGGTCTAGCT
TTGGGTAACG CAGGGGTTTA G
 
Protein sequence
MSDSSTAKSN EGGSKTRQLL GMKGATSGET SLWKLRLQLM KPITWIPLIW GVVCGAASSG 
GYTWTVENVL KIAACMLLSG PLMAGYTQTL NDFYDREIDA INEPYRPIPS GAISIPQVVT
QILVLLGAGL ALGYGLDVWA GHEFPMMFSL TLGGAFIAYI YSAPPLKLKQ NGWLGNYALG
SSYIALPWWA GHALFGQLNW TIVILTLFYS LAGLGIAVVN DFKSVEGDRQ LGLKSLPVMF
GIDTAAWICV IMIDVFQAGI AGYLIYVNQN LYAAILLLLV IPQITFQDMY FLRDPLKNDV
KYQASAQPFL VLGMLVAGLA LGNAGV