Gene P9303_20661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_20661 
SymbolchlG 
ID4776622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1817688 
End bp1818698 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content54% 
IMG OID640087575 
Productbacteriochlorophyll/chlorophyll a synthase 
Protein accessionYP_001018067 
Protein GI124023760 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0382] 4-hydroxybenzoate polyprenyltransferase and related prenyltransferases 
TIGRFAM ID[TIGR01476] bacteriochlorophyll/chlorophyll synthetase
[TIGR02056] chlorophyll synthase, ChlG 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.22766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTGT CAGCTGGTTC TTTGACGACG AAGTTACCTC CGCCTGAGAC GCCTGCCGTG 
AGCGACGCTC GCCAACTCCT TGGCATCAAA GGTGCCTCTG GCACCACCAA CATCTGGAAG
CTGCGTCTGC AGCTCATGAA GCCTGTCACC TGGATCCCAC TGCTCTGGGG GGTGATTTGT
GGAGCAGCTG CCAGTGGTCA ATACCAATGG CGCGTACCAG ATGTGCTTGC AGCCGCGGCC
TGCATGGTCA TGAGTGGCCC GCTTCTAACC GGTTACACCC AAACCATCAA CGACTACTAC
GATCGCGAGA TCGACGCAAT CAATGAGCCC TATAGGCCGA TTCCCTCCGG GGCCATCCCG
CTGACACAAG TCAAGCTTCA GATCTGGATG CTGCTTCTCG GTGGTCTTGC AGTTGCCTAC
GGCTTAGATC GTTGGGCTGA ACACACCACT CCCGTTGTGC TGTTTCTTGC CCTAGGCGGC
TCATTTGTGA GCTTCATTTA TTCAGCCCCA CCTCTCAAGC TGAAACAGAA CGGTTGGATA
GGAAACTACG CGCTTGGTGC CAGCTACATC GCCCTGCCAT GGTGGGCTGG TCAGGCGCTG
TTCGGACAAC TGACTTGGAC CACAGCACTG CTGACACTTG CCTATAGCTT GGCTGGTCTT
GGTATAGCCG TCATCAACGA TTTCAAGAGT GTAGAGGGAG ATCGAGCCCT CGGTCTTCAA
TCCCTGCCAG TTGTCTTTGG AATCAAAAAG GCCAGCTGGA TCAGTGCAGG AATGATCGAC
ATCTTCCAGC TCGCGATGGT TGTTGTTCTG ATCGCCATTG GCCAGCATTT TGCCTCAGTC
GTTTTGATCT TATTGATCAT TCCCCAGATC ACATTCCAAG ACATCTGGTT ACTGCGTGAT
CCCCTGGCCT TTGATGTGAG ATATCAGACA AGTGCTCAAC CATTCCTAAT CCTTGGCATG
TTGGTTACTG CTCTAGCCAT AGGCCATAGC CCATTAACAC AGGTGATGTG A
 
Protein sequence
MALSAGSLTT KLPPPETPAV SDARQLLGIK GASGTTNIWK LRLQLMKPVT WIPLLWGVIC 
GAAASGQYQW RVPDVLAAAA CMVMSGPLLT GYTQTINDYY DREIDAINEP YRPIPSGAIP
LTQVKLQIWM LLLGGLAVAY GLDRWAEHTT PVVLFLALGG SFVSFIYSAP PLKLKQNGWI
GNYALGASYI ALPWWAGQAL FGQLTWTTAL LTLAYSLAGL GIAVINDFKS VEGDRALGLQ
SLPVVFGIKK ASWISAGMID IFQLAMVVVL IAIGQHFASV VLILLIIPQI TFQDIWLLRD
PLAFDVRYQT SAQPFLILGM LVTALAIGHS PLTQVM