Gene NATL1_04811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04811 
SymbolchlG 
ID4781184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp438485 
End bp439435 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content39% 
IMG OID640083758 
Productbacteriochlorophyll/chlorophyll a synthase 
Protein accessionYP_001014310 
Protein GI124025194 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0382] 4-hydroxybenzoate polyprenyltransferase and related prenyltransferases 
TIGRFAM ID[TIGR01476] bacteriochlorophyll/chlorophyll synthetase
[TIGR02056] chlorophyll synthase, ChlG 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.233119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGATG CTAGGCAACT CCTTGGGATA AAAGGAGGTT CTGAAACAAC AAACATTTGG 
AAGCTTCGTT TGCAATTGAT GAAGCCCATT ACATGGATTC CCTTGTTATG GGGAGTTATA
TGTGGCGCAG CGGCCAGTGG TAATTATCAC TGGGAATTAA GCAACATTCT TGCTTCGATA
AGTTGCATGT TTATGAGCGG CCCACTCTTA ACTGGATATA CCCAAACAAT AAATGATTAC
TTTGATAGAG AAATTGACGC AATAAATGAA CCTAATAGAC CAATACCTTC AGGGGCAATT
TCTCTGTTCC AAGTAAAATG TCAAATTTGG GTTTTACTAA TAGCCGGTCT TGGAGTTGCC
TATTTATTAG ATTTGTGGGC ACATCACACG ATTCCTTCCG TCCTTCTTTT GGCTTTGGGA
GGTTCATTTG TAAGTTTTAT TTACTCAGCA CCACCTTTAA AACTTAAACA AAATGGGTGG
CTTGGGAATT ATGCACTTGG TGCGAGCTAT ATAGCTCTTC CTTGGTGGGC TGGACAGGCT
CTATTTGGAC ATTTAACCTG GACAACTGCC TTGCTTACTC TTGCCTATAG CTTGTCAGGT
TTAGGGATTG CTGTAATAAA TGATTTTAAA AGCGTGGAAG GGGATAAAAG CCTTGGGCTT
GAATCACTTC CTGTTGTTTT TGGTATTAAA AATGCAAGTC GTATTAGCGC AGGAATGATA
GATATCTTTC AGCTGGCAAT GGTAGTAGTT TTAATAGCTA TAGGACAACA TTTTGCATCT
GTCATTCTGG TTTTACTAAT AATTCCTCAA ATCACATTCC AAGACATATG GCTATTGCGC
GATCCATTAA AATTTGATGT TAAATACCAA GCCAGTGCAC AACCATTTCT AATTTTAGGA
ATGCTTGTGA CTGCAATAGC TATTGGACAT AGTTCCTTAA TTAGTTTATA A
 
Protein sequence
MSDARQLLGI KGGSETTNIW KLRLQLMKPI TWIPLLWGVI CGAAASGNYH WELSNILASI 
SCMFMSGPLL TGYTQTINDY FDREIDAINE PNRPIPSGAI SLFQVKCQIW VLLIAGLGVA
YLLDLWAHHT IPSVLLLALG GSFVSFIYSA PPLKLKQNGW LGNYALGASY IALPWWAGQA
LFGHLTWTTA LLTLAYSLSG LGIAVINDFK SVEGDKSLGL ESLPVVFGIK NASRISAGMI
DIFQLAMVVV LIAIGQHFAS VILVLLIIPQ ITFQDIWLLR DPLKFDVKYQ ASAQPFLILG
MLVTAIAIGH SSLISL