Gene Cphamn1_2226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2226 
Symbol 
ID6375920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2411348 
End bp2412358 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content45% 
IMG OID642684713 
Productbacteriochlorophyll c synthase 
Protein accessionYP_001960612 
Protein GI189501142 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0382] 4-hydroxybenzoate polyprenyltransferase and related prenyltransferases 
TIGRFAM ID[TIGR01476] bacteriochlorophyll/chlorophyll synthetase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTTA GTGTCAACCG CAGTATGAGT TTTACCGACA AGGTCAGAGC CCACCTTGAA 
ATTCTTGATC CGGTTACCTG GATCAGTGTT TTCCCTTGCC TTGCCGGTGG TGTCATGGCT
TCGGGCGCCA TGCAGCCCAC GCTTCATGAT TATTTTCTCC TTCTGGCGAT TTTTTTAATG
TTCGGTCCTC TGGGCACCGG ATTCAGCCAG TCAATAAACG ATTATTATGA TCTTGAACTG
GACAAGGTCA ACGAGCCGAC ACGGCCTATT CCCTCAGGGC GAATGACTGA GAAAGAAGCT
GTCTGGAACA GCGTCGTGGT TTGTCTGCTG GCTCTTTGTC TTGGTGTTTT TCTCGGCTTT
TACATTGGCG GCGAAAGAGG ACTGATTATC ACGTCCTCGA TAGTCGCTGG TCTGATCGTT
GCCTACATCT ATTCTGCGCC ACCGCTGAAG CTCAAGAAAA ACATACTGAC TTCTGCACCG
GCCGTAGGTT TTTCATACAG TCTGGTAACC TGGTTTTCGG CAAATGCCCT GTTCAGTGAA
ATTCGGCCGG AAGTATACTG GCTGGCGGGA CTTAACTTTT TTATGGCAAT GGCGCTTATC
ATCATGAATG ATTTCAAATC CGCAAAGGGA GACAAAGAAG GGGGGATGAA GTCGCTTACA
GTTATGATAG GTATGAAAAA TACTTTTCTG GTTTCATTTA TTATGATCGA TCTGGTGTTT
CTTGTTTTTG CCTGGCTTGA ATATCAATGG GGCTTTTATT ATCTGGTTGT TTTAATGCTT
GGCGGATTGA TCCTTAACAT ATACATGCAG GTAAAACTGT ATGCCGATCC GAAAGGCGGC
GTGGCATTTA TGGGAAGTGC TGTAGATGAT GTTTTTGGTA ACACTATTGG ACAGAGTGAA
GTCGAAGAAC ATAAGGCCTA TCTCCGGTTT CAGATCGCAA ACAATGTTCT GTTTCTTTCC
AACAATCTGT TCGCTGCAGG CGCGATTGGT ATGAAGTATA TGCAAGGATA G
 
Protein sequence
MSVSVNRSMS FTDKVRAHLE ILDPVTWISV FPCLAGGVMA SGAMQPTLHD YFLLLAIFLM 
FGPLGTGFSQ SINDYYDLEL DKVNEPTRPI PSGRMTEKEA VWNSVVVCLL ALCLGVFLGF
YIGGERGLII TSSIVAGLIV AYIYSAPPLK LKKNILTSAP AVGFSYSLVT WFSANALFSE
IRPEVYWLAG LNFFMAMALI IMNDFKSAKG DKEGGMKSLT VMIGMKNTFL VSFIMIDLVF
LVFAWLEYQW GFYYLVVLML GGLILNIYMQ VKLYADPKGG VAFMGSAVDD VFGNTIGQSE
VEEHKAYLRF QIANNVLFLS NNLFAAGAIG MKYMQG