Gene Clim_1155 details

Gene Information

Locus tag: Clim_1155
Symbol:
ID: 6353671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1251631
End bp: 1252761
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 52%
IMG OID: 642668772
Product: bacteriochlorophyll/chlorophyll a synthase
Protein accession: YP_001943203
Protein GI: 189346674
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0382] 4-hydroxybenzoate polyprenyltransferase and related prenyltransferases
TIGRFAM ID: [TIGR01476] bacteriochlorophyll/chlorophyll synthetase
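
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both are assumptions that happen to match the values in this record. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1251631, 1252761)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```

Both results agree with the Gene Length and Protein Length fields of the record.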


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.142622
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACAA ACGCTGAGAG AAAAAGCCTC ACCGGGTCGG ATTTTTCCAT GCAAGGCGTG 
CTGTTCTTTC ATTTTATCGT GTTATTTTTG ATGCCGGCAG CAATTTTTCA CTTATTAATG
ACAATGAGCG TGGCCAGAGG TGATTCAAAA CAGCGTACGG ACATATCGGA TAAAACAACA
AACGGCATAC AGAAACCGCT CAATGTCCGC AAGTTCGTTG CGCCCCTGAA CCGCTCAACC
GAAATCGGCT CAAGGCTTGC GCTCTTTATA CGCTTCCTCA AGCCGGTAAC CTGGATTCCG
GTGATGTGGA GTTTTCTCTG CGGAGCCGTA GCAAGCGGAA AATTCGGATG GCACGACATC
ATCGAAACGA AATTCATTCT TGCCATGCTG CTTACCGGAC CGCTGGCAAC GGGCACGTGC
CAGATGCTGA ACGACTATTT CGACCGCGAC CTCGATGAAA TCAACGAACC TGACCGCCCT
ATTCCCGGCG GAGCGATATC ATTGCAGAAT GCGACCATCC TGATTGCTGT CTGGTCGATA
CTATCGGTTA TCGCCGGTTA TCTGATCAAT CCGCTGATCG GCTTTTATGT CGTCATCGGT
ATCATCAATG CTCACCTCTA CAGCGCAAAC CCCATCAAAC TCAAGAAGCG CCTCTGGGCC
GGCAATATCA TCGTCGCCGT ATCATACCTG ATCATTCCCT GGGTTGCCGG TGAAATCGCA
TATAACCCTC AACTGAGCCT CGACTCGCTG CAGCCATCCC TGATCATCGC CTCCATGTAC
ACCATTGCCA GCACCGGCAC GATGACGATC AACGACTTTA AATCCATTGA CGGTGACCGT
CAGGCCGGCA TCCGAACCTT GCCTGCCGTA TTCGGCGAAA CCAACGCAGC TCTCATTGCC
TCACTGCTGA TCAATCTTGG GCAGCTCCTT GCCACTGCCT GGCTTCTCCT TTCAGGAATG
ATCTGGTTCG GATGGTTTAC CGCAGCTTTG ATCGTTCCGC AGTTTCTCCT GCAGTTCAGC
CTTGTCCGAT CTCCCCGAAC CATGGATGTT CGCTACAACG CCATTGCCCA GAACTTCCTC
GTGACAGGCA TGCTGGTCTG CGCCCTTGCC ATTAAAGCAT CCCGACCATG A
 
Protein sequence
MATNAERKSL TGSDFSMQGV LFFHFIVLFL MPAAIFHLLM TMSVARGDSK QRTDISDKTT 
NGIQKPLNVR KFVAPLNRST EIGSRLALFI RFLKPVTWIP VMWSFLCGAV ASGKFGWHDI
IETKFILAML LTGPLATGTC QMLNDYFDRD LDEINEPDRP IPGGAISLQN ATILIAVWSI
LSVIAGYLIN PLIGFYVVIG IINAHLYSAN PIKLKKRLWA GNIIVAVSYL IIPWVAGEIA
YNPQLSLDSL QPSLIIASMY TIASTGTMTI NDFKSIDGDR QAGIRTLPAV FGETNAALIA
SLLINLGQLL ATAWLLLSGM IWFGWFTAAL IVPQFLLQFS LVRSPRTMDV RYNAIAQNFL
VTGMLVCALA IKASRP
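
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following Python strips that layout, translates the DNA codon by codon, and computes a GC fraction. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so the standard codon assignments are used here. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the sketch does not handle ambiguous bases or empty input.

```python
# Codon table in NCBI order (first, second, third base each cycling T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence as printed in this record.
first_row = "ATGGCAACAA ACGCTGAGAG AAAAAGCCTC ACCGGGTCGG ATTTTTCCAT GCAAGGCGTG"
print(translate(first_row))  # MATNAERKSLTGSDFSMQGV
```

The translated first row matches the first twenty residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.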