Gene Cagg_0937 details

Gene Information

Locus tag: Cagg_0937
Symbol:
ID: 7268010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1166081
End bp: 1167076
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 57%
IMG OID: 643565785
Product: Squalene/phytoene synthase
Protein accession: YP_002462291
Protein GI: 219847858
COG category: [I] Lipid transport and metabolism
COG ID: [COG1562] Phytoene/squalene synthetase
TIGRFAM ID:
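
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes one terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1166081
end_bp = 1167076
gene_length_bp = 996
protein_length_aa = 331


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_len):
    """Number of codons in the CDS minus one assumed terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Both checks hold for this record under the assumptions stated above.
span_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(span_ok, protein_ok)  # prints: True True
```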


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0100379
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGCTGA ATTCGATCTC GGTTATCACG CGCGGGAACG GATTGCCGGT TAGCCAGGCA 
CTTCCGGCAG ACGAGCAGCT TGCTCAGCTC TTTCATCTCA GCGATGTACC TTCCTCGTCA
AGTGCCGACC AACTGCCACC GCCGCGTGTT CGTTCGCTGG CAGAGGCATA TGCCTTCTGT
GATGAGGTGA TCCGCCGTCA CTCCAAGAGC TTTTTCTTTA GTACCCAGTT TTTGCCGCCA
CCACAACGAC GAGCCGTGCG AGCGTTGTAT GCGTTCTGTC GCACAACCGA TGATACGGTT
GATATGGCGA GGACTGACCC GGCCAGGGCA TTAGCCGAAT GGGTGCGCGT GGCTCGTCGT
CCGTGCCTCG ATACGGCGCA CCCGGTCCTT TTGGCATGGG CCGATACTTG CCAACGGTAT
AACCTTTCAC CGCACCTCAT CGATGAGTTG TTGGCCGGAG TAGCGATGGA TCTGACGATC
TCGCGCTATG CCACGTTTGC CGATCTGTGG CTCTATTGCT ACCGGGTCGC ATCGGTGGTG
GGGATGTTGG TGATTGGGAT TACCGGTGCT GCGCCCGGAG CGACACCATA CGCGATTAAG
CTAGGAGTGG CCTTGCAGTT GACTAATATC CTGCGCGATG TTGGCGAGGA TGCCAATCGT
GGGCGGGTGT ATCTGCCGAT CGATGAACTG GCTCGCTTTG GTTTGACTGC CGATGATATT
CTGGCCCGGG TCTACGATGA GCGCTTCATT GCATTGATGA AGTTTCAAAT CGAACGTACC
CATCGTTTGT ACGATGAGAG TTGGCCCGGT ATCGCGCTCT TGCCACCTGA AGTGCGATTG
GCCGTAGCGG CCGCAGCGCG CGTCTACCGC GGCATCCTTG ATAAAATCGT TGCTAACCGG
TATGATTCAT ACAACCACCG TGCGTATCTG TCGCTGCGCG AGAAGGTGGC ACGTTTGCCC
GGTATTTGGT GGGATGTTCA TCGCTTGGGT AGGTAG
 
Protein sequence
MSLNSISVIT RGNGLPVSQA LPADEQLAQL FHLSDVPSSS SADQLPPPRV RSLAEAYAFC 
DEVIRRHSKS FFFSTQFLPP PQRRAVRALY AFCRTTDDTV DMARTDPARA LAEWVRVARR
PCLDTAHPVL LAWADTCQRY NLSPHLIDEL LAGVAMDLTI SRYATFADLW LYCYRVASVV
GMLVIGITGA APGATPYAIK LGVALQLTNI LRDVGEDANR GRVYLPIDEL ARFGLTADDI
LARVYDERFI ALMKFQIERT HRLYDESWPG IALLPPEVRL AVAAAARVYR GILDKIVANR
YDSYNHRAYL SLREKVARLP GIWWDVHRLG R
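
The sequences above are laid out in space-separated blocks of 10 residues, 60 per line, so the spaces and line breaks have to be removed before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction helper; the function names are hypothetical, and only the first and last gene-sequence lines are embedded, for illustration.

```python
def join_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(dna):
    """Fraction of G and C bases in an uppercase DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last gene-sequence lines copied from the record above.
first_line = "GTGTCGCTGA ATTCGATCTC GGTTATCACG CGCGGGAACG GATTGCCGGT TAGCCAGGCA "
last_line = "GGTATTTGGT GGGATGTTCA TCGCTTGGGT AGGTAG"

# A full line holds 60 bases; the final line holds the remaining 36.
print(len(join_blocks(first_line)), len(join_blocks(last_line)))  # prints: 60 36
```

Applying `join_blocks` to all gene-sequence lines and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.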