Gene Cagg_1563 details

Gene Information

Locus tag: Cagg_1563
Symbol:
ID: 7267340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1912775
End bp: 1913950
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 55%
IMG OID: 643566405
Product: Lycopene beta and epsilon cyclase
Protein accession: YP_002462901
Protein GI: 219848468
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID: [TIGR01790] lycopene cyclase family protein
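
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1912775, 1913950)
    print(length)                     # 1176
    print(protein_length_aa(length))  # 391
```

Both values agree with the Gene Length and Protein Length fields of this record.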


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACG ACTACATCAT TGCCGGCGCA GGTGCTGCCG GTTTGAGTTT AGCTTATCAT 
CTTGGTCAGA GTCCGTTAGC CGGTCATTCA ATCCTTTTAG TAGACCCCAA TGCTGTCCGA
CGCAATGATC GCACATGGTG TTTTTGGGAG GTGGGTACCG GCCCGTTTGA GGCTGTGGTA
TCCCGGCGTT GGGATTATCT CTGGCTGTAC GATGAGAGTT GGTCGGCGCG GTTGGCGATT
GCCCCGTATC AGTATAAGTT GATCCAGGGA ATTGATTTTT ATCGCTACGT TGACGATTGG
ATCGCGCAAC GACCGCAGAT CACCCGCTTA CAGGGGACTA TTGATCGGTT TATCGAGTTG
CCCGACGGTG TAGCTGTCGT TGTTGGTGGT AAGACGTACA CCGCGCGGTT TGGTTTTAAT
AGCGTGTATC GCCCCACTCC TACGCCACCC GGTTATCACT CGTGGCTGCA ACATTTTAAA
GGATGGGTCA TCACTACGCC TCGTCCGGTG TTTGATGCCG AAGCCGCTAC TTTTATGGAC
TTCCGCATTC CACAGGTTGG TGATGTCCGG TTTGGCTATG TCTTACCGTT CGATACCTAT
ACTGCGCTCG TCGAATATAC TATCTTCTCA CCCCAACTCG TCAGCCAGGC CGAATACGAA
GCCGGTTTGC AGCGGTACAT TGCCGATCAA CTCGGTATTG ACCGCTATGA GATTACGCAC
GTCGAGTATG GCGTCATCCC GATGAGTGAT GTGCCGTTGC CGCTGCGCCC TAGCCCTCAT
GTGCTCAATA TCGGTACCGC AGGGGGGATG AGTAAGCCAT CGACCGGCTA CACCTTCCAG
CGTATTCAGT GGCAGGTGCG CCAGATTGTC GAGAGTCTGC TACGGAATGG GCATCCGTTC
TACAAGCCGC CACGCTTCAA CCGGCATGCA CTGATGGATA GCGTGCTGCT TAACGTGCTC
GATGCCGGAC GCACGCCGGG GCATCGTTTC TTCGCCAATC TGTTCCGTCA TAACCCGATC
CAGCGCGTAC TTCGCTTCCT CGATGAGGAG ACGACACTTG CTGAGGATTT GGCCTTGATG
TCAACGGTGG AGATTACGCC GTTTATGGTT GCGGCGTTGG CGGTGGCGTG GGGGCGTATG
AGTGCGTTGG GGCGGCGTGA GTTGGAGATG GGGTGA
 
Protein sequence
MDYDYIIAGA GAAGLSLAYH LGQSPLAGHS ILLVDPNAVR RNDRTWCFWE VGTGPFEAVV 
SRRWDYLWLY DESWSARLAI APYQYKLIQG IDFYRYVDDW IAQRPQITRL QGTIDRFIEL
PDGVAVVVGG KTYTARFGFN SVYRPTPTPP GYHSWLQHFK GWVITTPRPV FDAEAATFMD
FRIPQVGDVR FGYVLPFDTY TALVEYTIFS PQLVSQAEYE AGLQRYIADQ LGIDRYEITH
VEYGVIPMSD VPLPLRPSPH VLNIGTAGGM SKPSTGYTFQ RIQWQVRQIV ESLLRNGHPF
YKPPRFNRHA LMDSVLLNVL DAGRTPGHRF FANLFRHNPI QRVLRFLDEE TTLAEDLALM
STVEITPFMV AALAVAWGRM SALGRRELEM G
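
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be compared. The following is a minimal sketch of that clean-up followed by a codon-by-codon translation. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean_sequence`, `translate`, and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate in frame 1; unknown codons become 'X'."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


if __name__ == "__main__":
    first_line = "ATGGATTACG ACTACATCAT TGCCGGCGCA GGTGCTGCCG GTTTGAGTTT AGCTTATCAT"
    last_line = "AGTGCGTTGG GGCGGCGTGA GTTGGAGATG GGGTGA"
    print(translate(first_line))  # MDYDYIIAGAGAAGLSLAYH
    print(translate(last_line))   # SALGRRELEMG
```

Each full line of the gene sequence holds 60 bases, a multiple of three, so every line starts in frame and can be translated on its own. The two results match the beginning and the end of the protein sequence listed above.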