Gene Cyan8802_3254 details

Gene Information

Locus tag: Cyan8802_3254
Symbol:
ID: 8392590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 3330338
End bp: 3332293
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 42%
IMG OID: 644981201
Product: squalene-hopene cyclase
Protein accession: YP_003138927
Protein GI: 257061039
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
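
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3330338
end_bp = 3332293

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed
# to be a stop codon, which does not add a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1956 651
```

Under these assumptions the result agrees with the listed Gene Length (1956 bp) and Protein Length (651 aa).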


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGACAC AAAACAGAGT AACGTCAACT CAAAAAGTTG AGCTATCAAA CCTCACTCAA 
GCTATCATCG CCAGTCAAAA TTATATACTG TCCCGACAAT ATCCTGAAGG ATACTGGTGG
GGTGAACTAG AATCGAATAT TACCCTAACT GCGGAAACTG TTTTACTCCA CAAAATTTGG
AAAACCGACA AAACCCGTCC TTTCCATAAA GTAGAAACCT ATCTGCGTCG TCAACAAAAT
GAACAGGGGG GATGGGAACT TTTTTATGGA GATGGGGGAG AATTAAGTAC CTCTGTTGAA
GCATATATGG CACTCCGTTT ATTGGGAGTT ACCCCAGAAG ATCCTGCCCT AATTCGCGCT
AAAGACTTTA TCCTTAGTAA AGGGGGTATT AGCAAAACCC GCATTTTTAC TAAGTTTCAT
CTAGCATTAA TTGGCTGTTA TGATTGGAAA GGCATCCCTT CTATTCCCCC TTGGATTATG
CTTTTTCCTG ATAACTTCCC TTTCACGATT TATGAGATGT CGAGTTGGGC TAGGGAAAGT
ACCGTTCCTC TGCTAATTGT CTTTGATAAA AAGCCTATTT TTGAGATTGA ACCAGCCTTT
AATCTTGATG AATTGTATGC AGAAGGTGTT GAAAATGTCA AGTATGCTTT ACCGCGTAAT
CATAATTGGT CAGACATCTT TTTAGGACTG GATAAATTGT TTAAATGGAC GGAAAAAAAT
AACTTAGTTC CTTTCCATAA AAAGAGTCTC CAAGCTGCTG AAAAATGGAT GTTAAACCAT
CAACAAGAAA GCGGAGATTG GGGAGGAATT ATGCCGCCGA TGGTTAACTC CTTAATTGCC
TTCAAAGTGC TGAATTACGA TGTTGCTGAT CCCTCGGTTC AACGGGGTTT TGAAGCCATT
GATCGCTTTT CCATTGAAGA AGAAGATACC TATCGGGTTC AAGCTTGTGT GTCTCCTGTT
TGGGACACCG CATGGGTGAT TAGAGCTCTA GTTGATTCAG GGTTAAAACC CGATCATCCT
TCGTTAGTTA AAGCGGGTGA ATGGTTACTG GATAAACAAA TCCTTGAATA CGGAGATTGG
GCAATTAAAA ATAAGCAGGG AAAACCCGGC GGTTGGGCAT TTGAATTTAT TAACCGTTTC
TATCCCGATC TCGATGATTC TGCCGTTGTT GTCATGGCGT TAAATGGCAT CAAATTACCC
GATGAAAATC GCAAAAAAGC AGCTATAAAT CGCTGCCTAG AATGGATGGC AACCATGCAA
TGTAAACCGG GGGGGTGGGC AGCTTTTGAT GTCGATAATG ATCAAGCTTG GATTAATGAA
ATTCCCTATG GCGATCTTAA AGCGATGATC GATCCCAATA CCGCAGATGT CACCGCTAGG
GTATTAGAAA TGGTGGGATC GTGTGGCTTG AAAATGGATG AAAACCGCGT TCAAAAAGCC
CTATTTTATT TAGAAAAAGA ACAGGAATCT GATGGGAGTT GGTTTGGACG ATGGGGAGTT
AACTATATCT ACGGAACCAG TGGTGTTTTA TCAGCATTAG CGGTTATTGC ACCCAATACC
CATAAACCTC AGATGGAAAA AGCCGTTAAT TGGTTAATTA GCTGCCAAAA TGAGGACGGA
GGATGGGGAG AAACCTGTTG GAGTTATAAC GATTCTTCCC TCAAAGGAAC AGGGATTAGT
ACCGCTTCTC AAACCGCTTG GGCAATTATT GGATTATTGG ATGCGGGAGA AGCCTTAGAA
ACCTTAGCCA CAGATGCCAT AAAACGGGGA ATTGACTATT TATTAGCCAC CCAAACCCCT
GACGGAACCT GGGAAGAAGC GGAGTTCACT GGAACAGGGT TTCCCTGTCA TTTCTATATC
CGTTATCACC TCTATCGTCA TTATTTTCCT TTAATTGCTT TGGGACGTTA TTGGAAAATT
GGGTTAAAAA CCCCATCGGT CATTCCCCTC AACTAA
 
Protein sequence
MQTQNRVTST QKVELSNLTQ AIIASQNYIL SRQYPEGYWW GELESNITLT AETVLLHKIW 
KTDKTRPFHK VETYLRRQQN EQGGWELFYG DGGELSTSVE AYMALRLLGV TPEDPALIRA
KDFILSKGGI SKTRIFTKFH LALIGCYDWK GIPSIPPWIM LFPDNFPFTI YEMSSWARES
TVPLLIVFDK KPIFEIEPAF NLDELYAEGV ENVKYALPRN HNWSDIFLGL DKLFKWTEKN
NLVPFHKKSL QAAEKWMLNH QQESGDWGGI MPPMVNSLIA FKVLNYDVAD PSVQRGFEAI
DRFSIEEEDT YRVQACVSPV WDTAWVIRAL VDSGLKPDHP SLVKAGEWLL DKQILEYGDW
AIKNKQGKPG GWAFEFINRF YPDLDDSAVV VMALNGIKLP DENRKKAAIN RCLEWMATMQ
CKPGGWAAFD VDNDQAWINE IPYGDLKAMI DPNTADVTAR VLEMVGSCGL KMDENRVQKA
LFYLEKEQES DGSWFGRWGV NYIYGTSGVL SALAVIAPNT HKPQMEKAVN WLISCQNEDG
GWGETCWSYN DSSLKGTGIS TASQTAWAII GLLDAGEALE TLATDAIKRG IDYLLATQTP
DGTWEEAEFT GTGFPCHFYI RYHLYRHYFP LIALGRYWKI GLKTPSVIPL N
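
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first ten codons and the last two codons of the gene. It is a minimal, dependency-free example, not the database's own method: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons).

```python
# Codon table in the usual T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace is ignored so sequences can be pasted in 10-base blocks.
    Trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence above.
print(translate("ATGCAGACAC AAAACAGAGT AACGTCAACT"))  # prints: MQTQNRVTST

# Last 6 bases of the gene sequence above: final residue plus stop codon.
print(translate("AACTAA"))  # prints: N*
```

The first ten residues match the start of the listed protein sequence, and the final codon TAA is a stop, which is why the 1956 bp gene yields 651 residues rather than 652.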