Gene PCC8801_2842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2842 
Symbol 
ID7104369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2929733 
End bp2931676 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content41% 
IMG OID643475878 
Productsqualene/oxidosqualene cyclase 
Protein accessionYP_002372997 
Protein GI218247626 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGACAC AAAACAGAGT AACGTCAACT CAAAAAGTTG AGCTATCAAA CCTCACTAAA 
GCTATCATCG CCAGTCAAAA TTATATAATG TCCCGACAAT ATCCTGAAGG ATACTGGTGG
GGTGAACTAG AATCGAATAT AACTCTAACT GCGGAAACTA TCTTACTCCA CAAAATTTGG
AAAACTGACA AAACCCGTCC TTTCCATAAA GTAGAAACCT ATCTGCGTCG TCAACAAAAT
GAACAGGGAG GATGGGAACT TTTTTATGGA GATGGGGGAG AATTGAGTAC CTCTGTTGAA
GCATATATGG CACTCCGTTT ATTAGGAGTT ACCCCAGAAG ATCCTGCCCT AATTCGCGCT
AAAGACTTTA TCCTTAGTCA AGGTGGTATT AGCAAAACCC GCATTTTTAC TAAGTTTCAT
CTAGCGTTAA TTGGCTGTTA TGATTGGAAA GGAATCCCGT CTATTCCGCC TTGGATTATG
CTTTTTCCTG ATAACTTCCC TTTCACAATT TATGAGATGT CGAGTTGGGC TAGGGAAAGT
ACTGTTCCTC TGCTAATTGT CTTTGATAAA AAGCCTATTT TTGAGATTGA ACCAGCCTTT
AATCTTGATG AATTGTATGC AGAAGGTGTT GAAAATGTCA AGTATGCTTT ACCTCGTAAT
CATAATTGGT CAGACATCTT TTTAGGACTG GATAAATTGT TTAAATGGAC AGAAAAAAAT
AACTTAGTTC CTTTCCATAA AAAGAGTCTC CAAGCTGCTG AAAGATGGAT GTTAAACCAT
CAACAAGAAA GCGGAGATTG GGGAGGAATT ATGCCACCGA TGGTTAACTC CTTAATTGCC
TTCAAAGTGC TGAATTACGA TGTTGCTGAT CCCTCGGTTC AACGGGGTTT TGAAGCCATT
GATCGCTTTT CCATTGAAGA AGAAGATACC TATCGGGTTC AAGCTTGTGT CTCACCTGTT
TGGGACACCG CATGGGTGAT TAGAGCCCTA GTTGACTCAG GGTTAAAACC CGATCATCCT
TCGTTAGTTA AAGCTGGTGA ATGGTTACTG GATAAACAAA TCCTTGAATA CGGAGATTGG
GCAATTAAAA ATAAACAAGG AAAACCAGGG GGTTGGGCGT TTGAATTTAT TAACCGTTTC
TATCCCGATC TCGATGATTC TGCCGTTGTT GTCATGGCCT TAAATGGCAT CAAATTACCC
GATGAAAATT GCAAAAAAGC AGCTATAAAT CGCTGCCTAG AATGGATGGC AACCATGCAA
TGTAAACCAG GGGGCTGGGC AGCTTTTGAT GTCGATAATG ATCAAGCTTG GATTAATGAA
ATTCCCTATG GCGATCTTAA AGCGATGATC GATCCCAATA CCGCAGATGT CACCGCTAGG
GTATTAGAAA TGGTGGGATC GTGTGGCTTG AAAATGGATG AAAACCGCGT TCAAAAAGCC
CTATTTTATT TAGAAAAAGA ACAGGAATCT GATGGGAGTT GGTTTGGACG ATGGGGAGTT
AACTATATCT ACGGAACCAG TGGTGTTTTA TCAGCATTAG CGGTTATTGC ACCCAATACC
CATAAACCTC AGATGGAAAA AGCCGTTAAT TGGTTAATTA GCTGCCAAAA TGAGGACGGA
GGATGGGGAG AAACCTGTTG GAGTTATAAC GATCCTTCCC TCAAAGGAAC AGGGGTTAGT
ACCGCTTCTC AAACCGCTTG GGCACTTATT GGATTATTGG ATGCGGGAGA AGCCTTAGAA
ACCTTAGCCA CAGATGCCAT AAAACGGGGA ATTAACTATT TATTAGACAC CCAAACCCCT
GACGGAACCT GGGAAGAAGC GGAGTTTACT GGAACAGGGT TTCCCTGTCA TTTCTATATC
CGTTATCACC TCTATCGTCA TTATTTTCCT TTAATTGCTT TAGGACGTTA TTGGAAAATT
GGGTTGAAAA ATCTGAAGGG GTGA
 
Protein sequence
MQTQNRVTST QKVELSNLTK AIIASQNYIM SRQYPEGYWW GELESNITLT AETILLHKIW 
KTDKTRPFHK VETYLRRQQN EQGGWELFYG DGGELSTSVE AYMALRLLGV TPEDPALIRA
KDFILSQGGI SKTRIFTKFH LALIGCYDWK GIPSIPPWIM LFPDNFPFTI YEMSSWARES
TVPLLIVFDK KPIFEIEPAF NLDELYAEGV ENVKYALPRN HNWSDIFLGL DKLFKWTEKN
NLVPFHKKSL QAAERWMLNH QQESGDWGGI MPPMVNSLIA FKVLNYDVAD PSVQRGFEAI
DRFSIEEEDT YRVQACVSPV WDTAWVIRAL VDSGLKPDHP SLVKAGEWLL DKQILEYGDW
AIKNKQGKPG GWAFEFINRF YPDLDDSAVV VMALNGIKLP DENCKKAAIN RCLEWMATMQ
CKPGGWAAFD VDNDQAWINE IPYGDLKAMI DPNTADVTAR VLEMVGSCGL KMDENRVQKA
LFYLEKEQES DGSWFGRWGV NYIYGTSGVL SALAVIAPNT HKPQMEKAVN WLISCQNEDG
GWGETCWSYN DPSLKGTGVS TASQTAWALI GLLDAGEALE TLATDAIKRG INYLLDTQTP
DGTWEEAEFT GTGFPCHFYI RYHLYRHYFP LIALGRYWKI GLKNLKG