Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2842 |
Symbol | |
ID | 7104369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2929733 |
End bp | 2931676 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643475878 |
Product | squalene/oxidosqualene cyclase |
Protein accession | YP_002372997 |
Protein GI | 218247626 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGACAC AAAACAGAGT AACGTCAACT CAAAAAGTTG AGCTATCAAA CCTCACTAAA GCTATCATCG CCAGTCAAAA TTATATAATG TCCCGACAAT ATCCTGAAGG ATACTGGTGG GGTGAACTAG AATCGAATAT AACTCTAACT GCGGAAACTA TCTTACTCCA CAAAATTTGG AAAACTGACA AAACCCGTCC TTTCCATAAA GTAGAAACCT ATCTGCGTCG TCAACAAAAT GAACAGGGAG GATGGGAACT TTTTTATGGA GATGGGGGAG AATTGAGTAC CTCTGTTGAA GCATATATGG CACTCCGTTT ATTAGGAGTT ACCCCAGAAG ATCCTGCCCT AATTCGCGCT AAAGACTTTA TCCTTAGTCA AGGTGGTATT AGCAAAACCC GCATTTTTAC TAAGTTTCAT CTAGCGTTAA TTGGCTGTTA TGATTGGAAA GGAATCCCGT CTATTCCGCC TTGGATTATG CTTTTTCCTG ATAACTTCCC TTTCACAATT TATGAGATGT CGAGTTGGGC TAGGGAAAGT ACTGTTCCTC TGCTAATTGT CTTTGATAAA AAGCCTATTT TTGAGATTGA ACCAGCCTTT AATCTTGATG AATTGTATGC AGAAGGTGTT GAAAATGTCA AGTATGCTTT ACCTCGTAAT CATAATTGGT CAGACATCTT TTTAGGACTG GATAAATTGT TTAAATGGAC AGAAAAAAAT AACTTAGTTC CTTTCCATAA AAAGAGTCTC CAAGCTGCTG AAAGATGGAT GTTAAACCAT CAACAAGAAA GCGGAGATTG GGGAGGAATT ATGCCACCGA TGGTTAACTC CTTAATTGCC TTCAAAGTGC TGAATTACGA TGTTGCTGAT CCCTCGGTTC AACGGGGTTT TGAAGCCATT GATCGCTTTT CCATTGAAGA AGAAGATACC TATCGGGTTC AAGCTTGTGT CTCACCTGTT TGGGACACCG CATGGGTGAT TAGAGCCCTA GTTGACTCAG GGTTAAAACC CGATCATCCT TCGTTAGTTA AAGCTGGTGA ATGGTTACTG GATAAACAAA TCCTTGAATA CGGAGATTGG GCAATTAAAA ATAAACAAGG AAAACCAGGG GGTTGGGCGT TTGAATTTAT TAACCGTTTC TATCCCGATC TCGATGATTC TGCCGTTGTT GTCATGGCCT TAAATGGCAT CAAATTACCC GATGAAAATT GCAAAAAAGC AGCTATAAAT CGCTGCCTAG AATGGATGGC AACCATGCAA TGTAAACCAG GGGGCTGGGC AGCTTTTGAT GTCGATAATG ATCAAGCTTG GATTAATGAA ATTCCCTATG GCGATCTTAA AGCGATGATC GATCCCAATA CCGCAGATGT CACCGCTAGG GTATTAGAAA TGGTGGGATC GTGTGGCTTG AAAATGGATG AAAACCGCGT TCAAAAAGCC CTATTTTATT TAGAAAAAGA ACAGGAATCT GATGGGAGTT GGTTTGGACG ATGGGGAGTT AACTATATCT ACGGAACCAG TGGTGTTTTA TCAGCATTAG CGGTTATTGC ACCCAATACC CATAAACCTC AGATGGAAAA AGCCGTTAAT TGGTTAATTA GCTGCCAAAA TGAGGACGGA GGATGGGGAG AAACCTGTTG GAGTTATAAC GATCCTTCCC TCAAAGGAAC AGGGGTTAGT ACCGCTTCTC AAACCGCTTG GGCACTTATT GGATTATTGG ATGCGGGAGA AGCCTTAGAA ACCTTAGCCA CAGATGCCAT AAAACGGGGA ATTAACTATT TATTAGACAC CCAAACCCCT GACGGAACCT GGGAAGAAGC GGAGTTTACT GGAACAGGGT TTCCCTGTCA TTTCTATATC CGTTATCACC TCTATCGTCA TTATTTTCCT TTAATTGCTT TAGGACGTTA TTGGAAAATT GGGTTGAAAA ATCTGAAGGG GTGA
|
Protein sequence | MQTQNRVTST QKVELSNLTK AIIASQNYIM SRQYPEGYWW GELESNITLT AETILLHKIW KTDKTRPFHK VETYLRRQQN EQGGWELFYG DGGELSTSVE AYMALRLLGV TPEDPALIRA KDFILSQGGI SKTRIFTKFH LALIGCYDWK GIPSIPPWIM LFPDNFPFTI YEMSSWARES TVPLLIVFDK KPIFEIEPAF NLDELYAEGV ENVKYALPRN HNWSDIFLGL DKLFKWTEKN NLVPFHKKSL QAAERWMLNH QQESGDWGGI MPPMVNSLIA FKVLNYDVAD PSVQRGFEAI DRFSIEEEDT YRVQACVSPV WDTAWVIRAL VDSGLKPDHP SLVKAGEWLL DKQILEYGDW AIKNKQGKPG GWAFEFINRF YPDLDDSAVV VMALNGIKLP DENCKKAAIN RCLEWMATMQ CKPGGWAAFD VDNDQAWINE IPYGDLKAMI DPNTADVTAR VLEMVGSCGL KMDENRVQKA LFYLEKEQES DGSWFGRWGV NYIYGTSGVL SALAVIAPNT HKPQMEKAVN WLISCQNEDG GWGETCWSYN DPSLKGTGVS TASQTAWALI GLLDAGEALE TLATDAIKRG INYLLDTQTP DGTWEEAEFT GTGFPCHFYI RYHLYRHYFP LIALGRYWKI GLKNLKG
|
| |