Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3254 |
Symbol | |
ID | 8392590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 3330338 |
End bp | 3332293 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644981201 |
Product | squalene-hopene cyclase |
Protein accession | YP_003138927 |
Protein GI | 257061039 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACAC AAAACAGAGT AACGTCAACT CAAAAAGTTG AGCTATCAAA CCTCACTCAA GCTATCATCG CCAGTCAAAA TTATATACTG TCCCGACAAT ATCCTGAAGG ATACTGGTGG GGTGAACTAG AATCGAATAT TACCCTAACT GCGGAAACTG TTTTACTCCA CAAAATTTGG AAAACCGACA AAACCCGTCC TTTCCATAAA GTAGAAACCT ATCTGCGTCG TCAACAAAAT GAACAGGGGG GATGGGAACT TTTTTATGGA GATGGGGGAG AATTAAGTAC CTCTGTTGAA GCATATATGG CACTCCGTTT ATTGGGAGTT ACCCCAGAAG ATCCTGCCCT AATTCGCGCT AAAGACTTTA TCCTTAGTAA AGGGGGTATT AGCAAAACCC GCATTTTTAC TAAGTTTCAT CTAGCATTAA TTGGCTGTTA TGATTGGAAA GGCATCCCTT CTATTCCCCC TTGGATTATG CTTTTTCCTG ATAACTTCCC TTTCACGATT TATGAGATGT CGAGTTGGGC TAGGGAAAGT ACCGTTCCTC TGCTAATTGT CTTTGATAAA AAGCCTATTT TTGAGATTGA ACCAGCCTTT AATCTTGATG AATTGTATGC AGAAGGTGTT GAAAATGTCA AGTATGCTTT ACCGCGTAAT CATAATTGGT CAGACATCTT TTTAGGACTG GATAAATTGT TTAAATGGAC GGAAAAAAAT AACTTAGTTC CTTTCCATAA AAAGAGTCTC CAAGCTGCTG AAAAATGGAT GTTAAACCAT CAACAAGAAA GCGGAGATTG GGGAGGAATT ATGCCGCCGA TGGTTAACTC CTTAATTGCC TTCAAAGTGC TGAATTACGA TGTTGCTGAT CCCTCGGTTC AACGGGGTTT TGAAGCCATT GATCGCTTTT CCATTGAAGA AGAAGATACC TATCGGGTTC AAGCTTGTGT GTCTCCTGTT TGGGACACCG CATGGGTGAT TAGAGCTCTA GTTGATTCAG GGTTAAAACC CGATCATCCT TCGTTAGTTA AAGCGGGTGA ATGGTTACTG GATAAACAAA TCCTTGAATA CGGAGATTGG GCAATTAAAA ATAAGCAGGG AAAACCCGGC GGTTGGGCAT TTGAATTTAT TAACCGTTTC TATCCCGATC TCGATGATTC TGCCGTTGTT GTCATGGCGT TAAATGGCAT CAAATTACCC GATGAAAATC GCAAAAAAGC AGCTATAAAT CGCTGCCTAG AATGGATGGC AACCATGCAA TGTAAACCGG GGGGGTGGGC AGCTTTTGAT GTCGATAATG ATCAAGCTTG GATTAATGAA ATTCCCTATG GCGATCTTAA AGCGATGATC GATCCCAATA CCGCAGATGT CACCGCTAGG GTATTAGAAA TGGTGGGATC GTGTGGCTTG AAAATGGATG AAAACCGCGT TCAAAAAGCC CTATTTTATT TAGAAAAAGA ACAGGAATCT GATGGGAGTT GGTTTGGACG ATGGGGAGTT AACTATATCT ACGGAACCAG TGGTGTTTTA TCAGCATTAG CGGTTATTGC ACCCAATACC CATAAACCTC AGATGGAAAA AGCCGTTAAT TGGTTAATTA GCTGCCAAAA TGAGGACGGA GGATGGGGAG AAACCTGTTG GAGTTATAAC GATTCTTCCC TCAAAGGAAC AGGGATTAGT ACCGCTTCTC AAACCGCTTG GGCAATTATT GGATTATTGG ATGCGGGAGA AGCCTTAGAA ACCTTAGCCA CAGATGCCAT AAAACGGGGA ATTGACTATT TATTAGCCAC CCAAACCCCT GACGGAACCT GGGAAGAAGC GGAGTTCACT GGAACAGGGT TTCCCTGTCA TTTCTATATC CGTTATCACC TCTATCGTCA TTATTTTCCT TTAATTGCTT TGGGACGTTA TTGGAAAATT GGGTTAAAAA CCCCATCGGT CATTCCCCTC AACTAA
|
Protein sequence | MQTQNRVTST QKVELSNLTQ AIIASQNYIL SRQYPEGYWW GELESNITLT AETVLLHKIW KTDKTRPFHK VETYLRRQQN EQGGWELFYG DGGELSTSVE AYMALRLLGV TPEDPALIRA KDFILSKGGI SKTRIFTKFH LALIGCYDWK GIPSIPPWIM LFPDNFPFTI YEMSSWARES TVPLLIVFDK KPIFEIEPAF NLDELYAEGV ENVKYALPRN HNWSDIFLGL DKLFKWTEKN NLVPFHKKSL QAAEKWMLNH QQESGDWGGI MPPMVNSLIA FKVLNYDVAD PSVQRGFEAI DRFSIEEEDT YRVQACVSPV WDTAWVIRAL VDSGLKPDHP SLVKAGEWLL DKQILEYGDW AIKNKQGKPG GWAFEFINRF YPDLDDSAVV VMALNGIKLP DENRKKAAIN RCLEWMATMQ CKPGGWAAFD VDNDQAWINE IPYGDLKAMI DPNTADVTAR VLEMVGSCGL KMDENRVQKA LFYLEKEQES DGSWFGRWGV NYIYGTSGVL SALAVIAPNT HKPQMEKAVN WLISCQNEDG GWGETCWSYN DSSLKGTGIS TASQTAWAII GLLDAGEALE TLATDAIKRG IDYLLATQTP DGTWEEAEFT GTGFPCHFYI RYHLYRHYFP LIALGRYWKI GLKTPSVIPL N
|
| |