Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Svir_23080 |
Symbol | |
ID | 8387632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharomonospora viridis DSM 43017 |
Kingdom | Bacteria |
Replicon accession | NC_013159 |
Strand | - |
Start bp | 2483073 |
End bp | 2484995 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644976361 |
Product | squalene-hopene cyclase |
Protein accession | YP_003134143 |
Protein GI | 257056311 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.813325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.224628 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATG TGTTGACCCG TGAGCTGTCC CCGAACTCGA CGCGGGACAG GGTGCGAAGC TGCGTTTCCT CCGCCCGGCA GTACTTGCTG TCGCTGCAGC ACGAGGAGGG CTGGTGGAAA GGTGAGCTGG ACACGAACGT GACGATGGAG GCCGAGGACC TGCTGCTGCG GCAGTTCCTC GGTATCTCCG ACGAACAGGT CACACAGGAG ACCGCGCGCT GGATCAGGTC GTGTCAACGG GAGGACGGAA CGTGGGCGAC CTTCCACGGC GGTCCCCCCG ACCTGTCCAC GACCGTGGAG GCCTACGTCG CGTTGCGCCT GGCAGGTGAT GCGATGGACG CCGCGCACCT GCGCAAGGCG CGTGAATACA TCTTGGACAG TGGGGGGATC GAGTCCACAC GGGTGTTCAC ACGGATCTGG CTGGCACTGT TCGGGGAATG GCCGTGGAGC AGATTGCCGG TGCTGCCACC GGAGATGATG CTGCTGCCCG ACTGGTTCCC GCTGAACATC TACGACTGGG CGAGCTGGGC CAGACAAACG GTGGTGCCGT TGACGATCGT CGGTTCCCTC CGGCCCACCA GGGATCTCGG ATTCAGCGTC CGGGAGTTGC GTACCGGGAT CCAGCGCCGG GATCTGGAGT CGCCGCTTTC CTGGGCGGGC GTTTTCCACG GCCTCGACTC GGTGTTGCAC CGTTTGGAGA AGCTGCCTTT GAAGCCCTTG CGCAAGGTGG CACTGGCACG AGCGGAACAA TGGATCCTCG ACCGACAGGA ATCGGACGGG GGTTGGGGCG GCATCCAGCC ACCGTGGGTG TATTCGATCC TCGCCCTGCA TCTGCGTGGT TACCCGTTGG ACCACCCGGT GCTGCGTAAG GCGCTCGACG GGCTGGACGG CTTCACCATC CGGCACCGCA CCGAGAACGG GTGGATCCGT AAATTGGAGG CCTGCCAGTC GCCGGTGTGG GACACGGCTC TTGCGATGAC GGCGTTGTTG GACTCCGGAA CACCACCGAA CGACCCGGCG CTGGTCAGGG CGGCCGACTG GATCCTGCGT CAGGAGATCC GGGTCAGTGG TGACTGGCGG GTGCGCAGAC CCGCCCTGGA ACCGTCGGGG TGGGCGTTCG AGTTCGCCAA CGACCATTAC CCCGACACCG ACGACACGGC CGAGGTGGTG CTCGGACTCC AGCGGGTGAG ACACCCCGAG CCGCACCGGG TGAACGCGGC CGTGGAACGG GCCACCGCGT GGCTCGTGGG CATGCAGTCC TCGGACGGCG GATGGGGTGC GTTCGACGCC GACAACACGC GAACACTGTG CGAGAAGCTG CCGTTCTGCG ACTTCGGCGC GGTGATCGAC CCGCCGTCGG CCGACGTGAC GGCGCATATC GTGGAGATGC TGGCGGCCCG GGGAATGGCC GACAGTGAAT CCGCCAGACG CGGCGTGCGC TGGTTGTTGG AGCACCAGGA GGTGGACGGT TCCTGGTTCG GTCGTTGGGG AGCCAACCAC GTCTACGGCA CGGGAGCGGT GGTTCCGGCT CTCGTGGCGT GTGGGATCTC CCCCCAGCAC GAGGCCGTGC GCGCGGCCGT GCAGTGGCTT GTGGCCCACC AGAACGCCGA CGGCGGTTGG GGTGAGGACC TGCGTTCCTA TGTCGACCGG ACCTGGGTGG GCCGGGGCAC CTCCACACCG TCACAGACGG CGTGGGCGTT GTTGGCGCTG CTCGCGGCGG GGGAACGGGG CGAGGTCGTG CGCCGGGGTG TGGAATGGCT GATGGCCGCA CAGCGGCCCG ACGGTGGTTG GGACGAACCG CAGTACACCG GCACCGGTTT CCCCGGCGAC TTCTACATCA GCTACCACAT GTACCGGATC GTCTTCCCGC TCACCGCGTT GGGCCGGTAC CTCGGAAGAG GTGGCGATGT CGGGACGGGC TGA
|
Protein sequence | MTDVLTRELS PNSTRDRVRS CVSSARQYLL SLQHEEGWWK GELDTNVTME AEDLLLRQFL GISDEQVTQE TARWIRSCQR EDGTWATFHG GPPDLSTTVE AYVALRLAGD AMDAAHLRKA REYILDSGGI ESTRVFTRIW LALFGEWPWS RLPVLPPEMM LLPDWFPLNI YDWASWARQT VVPLTIVGSL RPTRDLGFSV RELRTGIQRR DLESPLSWAG VFHGLDSVLH RLEKLPLKPL RKVALARAEQ WILDRQESDG GWGGIQPPWV YSILALHLRG YPLDHPVLRK ALDGLDGFTI RHRTENGWIR KLEACQSPVW DTALAMTALL DSGTPPNDPA LVRAADWILR QEIRVSGDWR VRRPALEPSG WAFEFANDHY PDTDDTAEVV LGLQRVRHPE PHRVNAAVER ATAWLVGMQS SDGGWGAFDA DNTRTLCEKL PFCDFGAVID PPSADVTAHI VEMLAARGMA DSESARRGVR WLLEHQEVDG SWFGRWGANH VYGTGAVVPA LVACGISPQH EAVRAAVQWL VAHQNADGGW GEDLRSYVDR TWVGRGTSTP SQTAWALLAL LAAGERGEVV RRGVEWLMAA QRPDGGWDEP QYTGTGFPGD FYISYHMYRI VFPLTALGRY LGRGGDVGTG
|
| |