Gene Svir_23080 details

Gene Information

Locus tag: Svir_23080
Symbol:
ID: 8387632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharomonospora viridis DSM 43017
Kingdom: Bacteria
Replicon accession: NC_013159
Strand:
Start bp: 2483073
End bp: 2484995
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 67%
IMG OID: 644976361
Product: squalene-hopene cyclase
Protein accession: YP_003134143
Protein GI: 257056311
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
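
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed in this record, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2483073
end_bp = 2484995

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is read in whole codons and the last codon is a
# stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the listed Gene Length (1923 bp) and Protein Length (640 aa).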


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.813325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.224628
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATG TGTTGACCCG TGAGCTGTCC CCGAACTCGA CGCGGGACAG GGTGCGAAGC 
TGCGTTTCCT CCGCCCGGCA GTACTTGCTG TCGCTGCAGC ACGAGGAGGG CTGGTGGAAA
GGTGAGCTGG ACACGAACGT GACGATGGAG GCCGAGGACC TGCTGCTGCG GCAGTTCCTC
GGTATCTCCG ACGAACAGGT CACACAGGAG ACCGCGCGCT GGATCAGGTC GTGTCAACGG
GAGGACGGAA CGTGGGCGAC CTTCCACGGC GGTCCCCCCG ACCTGTCCAC GACCGTGGAG
GCCTACGTCG CGTTGCGCCT GGCAGGTGAT GCGATGGACG CCGCGCACCT GCGCAAGGCG
CGTGAATACA TCTTGGACAG TGGGGGGATC GAGTCCACAC GGGTGTTCAC ACGGATCTGG
CTGGCACTGT TCGGGGAATG GCCGTGGAGC AGATTGCCGG TGCTGCCACC GGAGATGATG
CTGCTGCCCG ACTGGTTCCC GCTGAACATC TACGACTGGG CGAGCTGGGC CAGACAAACG
GTGGTGCCGT TGACGATCGT CGGTTCCCTC CGGCCCACCA GGGATCTCGG ATTCAGCGTC
CGGGAGTTGC GTACCGGGAT CCAGCGCCGG GATCTGGAGT CGCCGCTTTC CTGGGCGGGC
GTTTTCCACG GCCTCGACTC GGTGTTGCAC CGTTTGGAGA AGCTGCCTTT GAAGCCCTTG
CGCAAGGTGG CACTGGCACG AGCGGAACAA TGGATCCTCG ACCGACAGGA ATCGGACGGG
GGTTGGGGCG GCATCCAGCC ACCGTGGGTG TATTCGATCC TCGCCCTGCA TCTGCGTGGT
TACCCGTTGG ACCACCCGGT GCTGCGTAAG GCGCTCGACG GGCTGGACGG CTTCACCATC
CGGCACCGCA CCGAGAACGG GTGGATCCGT AAATTGGAGG CCTGCCAGTC GCCGGTGTGG
GACACGGCTC TTGCGATGAC GGCGTTGTTG GACTCCGGAA CACCACCGAA CGACCCGGCG
CTGGTCAGGG CGGCCGACTG GATCCTGCGT CAGGAGATCC GGGTCAGTGG TGACTGGCGG
GTGCGCAGAC CCGCCCTGGA ACCGTCGGGG TGGGCGTTCG AGTTCGCCAA CGACCATTAC
CCCGACACCG ACGACACGGC CGAGGTGGTG CTCGGACTCC AGCGGGTGAG ACACCCCGAG
CCGCACCGGG TGAACGCGGC CGTGGAACGG GCCACCGCGT GGCTCGTGGG CATGCAGTCC
TCGGACGGCG GATGGGGTGC GTTCGACGCC GACAACACGC GAACACTGTG CGAGAAGCTG
CCGTTCTGCG ACTTCGGCGC GGTGATCGAC CCGCCGTCGG CCGACGTGAC GGCGCATATC
GTGGAGATGC TGGCGGCCCG GGGAATGGCC GACAGTGAAT CCGCCAGACG CGGCGTGCGC
TGGTTGTTGG AGCACCAGGA GGTGGACGGT TCCTGGTTCG GTCGTTGGGG AGCCAACCAC
GTCTACGGCA CGGGAGCGGT GGTTCCGGCT CTCGTGGCGT GTGGGATCTC CCCCCAGCAC
GAGGCCGTGC GCGCGGCCGT GCAGTGGCTT GTGGCCCACC AGAACGCCGA CGGCGGTTGG
GGTGAGGACC TGCGTTCCTA TGTCGACCGG ACCTGGGTGG GCCGGGGCAC CTCCACACCG
TCACAGACGG CGTGGGCGTT GTTGGCGCTG CTCGCGGCGG GGGAACGGGG CGAGGTCGTG
CGCCGGGGTG TGGAATGGCT GATGGCCGCA CAGCGGCCCG ACGGTGGTTG GGACGAACCG
CAGTACACCG GCACCGGTTT CCCCGGCGAC TTCTACATCA GCTACCACAT GTACCGGATC
GTCTTCCCGC TCACCGCGTT GGGCCGGTAC CTCGGAAGAG GTGGCGATGT CGGGACGGGC
TGA
 
Protein sequence
MTDVLTRELS PNSTRDRVRS CVSSARQYLL SLQHEEGWWK GELDTNVTME AEDLLLRQFL 
GISDEQVTQE TARWIRSCQR EDGTWATFHG GPPDLSTTVE AYVALRLAGD AMDAAHLRKA
REYILDSGGI ESTRVFTRIW LALFGEWPWS RLPVLPPEMM LLPDWFPLNI YDWASWARQT
VVPLTIVGSL RPTRDLGFSV RELRTGIQRR DLESPLSWAG VFHGLDSVLH RLEKLPLKPL
RKVALARAEQ WILDRQESDG GWGGIQPPWV YSILALHLRG YPLDHPVLRK ALDGLDGFTI
RHRTENGWIR KLEACQSPVW DTALAMTALL DSGTPPNDPA LVRAADWILR QEIRVSGDWR
VRRPALEPSG WAFEFANDHY PDTDDTAEVV LGLQRVRHPE PHRVNAAVER ATAWLVGMQS
SDGGWGAFDA DNTRTLCEKL PFCDFGAVID PPSADVTAHI VEMLAARGMA DSESARRGVR
WLLEHQEVDG SWFGRWGANH VYGTGAVVPA LVACGISPQH EAVRAAVQWL VAHQNADGGW
GEDLRSYVDR TWVGRGTSTP SQTAWALLAL LAAGERGEVV RRGVEWLMAA QRPDGGWDEP
QYTGTGFPGD FYISYHMYRI VFPLTALGRY LGRGGDVGTG
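
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the first line of the gene sequence could be joined and translated for comparison with the start of the protein sequence. It builds the codon table from the standard NCBI ordering of bases (TCAG); translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, so that table is used here. The function name `translate` and the variable names are hypothetical, not part of any tool used to produce this record.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(blocked_dna):
    """Join whitespace-separated sequence blocks and translate whole codons."""
    dna = "".join(blocked_dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied from this record (60 bases).
first_line = "ATGACCGATG TGTTGACCCG TGAGCTGTCC CCGAACTCGA CGCGGGACAG GGTGCGAAGC"
peptide = translate(first_line)
print(peptide)  # compare with the first two blocks of the protein sequence
```

The same function could be applied to the full gene sequence once all lines are concatenated; the trailing stop codon would then appear as `*` at the end of the translation.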