Gene Msil_3243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3243 
Symbol 
ID7090658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3557595 
End bp3559613 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content65% 
IMG OID643466551 
Productsqualene-hopene cyclase 
Protein accessionYP_002363512 
Protein GI217979365 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.291091 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGATA GAGTTGGCGC CGCGACATTC GAAGCTCAGC CGCGCGCCGG GTTCGGATCG 
GTCGAGGCCG CGATTTCCCG AGCTCGCGAG GCTTTGCTGG CGGTGCAGAA GCCGGACGGC
CATTTTGTCT TCGAGCTCGA AGCCGATGTG TCGATTCCGG CCGAATATAT TCTATTCCGG
CATTTTCTCG GCGATCCGGC CAAGACGGAA ATCGAGCGAA AGATCGGCGT CTATCTGCGC
CGGCGTCAGA CGGCGGCGGG CGGATGGCCG CTGTTCGCCG AGGGCGTTTT TAATGTCTCC
AGCTCGGTCA AAGCCTATTT CGCGCTGAAA ATCATCGGCG ACGATCCCAA CGCCCCGCAT
ATGGCGAAGG CCCGCAACGC CATTCTCGCT CATGGCGGCG CCGCCCAATC CAATGTCTTC
ACCCGATCGC TGCTCGCGCT CTATGGCGAA GTCCCCTGGC GCGCCGTGCC GGCGATGCCG
GTCGAGATCA TGCATCTGCC GCGCTGGTTC CCGTTCCATC TGAGCAAGGT TTCCTATTGG
GGCCGCACCG TCATCGCGCC GCTGATCGTG GTCCATGCCC TGAAGCCGCG CGCGAAGAAT
CCGCGCAAGA TTTCGGTTTC GGAGCTTTTC GTCGCGCCGG CCGAAACCGT TTCGCGGTGG
CCTGGGGCGC CGCACAAGTC CTTTCCCTGG ACGACCATCT TCGGCGCGAT CGACCGCGTG
CTGCATAAGA CCGAGCCGTT GCTGCCGGCC CGCTCGCATC AAACCGCCAT CGATAAGGCG
GTCGCCTTTG TCACCGCGCG CCTCAACGGC GAGGACGGGC TTGGCGCGAT TTATCCGGCG
ATGGCCTATT CCGCGATGAT GTTTTTCGCG CTCGGCGCGC CGCTATCCGA TCCGCGGATC
GTGCAGATCC GAAAGGCGAT CGACCGTCTG CTGGTCATCA AGGACGGTGA AGCCTATTGC
CAGCCCTGCG TTTCCCCCGT CTGGGACACC GCGCTCGCAA GCCACGCTCT GATGGAGAGC
GCCGGACAGC GGCCCGAGGC GAGAACCGCC CCGGCTGCGG CGGCTGTCTT CGAAGCGCTG
GATTGGCTGA AGCCGCTGCA GGTGCTTGAC GTCAAGGGCG ACTGGGCGAC GCAAAACCCC
GACGTACGGC CGGGCGGCTG GGCGTTTCAA TACGCCAATC CGCATTATCC CGATCTCGAC
GACACCGCCG TCGTCGTGCT GGCGATGGAT CGCGCCGTCA AGACCTCGCC CCTGATCGCG
GGGGAGGAAG AGACCGCCTA TGTCGAGGCG ATCTCGCGCG CGCGCGAATG GATTCTTGGG
CTGCAGAGCG CCAATGGCGG CTTCGGCGCC TTCGACGCCG ACAATGACCG CGATTATCTG
AACTATATCC CCTTCGCCGA TCACGGCGCG CTGCTCGATC CGCCGACCGC CGACGTCACG
GCCCGCTGCG TCTCGATGCT TGGCCAGCTT GGCGAGAGGC CGGAGACGAG CCCGGCGCTC
GCCCGCGCCA TTGACTACCT TTTGTCCGAG CAGGAGGAGG AGGGCAGCTG GTTCGGCCGC
TGGGGCATGA ATTATATTTA TGGCACATGG TCAGTGCTGA GCGCCTTCAA CGCCGTTGAA
CGTCCGGCCG ACTGCGCCGC GACGCGGAAG GCGGCGGCGT GGCTGAAGCG CATCCAGAAC
CCCGACGGCG GCTGGGGCGA AGACGGCGAG AGCTATGCGC TCGGCTATAA GGGCTATAAT
CCGGCGCCCA GCACCGCCTC GCAGACGGCA TGGGCGCTGC TGGCGCTGAT GGCGGCCGGC
GAGGTCGACG CGCCGGAAGT CGCCCTCGGC CTCGACTATC TCGTCAGCAC GCAGGCGGAC
GATGGGTTCT GGGATGAGGC GCGCTTTACC GCCACCGGTT TTCCCCGCGT GTTCTATTTG
CGCTATCACG GCTACGCCAA ATTCTTTCCC CTCTGGGCGA TGGCGCGCTA CCGCAATCTG
AAAAGCGGCA ATCGGCTCAA GACGCAGTTT GGGATGTGA
 
Protein sequence
MDDRVGAATF EAQPRAGFGS VEAAISRARE ALLAVQKPDG HFVFELEADV SIPAEYILFR 
HFLGDPAKTE IERKIGVYLR RRQTAAGGWP LFAEGVFNVS SSVKAYFALK IIGDDPNAPH
MAKARNAILA HGGAAQSNVF TRSLLALYGE VPWRAVPAMP VEIMHLPRWF PFHLSKVSYW
GRTVIAPLIV VHALKPRAKN PRKISVSELF VAPAETVSRW PGAPHKSFPW TTIFGAIDRV
LHKTEPLLPA RSHQTAIDKA VAFVTARLNG EDGLGAIYPA MAYSAMMFFA LGAPLSDPRI
VQIRKAIDRL LVIKDGEAYC QPCVSPVWDT ALASHALMES AGQRPEARTA PAAAAVFEAL
DWLKPLQVLD VKGDWATQNP DVRPGGWAFQ YANPHYPDLD DTAVVVLAMD RAVKTSPLIA
GEEETAYVEA ISRAREWILG LQSANGGFGA FDADNDRDYL NYIPFADHGA LLDPPTADVT
ARCVSMLGQL GERPETSPAL ARAIDYLLSE QEEEGSWFGR WGMNYIYGTW SVLSAFNAVE
RPADCAATRK AAAWLKRIQN PDGGWGEDGE SYALGYKGYN PAPSTASQTA WALLALMAAG
EVDAPEVALG LDYLVSTQAD DGFWDEARFT ATGFPRVFYL RYHGYAKFFP LWAMARYRNL
KSGNRLKTQF GM