Gene M446_6347 details


Gene Information

Locus tag: M446_6347
Symbol:
ID: 6134918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 6973249
End bp: 6975240
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 71%
IMG OID: 641646441
Product: squalene-hopene cyclase
Protein accession: YP_001773045
Protein GI: 170744390
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
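
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 6973249
end_bp = 6975240
gene_length_bp = 1992
protein_length_aa = 663

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1992 0 663
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.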


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.323444
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.123319
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCAAGG TCGAGACGCT CCACCGCACG AGCACGCAGG ACATCACGCT CGACGACGTC 
GAGCGGCGCG TCACGCTCGC GTCGAAGGCT CTCATGCGGC TCGCGAACGC GGACGGGCAC
TGGTGCTTCG AGCTGGAGGC CGACGCCACC ATTCCGTCCG AGTACATCCT CTACCATCAT
TTCCGCGGCT CGATCCCGAC GGCCGAATTG GAGGGGAAGA TCGCCGCCTA CCTGCGCCGC
ACCCAGAGCG CGCAGCACGA CGGCTGGGCC CTGATCCATG ACGGCCCCTT CGACATGAGC
GCGACCGTCA AGGCCTACTT CGCCCTCAAG ATGGTCGGCG ACCCGATCGA CGCGCCCCAC
ATGCGCCGGG CCCGCGACGC GATCCTGCGC CGGGGCGGCG CCGCCCACGC CAACGTCTTC
ACCCGGATCA TGCTCGCCCT CTACGGCGAG GTGCCGTGGA CCGCCGTGCC GGTGATGCCG
GTCGAGGTGA TGCTGCTGCC GCGGTGGTTC CCCTTCCACC TCGACAAGGT CTCCTACTGG
GCCCGCACCG TGATGGTGCC GCTCTTCGTC CTGCAGGCCA AGAAGCCGCG GGCCCGCAAC
CCGCGCGGCA TCGGCATCCG CGAACTCTTC GTCGAGGCGC CCGAGCGCGT GAAGCGCTGG
CCGGCCGGCC CGCAGGAATC CTCGCCCTGG CGCCCGGTCT TCGCGGCCAT CGACAAGGTG
CTGCAGAAGG TCGAGGGCTT CTTCCCGGCC GGCTCGCGGG CGCGGGCGAT CGACAAGGCG
GTGGCCTTCG TCAGCGAGCG CCTGAACGGC GAGGACGGGC TCGGCGCGAT CTTCCCAGCC
ATGGTCAACA CCGTGCTGAT GTTCGAGGCG CTCGGCTACC CGGACGACCA CCCCTTCGCG
GTCACGGCCC GCTCTTCGGT CGAGAAGCTC GTCACCGTCA AGGAGCACGA GGCCTACGTC
CAGCCCTGCC TGTCCCCGGT CTGGGACACG GCGCTCGCCG CCCACGCCCT GATGGAAGCC
GGCGGGACCG AGGCGGAGCG CCACGCCAAG CGCGCCATGG ACTGGCTGAA GCCCCTGCAG
GTGCTCGACA TCAAGGGCGA CTGGGCGGCC TCCAAGCCGG ACGTGCGGCC GGGCGGCTGG
GCCTTCCAGT ACGCCAACCC GCACTACCCG GACCTCGACG ACACCGCGGT CGTGGTGATG
GCCATGGACC GGGTGCAGAG CCGCCGCAGC CCCGGGCCCG ACGCGGCCGA TTACGGGCTC
TCGATCGCCC GCGCCCGCGA ATGGGTCGAG GGCCTGCAGA GCCGCGACGG CGGCTGGGCG
GCCTTCGACG CGGACAACAC CTACCACTAC CTCAACTACA TTCCGTTCTC GGATCACGGG
GCGCTGCTCG ACCCGCCGAC CGCCGACGTG ACGGCGCGCT GCGTCTCGAT GCTGTCCCAG
CTCGGCGAGA CCCGGGAGAC CTGCCCGCCC CTCGACCGCG GCGTCGCCTA CCTGCTCGCC
GATCAGGAGG CGGATGGCAG CTGGTACGGC CGCTGGGGCA TGAACTACAT CTACGGGACG
TGGTCGGTGC TCTGCGCCCT CAACGCGGCC GGGATCGACC CCGCCTGCGA GCCGGTGCGG
CGGGCGGTGA CCTGGCTCAC CGCGATCCAG AACCCCGACG GCGGCTGGGG CGAGGACGCG
TCGAGCTACA AGCTCGAATA TCGCGGCTAC GAGCGGGCGC CGAGCACGGC CTCGCAGACC
GCCTGGGCGC TGCTCGCGCT GATGGCGGCC GGCGAGGCGG ACAACCCGGC CGTGGCGCGC
GGCATCAACT ACCTGACCCG CACCCAGGGG GCGGACGGGC TCTGGGCCGA GGACCGCTAC
ACGGCGACCG GGTTCCCGCG CGTCTTCTAC CTGCGCTACC ACGGCTACGC GAAGTTCTTC
CCCCTCTGGG CGCTGGCCCG CTACCGCAAC CTCCAGCGGG GCAACAGCCT CAAGGTGGCG
GTGGGGATGT GA
 
Protein sequence
MGKVETLHRT STQDITLDDV ERRVTLASKA LMRLANADGH WCFELEADAT IPSEYILYHH 
FRGSIPTAEL EGKIAAYLRR TQSAQHDGWA LIHDGPFDMS ATVKAYFALK MVGDPIDAPH
MRRARDAILR RGGAAHANVF TRIMLALYGE VPWTAVPVMP VEVMLLPRWF PFHLDKVSYW
ARTVMVPLFV LQAKKPRARN PRGIGIRELF VEAPERVKRW PAGPQESSPW RPVFAAIDKV
LQKVEGFFPA GSRARAIDKA VAFVSERLNG EDGLGAIFPA MVNTVLMFEA LGYPDDHPFA
VTARSSVEKL VTVKEHEAYV QPCLSPVWDT ALAAHALMEA GGTEAERHAK RAMDWLKPLQ
VLDIKGDWAA SKPDVRPGGW AFQYANPHYP DLDDTAVVVM AMDRVQSRRS PGPDAADYGL
SIARAREWVE GLQSRDGGWA AFDADNTYHY LNYIPFSDHG ALLDPPTADV TARCVSMLSQ
LGETRETCPP LDRGVAYLLA DQEADGSWYG RWGMNYIYGT WSVLCALNAA GIDPACEPVR
RAVTWLTAIQ NPDGGWGEDA SSYKLEYRGY ERAPSTASQT AWALLALMAA GEADNPAVAR
GINYLTRTQG ADGLWAEDRY TATGFPRVFY LRYHGYAKFF PLWALARYRN LQRGNSLKVA
VGM
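
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of such a translation, not the method used to produce this record. The function name `translate_cds` and the constants are hypothetical. The amino acid assignments are those of the standard code, which table 11 shares; the start codon set is the one listed for table 11.

```python
# Codon table built in the conventional TCAG order (standard code assignments,
# which translation table 11 shares for all 64 codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; hypothetical helper for illustration."""
    # Remove the spaces and line breaks used in the 10-base block layout.
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence must be a non-empty multiple of 3 bases")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    # The initiating codon is read as methionine regardless of its usual meaning.
    protein = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGGCAAGG TCGAGACGCT CCACCGCACG"))  # MGKVETLHRT
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function would allow the whole translation to be compared against the listed protein.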