Gene Mnod_7146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_7146 
Symbol 
ID7304688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp7228948 
End bp7230939 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content70% 
IMG OID643604700 
Productsqualene-hopene cyclase 
Protein accessionYP_002502189 
Protein GI220926887 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGAAAGG TCGAGACGCT GCACCGCATG AGCACGCAGG ACATCACGCT GGACGATGTC 
GAGCGGCGCG TGTCGCTCGC GTCCAAGGCT CTGATGCGGC TCGCCGGCCC CGACGGGCAT
TGGTGCTTCG AGCTGGAGGC CGACGCCACC ATCCCGTCCG AGTACATTCT CTATCATCAT
TTCCGCGGCT CGATCCCCTC CGCGGAGCTC GAGGGCAAGA TCGCCAATTA CCTGCGCCGC
ACGCAGAGCG CGCAGCACGA CGGCTGGTCC CTCGTCCATG ACGGCCCGTT CGACATGAGC
GCGACCGTCA AGGCGTATTT CGCCCTCAAG ATGATCGGCG ATTCGATCGA GGCGCCGCAT
ATGCGCCGCG CCCGCGAGGC GATCCTGCGC CGGGGCGGCG CCGCGCACGC CAACGTCTTC
ACCCGGACCC TTCTGGCCCT CTACGGCGAG GTGCCGTGGA GCGCCGTGCC GGTAATGCCC
GTCGAGGTGA TGCTGCTGCC GCGGTGGTTC CCCTTCCACC TCGACAAGGT GTCCTACTGG
GCCCGCACCG TGATGGTGCC GCTCTTCGTG CTGCAGGCCA AGAAGCCGCG GGCCAGGAAT
CCGCGGGGCA TCGGCATCCA GGAGCTGTTC GTCGAGCCGC CGGAGCGGGT GAAACGCTGG
CCGGCCGGCC CGCAGGAATC CTCGCCGTGG CGCCCGGTCT TCGCCGCCAT CGACAAGGTG
CTGCAGAAGG TCGAGGGCTC GTTCCCGGCG GGCTCCCGTG CCCGGGCGAT CGACAAGGCG
GTGGCCTTCG TCAGCGAGCG CCTGAACGGC GAGGACGGGC TCGGCGCGAT CTTCCCCGCG
ATGGTCAACG CGGTGCTGAT GTACGAGGCG CTCGGCTACC CCGAAGATCA CCCCCTGGTC
GCGACCGCCC GCTCCTCGGT GGAGAAGCTC GTCACCGTCA AGGAGCACGA GGCCTACGTG
CAGCCCTGCC TGTCGCCGGT CTGGGACACG GCGCTCTCGG CCCATGCGCT CATGGAGGCG
GGCGGCGTCG AGGCGGAGCG GCACGCCAAG CGCGCCCTCG ACTGGCTCAA GCCCCTGCAG
GTGCTCGACA TCAAGGGCGA CTGGGCCGCC TCCAAGCCGA ATGTGCGGCC GGGCGGCTGG
GCCTTCCAGT ACGCCAACCC GCATTATCCG GACCTCGACG ACACCGCCGT GGTGGTGATG
GCGATGGACC GGGCGCAGGT GCGCCGCAGC CCCGGCCCGG ACGCGGCCGA TTACGGTCAG
TCGATCGCGC GGGCGCGCGA ATGGGTCGAG GGCCTGCAGA GCCGCGACGG CGGCTGGGCG
GCCTTCGACG CGGACAACAC CTACCATTAC CTCAACTACA TCCCGTTCTC CGATCACGGG
GCGCTGCTCG ACCCGCCGAC CGCCGACGTG ACGGCGCGCT GCGTCTCGAT GCTGGCGCAG
CTCGGTGAGA CGCGCGAGAG CTGCCCGCCC CTCGACCGGG GCGTCGCCTA CCTGCTGGCC
GACCAGGAGG CGGATGGCAG CTGGTATGGC CGCTGGGGCA TGAACTACAT CTACGGCACC
TGGTCGGTGC TCTGCGCGCT GAACGCCGCT GGGGTCGACC CGGCCTCGGA GCCGGTGCGG
CGGGCGGTGA ACTGGCTCAC CACCATCCAG AACCCGGATG GCGGCTGGGG CGAGGACGCG
GCGAGCTACA AGCTCGAATA TCGCGGCTAC GAGCGGGCGC CGAGCACCGC CTCGCAGACC
GCCTGGGCGC TCCTCGGGCT CATGGCCGCG GGCGAGGCGG ACAGCCCGGC AGTGGCGCGA
GGCATCAACT ACCTGACCCG CAGCCAGGGG GCGGACGGGC TCTGGACCGA GGACCGCTAT
ACGGCGACCG GGTTCCCGCG CGTCTTCTAC CTGCGCTATC ACGGCTACGC GAAGTTCTTC
CCGCTCTGGG CGCTTGCCCG CTACCGCAAC CTCCAGCAGA GCAACAGCCG TCGGGTCGCC
GTCGGGATGT GA
 
Protein sequence
MGKVETLHRM STQDITLDDV ERRVSLASKA LMRLAGPDGH WCFELEADAT IPSEYILYHH 
FRGSIPSAEL EGKIANYLRR TQSAQHDGWS LVHDGPFDMS ATVKAYFALK MIGDSIEAPH
MRRAREAILR RGGAAHANVF TRTLLALYGE VPWSAVPVMP VEVMLLPRWF PFHLDKVSYW
ARTVMVPLFV LQAKKPRARN PRGIGIQELF VEPPERVKRW PAGPQESSPW RPVFAAIDKV
LQKVEGSFPA GSRARAIDKA VAFVSERLNG EDGLGAIFPA MVNAVLMYEA LGYPEDHPLV
ATARSSVEKL VTVKEHEAYV QPCLSPVWDT ALSAHALMEA GGVEAERHAK RALDWLKPLQ
VLDIKGDWAA SKPNVRPGGW AFQYANPHYP DLDDTAVVVM AMDRAQVRRS PGPDAADYGQ
SIARAREWVE GLQSRDGGWA AFDADNTYHY LNYIPFSDHG ALLDPPTADV TARCVSMLAQ
LGETRESCPP LDRGVAYLLA DQEADGSWYG RWGMNYIYGT WSVLCALNAA GVDPASEPVR
RAVNWLTTIQ NPDGGWGEDA ASYKLEYRGY ERAPSTASQT AWALLGLMAA GEADSPAVAR
GINYLTRSQG ADGLWTEDRY TATGFPRVFY LRYHGYAKFF PLWALARYRN LQQSNSRRVA
VGM