Gene Mchl_5491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_5491 
Symbol 
ID7119274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011758 
Strand
Start bp116385 
End bp118322 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content70% 
IMG OID643528164 
Productsqualene-hopene cyclase 
Protein accessionYP_002424160 
Protein GI218533345 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.264934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACGG AGCCTCGCTT CTCCGCGCCC GAGACCCTGC GCGCTATCGC CGGCGCGGGG 
CGTGCCCTGG GCCGCCACCA GCGCCGGGAC GGCCACTGGG TCTTCGAGTT GGAGGCGGAC
GCGACCATCC CGGCCGAGTA CGTGCTCCTG GAGCACTACA TGGACCGCAT CACGCCCGAG
CGGCAGGCCC GGATCGGAGC CTACCTCCGG CGCATCCAGG GCGAGCATGG CGGCTGGCCC
ATGTTCCACG CGGGCGAGTT CAACATCTCG GCCAGCGTGA AGGCCTACTG CGCGCTGAAG
GCCATCGGCG ACGATCCGCA AGCGCCGCAT ATGGTCCGGG CCCGCCAGGC CATCCTCGGC
CATGGGGGCG CGGAGCGCGC CAACGTCTTC ACGCGCATCC AACTCGCCCT GTTCGGCGCC
ATCCCCTGGC GCGGCGTGCC GGTGATGCCG GTCGAGATCA TGCACCTGCC CAAGTGGTTC
TTCTTCAACA TCTGGGCGAT GTCCTACTGG GCCCGCACCT GCGTGGTGCC GCTCCTCGTG
CTGCAGGCGC GGAAGCCCCG TGCCCGCAAC CCGCGCCAGG TGAGCTTCGA CGAGATCTTC
CGGACCGAGC CGGACGAGGT CCGAGACTGG ATCCGCGGCC CCTACCGCTC ACGCTGGGGC
GTGGTGTTCA AGCACATCGA CACGGTGCTG CGCTGGACCG AGCCCCTGTT CTCGAAGGTC
GCGCGCGAGA GCGCCATCTT CAAGGCCGTC GACTTCGTGG AGGAGCGTCT GAACGGCGAG
GACGGGCTCG GCGCGATCTA CCCGGCCATG GCCTACGCGC TGATGATGTA CGACGTGCTC
GGCTACCCCG AGGACGACCC GCGCTGCGTC ACGATCTGGA AGGCCATCGA CAAGCTTCTC
ATCGAGACGG ACGAGGAGGT TTACTGCCAG CCCTGCGTCT CGCCCGTATG GGACACGAGC
CTGTCCGGGC ATGCCATGAT CGAGGCGGCG CGCACCGGGG GCATCGAGGC CCAAGCGGAG
CTCGACGCCG CGTGCGACTG GCTGGTGGCG CGCCAGGTCA AGGACGTGCG GGGCGACTGG
GCCGAGACGC GGCCGGACGC CGAGCCCGGC GGCTGGGCCT TCCAGTACCG CAACGACCAC
TACCCCGACG TCGACGACAC GGCGGTGGTC GCCATGCTGC TCCACCGCAA CGGCCGGCCC
GAGCACGCGG AGGCAATCGA GAAGGCGCGC CGCTGGGTCG TCGGCGTGCA GAGCCGCAAT
GGAGGCTGGG GTGCCTTCGA CGCCGACAAC GACCGCGAGT TCCTCAACCA CATCCCGTTC
TCGGACCACG GCGCGCTGCT CGACCCGCCG ACCGCCGACG TGACCGGCCG CTGCATCTCC
TTCCTGTCCC AGCTCGGGCA CGAGGAGGAC CGGCCGGTGA TCGAGCGCGC CTTGGCCTAC
CTTCGGGCCG AACAGGAGCG CGACGGCAGT TGGTACGGGC GCTGGGGCAC CAACTACGTC
TACGGCACCT GGACGGTCCT GTGCGGCCTG AACGCGGCCG GCATCCCGCA CGACGACCCG
ATGGTGCGCC GGGCCGTGGA CTGGCTGGTC TCGATCCAGC GCGCGGACGG CGGCTGGGGC
GAGGACGAGC GCAGCTACGA CGTCGGCCAC TACGTCGAGA ACGCCGAGAG CCTGCCTTCG
CAGACGGCCT GGGCGATGCT CGGCCTGATG TCGGTCGGCC AGGCCGACCA CCCCGCCGTC
CTACGCGGTG CGGCCTACCT GCAGCGCACG CAAGGGCCGG ACGGCGAGTG GCAGGAGCGG
GCCTACAACG CCGTCGGCTT CCCGCGCGTG TTCTACCTCA AGTATCACGG CTACCGGCTG
TTCTTCCCAC TGTTCGCCCT CTCGCGCCTT CACAACCTAC AACGGGGCAA CAGCCGGGAG
GTCAGCTTCG GCTTTTGA
 
Protein sequence
MNTEPRFSAP ETLRAIAGAG RALGRHQRRD GHWVFELEAD ATIPAEYVLL EHYMDRITPE 
RQARIGAYLR RIQGEHGGWP MFHAGEFNIS ASVKAYCALK AIGDDPQAPH MVRARQAILG
HGGAERANVF TRIQLALFGA IPWRGVPVMP VEIMHLPKWF FFNIWAMSYW ARTCVVPLLV
LQARKPRARN PRQVSFDEIF RTEPDEVRDW IRGPYRSRWG VVFKHIDTVL RWTEPLFSKV
ARESAIFKAV DFVEERLNGE DGLGAIYPAM AYALMMYDVL GYPEDDPRCV TIWKAIDKLL
IETDEEVYCQ PCVSPVWDTS LSGHAMIEAA RTGGIEAQAE LDAACDWLVA RQVKDVRGDW
AETRPDAEPG GWAFQYRNDH YPDVDDTAVV AMLLHRNGRP EHAEAIEKAR RWVVGVQSRN
GGWGAFDADN DREFLNHIPF SDHGALLDPP TADVTGRCIS FLSQLGHEED RPVIERALAY
LRAEQERDGS WYGRWGTNYV YGTWTVLCGL NAAGIPHDDP MVRRAVDWLV SIQRADGGWG
EDERSYDVGH YVENAESLPS QTAWAMLGLM SVGQADHPAV LRGAAYLQRT QGPDGEWQER
AYNAVGFPRV FYLKYHGYRL FFPLFALSRL HNLQRGNSRE VSFGF