Gene Mchl_2220 details

Gene Information

Locus tag: Mchl_2220
Symbol:
ID: 7116167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 2323810
End bp: 2325813
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 68%
IMG OID: 643524970
Product: squalene-hopene cyclase
Protein accession: YP_002420995
Protein GI: 218530179
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
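
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that does not contribute an amino acid; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2323810, 2325813)
print(length)                     # 2004
print(protein_length_aa(length))  # 667
```

Both values match the Gene Length (2004 bp) and Protein Length (667 aa) fields.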


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.96111
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGAGG CGGCCGTAAG CAAGGTCGAG ACGCTGCAGC GTCCCAAGAC CCGCGACGTG 
TCCCTCGACG ACGTGGAGCG TGGCGTCCAG AGCGCCACCC GCGCCCTCAC CGAGATGACG
CAGGCGGATG GCCACATCTG CTTCGAGCTC GAGGCGGATG CGACCATCCC CTCCGAGTAC
ATCCTGTTCC ACCAGTTCCG TGGGACCGAG CCGCGGCCCG GGCTCGAGGC CAAGATCGGC
AACTACCTGC GCCGCACGCA GTCGAAGGTG CATGGCGGCT GGGCGCTGGT GCATGACGGC
CCGTTCGACA TGAGCGCGTC GGTGAAGGCC TACTTCGCCC TCAAGATGAT CGGCGACGAC
ATCGAGGCGC CGCACATGCG CGCGGTGCGC AAGGCGATCC TCCAGCGCGG GGGCGCGGCC
AACGCCAACG TCTTCACCCG CATCCTGCTC GCGCTCTACG GCGAAGTGCC GTGGGCGGCG
GTGCCGGTGA TGCCGGTGGA GGTGATGCAC CTGCCGAAGT GGTTCCCGTT CCACCTCGAC
AAGGTGTCGT ACTGGGCCCG CTGCACCATG GTGCCGCTGT TCGTGATCCA GGCCAAGAAG
CCGCGGGCGA AGAACCCGCG CGGCGTCGGC GTGGCCGAAC TGTTCGTGAC GCCGCCCGAT
TCGGTGCGGA CCTGGCCGGG CTCGCCCCAC GCCACCTGGC CGTGGACGCC GATCTTCGGC
GGCATCGACC GCGTGCTGCA GAAGACGCAG GACCACTTCC CGAAGGTGCC GCGCCAGCGC
GCCATCGACA AGGCGGTCGC CTGGGTCTCC GAGCGGCTGA ACGGCGAGGA CGGCCTCGGC
GCCATCTTCC CGGCCATGGT CAACTCGGTG CTGATGTACG AGGTGCTGGG CTATCCGCCC
GAGCACCCGC AGGTGAAGAT CGCGCTGGAA GCCATCGAGA AGCTCGTCGC CGAGAAGGAG
GACGAGGCCT ATGTCCAGCC CTGCCTGTCC CCGGTCTGGG ATACGGCGCT GAACAGCCAC
GCCATGCTCG AAGCCGGCGG GCACCAGGCG GAGGCCAATG CCCGCGCCGG CCTCGATTGG
CTCAAGCCGC TCCAGATCCT CGACATCAAG GGCGACTGGG CTGAGACCAA GCCCAACGTG
CGTCCTGGCG GCTGGGCGTT CCAGTACGCC AACCCGCACT ATCCCGACCT CGACGACACC
GCCGTCGTCG TGATGGCGAT GGATCGCGCC CAGCGTCAGC ACGGGCTGGT CAGCGGCATG
CCGGACTATT CGGAGTCGAT CGCCCGTGCC CGCGAATGGG TCGAAGGCCT TCAGAGCGCC
GATGGCGGCT GGGCGGCGTT CGATGCCGAC AACAACCACC ACTATCTCAA CCACATCCCG
TTCTCGGATC ACGGCGCCCT GCTCGATCCG CCGACCGCGG ACGTGACCGC CCGCGTCGTC
TCGATGCTGT CGCAGCTCGG CGAGACCCGC GCGACCAGCC GGGCGCTCGA CCGCGGCGTG
ACCTACCTCC TCAACGACCA GGAGAAGGAC GGGAGCTGGT ACGGCCGCTG GGGCATGAAC
TTCATCTACG GCACGTGGTC GGTGCTGTGC GCGCTCAACA CCGCCGGGGT CGATCCGCAA
TCGCCTGAGA TCCGCAAGGC GGTGGCGTGG CTCATCCGCA TCCAGAACCC GGATGGCGGC
TGGGGCGAGG ATGCCTCCTC CTACAAGCTC AACCCCGAAT TCGAGCCGGG CTACTCGACC
GCCTCGCAGA CGGCCTGGGC CCTGCTCGCG CTGATGGCGG CGGGCGAAGT CGACGATCCG
GCGGTCGCCC GCGGCGTCAA CTACCTCGTG CGCACGCAGG GGCAGGACGG GCTGTGGAGC
GAGGAGCGCT ACACCGCTAC CGGCTTCCCG CGGGTGTTCT ACCTGCGCTA CCACGGCTAC
CCGAAGTTCT TCCCGCTCTG GGCGATGGCC CGCTTCCGCA ACCTGAAGCG GGGCAACAGC
CGGCAGGTGC AGTTCGGCAT GTGA
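
The GC content reported in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown above; `gc_percent` is a hypothetical helper name, and the example call uses only the last line of the sequence.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence; pass the full sequence to compare
# against the GC content field of the record.
last_block = "CGGCAGGTGC AGTTCGGCAT GTGA"
print(round(gc_percent(last_block), 1))
```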
 
Protein sequence
MREAAVSKVE TLQRPKTRDV SLDDVERGVQ SATRALTEMT QADGHICFEL EADATIPSEY 
ILFHQFRGTE PRPGLEAKIG NYLRRTQSKV HGGWALVHDG PFDMSASVKA YFALKMIGDD
IEAPHMRAVR KAILQRGGAA NANVFTRILL ALYGEVPWAA VPVMPVEVMH LPKWFPFHLD
KVSYWARCTM VPLFVIQAKK PRAKNPRGVG VAELFVTPPD SVRTWPGSPH ATWPWTPIFG
GIDRVLQKTQ DHFPKVPRQR AIDKAVAWVS ERLNGEDGLG AIFPAMVNSV LMYEVLGYPP
EHPQVKIALE AIEKLVAEKE DEAYVQPCLS PVWDTALNSH AMLEAGGHQA EANARAGLDW
LKPLQILDIK GDWAETKPNV RPGGWAFQYA NPHYPDLDDT AVVVMAMDRA QRQHGLVSGM
PDYSESIARA REWVEGLQSA DGGWAAFDAD NNHHYLNHIP FSDHGALLDP PTADVTARVV
SMLSQLGETR ATSRALDRGV TYLLNDQEKD GSWYGRWGMN FIYGTWSVLC ALNTAGVDPQ
SPEIRKAVAW LIRIQNPDGG WGEDASSYKL NPEFEPGYST ASQTAWALLA LMAAGEVDDP
AVARGVNYLV RTQGQDGLWS EERYTATGFP RVFYLRYHGY PKFFPLWAMA RFRNLKRGNS
RQVQFGM
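
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is one way to reproduce it in plain Python; it uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...),
# paired with the standard amino-acid assignments; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCGTGAGG CGGCCGTAAG CAAGGTCGAG"))  # MREAAVSKVE
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence should yield the complete translation.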