Gene Mpop_1835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpop_1835 
Symbol 
ID6312724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium populi BJ001 
KingdomBacteria 
Replicon accessionNC_010725 
Strand
Start bp1973399 
End bp1975402 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content68% 
IMG OID642650558 
Productsqualene-hopene cyclase 
Protein accessionYP_001924533 
Protein GI188581088 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.807169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGAGG CGGCCGTGAG CAAGGTCGAG ACGCTGCAGC GTCCCAAGAC CCGCGACGTG 
TCCCTCGACG ATGTCGAGCG TGGCGTCCAG AACGCCGCCC GCGCCCTCAC CGAGATGACG
CAGACCGACG GCCACATCTG CTTCGAGCTC GAAGCGGATG CGACCATCCC CTCCGAATAC
ATCCTGTTCC ACCAGTTCCG CGGAACCGTG CCGCGCGACG GCCTGGAAGC CAAGATCGGC
AACTACCTGC GCCGCACGCA GTCGAAGGTG CATGGCGGCT GGGCGCTGGT CCATGACGGC
CCGTTCGACA TGAGCGCGAC CGTGAAGGCC TATTTCGCCC TCAAGATGAT CGGCGACGAC
ATCGAGGCGC CGCACATGCG CGCGGCCCGC AAGGCGATCC TGCAGCGCGG GGGCGCGGCC
AACGCCAACG TCTTCACTCG CATCCTGCTC GCCCTCTACG GCGAGGTGCC CTGGGCCGCG
GTGCCGGTGA TGCCGGTGGA GGTGATGCAC CTGCCGAAGT GGTTCCCGTT CCACCTCGAC
AAGGTGTCCT ACTGGGCCCG CTGCACCATG GTGCCGCTGT TCGTGATCCA GGCCAAGAAG
CCGCGGGCGA AGAACCCGCG CGGCATCGGC GTGGCCGAAC TGTTCGTGAC CCCGCCCGAT
TCGGTGCGGA CCTGGCCGGG CTCGCCCCAC GCCACTTGGC CGTGGACGCC GATCTTCGGC
GCCATCGACC GCGTGCTGCA GAAGACGCAG GACCACTTCC CGAAAGTGCC GCGCCAGCGC
GCCATCGACA AGGCGGTGGC CTGGGTGTCC GAGCGCCTGA ACGGCGAGGA CGGCCTCGGC
GCCATCTTCC CGTCGATGGT CAACTCGGTG CTGATGTACG AGGTGCTCGG CTATCCCCCC
GATCATCCGC AGGTGAAGAT CGCGCTGGAA GCCATCGAAA AGCTCGTCGC CGAGAAGGAC
GACGAGGCCT ATGTCCAGCC CTGCCTGTCG CCGGTCTGGG ACACGGCGCT GACCAGCCAC
GCCATGCTGG AGACCGGCGG CGCCGCGGCC GAGGCCAATG CCCGCGCCGG CCTCGACTGG
CTGAAGCCGC TGCAGATCCT CGACATCAAG GGCGACTGGG CCGAGACCAA GCCGAACGTG
CGCCCCGGCG GCTGGGCCTT CCAGTACGCC AACCCGCACT ATCCCGATCT CGACGACACC
GCCGTGGTGG TGATGGCGAT GGACCGCGCC CAGCGCCAGC ACGGTCTGGT GAGCGGAATG
CCGGACTACT CGGCCTCGAT CGCCCGCGCC CGCGAGTGGG TCGAGGGGCT CCAGAGCGCC
GACGGCGGCT GGGCGGCCTT CGACGCCGAC AACAACCACC ACTACCTCAA CCACATCCCG
TTCTCGGATC ACGGCGCGCT GCTCGATCCG CCGACCGCGG ACGTGACCGC CCGCGTCGTC
TCGATGCTGT CGCAGCTCGG CGAGACCCGC GAGACCAGCC GGGCGCTCGA CCGCGGTGTG
ACCTACCTGC TCAACGACCA GGAGAAGGAC GGGAGCTGGT ACGGCCGCTG GGGCATGAAC
TTCATCTACG GCACGTGGTC GGTGCTCTGC GCGCTGAACG CCGCCGGTGT CGATCCGCAA
TCGCCTGAGA TCCGCAAGGC GGTGGCGTGG CTCATCCGCA TCCAGAACCC GGATGGCGGC
TGGGGCGAGG ATGCCTCCTC CTACAAGCTC AACCCCGAAT TCGAGCCGGG CTACTCCACC
GCCTCGCAGA CGGCCTGGGC GCTGCTCGCC CTCATGGCGG TGGGCGAGGT GGACGATCCG
GCGGTCGCCC GCGGCGTCAA CTACCTGATG CGCACGCAAG GGCAGGACGG GTTGTGGAAC
GAGGAGCGCT ACACCGCGAC CGGCTTCCCG CGGGTGTTCT ACCTGCGCTA CCACGGCTAC
CCGAAATTCT TCCCGCTCTG GGCGATGGCC CGCTTCCGCA ACCTGAAGAA GGGTAACAGC
CGTCAGGTGC AGTTCGGGAT GTGA
 
Protein sequence
MREAAVSKVE TLQRPKTRDV SLDDVERGVQ NAARALTEMT QTDGHICFEL EADATIPSEY 
ILFHQFRGTV PRDGLEAKIG NYLRRTQSKV HGGWALVHDG PFDMSATVKA YFALKMIGDD
IEAPHMRAAR KAILQRGGAA NANVFTRILL ALYGEVPWAA VPVMPVEVMH LPKWFPFHLD
KVSYWARCTM VPLFVIQAKK PRAKNPRGIG VAELFVTPPD SVRTWPGSPH ATWPWTPIFG
AIDRVLQKTQ DHFPKVPRQR AIDKAVAWVS ERLNGEDGLG AIFPSMVNSV LMYEVLGYPP
DHPQVKIALE AIEKLVAEKD DEAYVQPCLS PVWDTALTSH AMLETGGAAA EANARAGLDW
LKPLQILDIK GDWAETKPNV RPGGWAFQYA NPHYPDLDDT AVVVMAMDRA QRQHGLVSGM
PDYSASIARA REWVEGLQSA DGGWAAFDAD NNHHYLNHIP FSDHGALLDP PTADVTARVV
SMLSQLGETR ETSRALDRGV TYLLNDQEKD GSWYGRWGMN FIYGTWSVLC ALNAAGVDPQ
SPEIRKAVAW LIRIQNPDGG WGEDASSYKL NPEFEPGYST ASQTAWALLA LMAVGEVDDP
AVARGVNYLM RTQGQDGLWN EERYTATGFP RVFYLRYHGY PKFFPLWAMA RFRNLKKGNS
RQVQFGM