Gene Mext_1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1944 
Symbol 
ID5831828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2164406 
End bp2166409 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content69% 
IMG OID641367744 
Productsqualene-hopene cyclase 
Protein accessionYP_001639414 
Protein GI163851371 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.89702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.562676 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGAGG CGGCCGTAAG CAAGGTCGAG ACGCTGCAGC GTCCCAAGAC CCGCGACGTG 
TCCCTCGACG ACGTGGAGCG TGGCGTCCAG AGCGCCACCC GCGCCCTCAC CGAGATGACG
CAGGCGGATG GCCACATCTG CTTCGAGCTC GAGGCGGATG CGACCATCCC CTCCGAGTAC
ATCCTGTTCC ACCAGTTCCG CGGGACCGAG CCGCGGCCCG GCCTCGAGGC CAAGATCGGC
AACTACCTGC GCCGCACCCA GTCGAAGGTG CATGGCGGCT GGGCGCTGGT GCATGACGGC
CCGTTCGACA TGAGCGCGTC GGTAAAGGCC TACTTCGCCC TCAAGATGAT CGGCGACGAC
ATCGAGGCGC CGCACATGCG CGCGGTGCGC AAGGCGATCC TCCAGCGCGG GGGCGCGGCC
AACGCCAACG TCTTCACCCG CATCCTGCTC GCGCTCTACG GCGAAGTGCC GTGGGCGGCG
GTGCCGGTGA TGCCGGTGGA GGTGATGCAC CTGCCGAAGT GGTTCCCGTT CCACCTCGAT
AAGGTGTCCT ACTGGGCCCG CTGCACCATG GTGCCGCTGT TCGTGATCCA GGCCAAGAAG
CCGCGGGCGA AGAACCCGCG CGGCGTCGGC GTGGCCGAAT TGTTCGTGAC GCCGCCCGAT
TCGGTGCGGA CCTGGCCGGG CTCGCCCCAC GCCACCTGGC CGTGGACGCC GATCTTCGGC
GGCATCGACC GCGTGCTGCA GAAGACGCAG GACCACTTCC CGAAGGTGCC GCGCCAGCGC
GCCATCGACA AGGCGGTCGC CTGGGTCTCC GAGCGGCTGA ACGGCGAGGA CGGCCTCGGC
GCCATCTTCC CGGCCATGGT CAACTCGGTG CTGATGTACG AGGTGCTGGG CTATCCGCCC
GAGCACCCGC AGGTGAAGAT CGCGCTGGAA GCCATCGAGA AGCTCGTCGC CGAGAAGGAG
GACGAGGCCT ATGTCCAGCC CTGCCTGTCC CCGGTCTGGG ATACGGCGCT GAACAGCCAC
GCCATGCTCG AAGCCGGCGG CCATCAGGCG GAGGCCAATG CCCGCGCCGG CCTCGATTGG
CTGAAGCCGC TCCAGATTCT CGACATCAAG GGCGACTGGG CCGAGACCAA GCCCAACGTG
CGCCCCGGCG GCTGGGCGTT CCAGTACGCC AACCCGCACT ATCCCGACCT CGACGACACC
GCCGTCGTCG TGATGGCGAT GGATCGCGCC CAGCGTCAGC ACGGGCTGGT CAGCGGCATG
CCGGACTATT CGGAGTCGAT CGCCCGTGCC CGCGAGTGGG TCGAGGGGCT TCAGAGCGCC
GATGGCGGCT GGGCGGCGTT CGATGCCGAC AACAACCACC ACTATCTCAA CCACATCCCG
TTCTCGGATC ACGGCGCGCT GCTCGATCCG CCGACCGCGG ACGTGACCGC CCGCGTCGTC
TCGATGCTGT CGCAGCTCGG CGAGACCCGC GCGACCAGCC GGGCGCTCGA CCGCGGCGTG
ACCTACCTGC TCAACGACCA GGAGAAGGAC GGGAGCTGGT ACGGCCGCTG GGGCATGAAC
TTCATCTACG GCACGTGGTC GGTGCTGTGC GCGCTCAACG CCGCCGGGGT CGATCCGCAG
TCGCCTGAGA TCCGCAAGGC GGTGGCGTGG CTCATCCGCA TCCAGAACCC GGATGGCGGC
TGGGGCGAGG ATGCCTCCTC CTACAAGCTC AACCCCGAAT TCGAGCCGGG CTACTCGACC
GCCTCGCAGA CGGCCTGGGC CCTGCTCGCG CTGATGGCGG CGGGCGAGGT CGATGATCCG
GCGGTCGCCC GCGGCGTCAA CTACCTCGTG CGCACGCAGG GGCAGGACGG GCTGTGGAGC
GAGGAGCGCT ACACCGCGAC CGGCTTCCCG CGGGTGTTCT ACCTGCGCTA CCACGGCTAC
CCGAAGTTCT TCCCGCTCTG GGCGATGGCC CGCTTCCGCA ACCTGAAGCG GGGCAACAGC
CGGCAGGTGC AGTTCGGCAT GTGA
 
Protein sequence
MREAAVSKVE TLQRPKTRDV SLDDVERGVQ SATRALTEMT QADGHICFEL EADATIPSEY 
ILFHQFRGTE PRPGLEAKIG NYLRRTQSKV HGGWALVHDG PFDMSASVKA YFALKMIGDD
IEAPHMRAVR KAILQRGGAA NANVFTRILL ALYGEVPWAA VPVMPVEVMH LPKWFPFHLD
KVSYWARCTM VPLFVIQAKK PRAKNPRGVG VAELFVTPPD SVRTWPGSPH ATWPWTPIFG
GIDRVLQKTQ DHFPKVPRQR AIDKAVAWVS ERLNGEDGLG AIFPAMVNSV LMYEVLGYPP
EHPQVKIALE AIEKLVAEKE DEAYVQPCLS PVWDTALNSH AMLEAGGHQA EANARAGLDW
LKPLQILDIK GDWAETKPNV RPGGWAFQYA NPHYPDLDDT AVVVMAMDRA QRQHGLVSGM
PDYSESIARA REWVEGLQSA DGGWAAFDAD NNHHYLNHIP FSDHGALLDP PTADVTARVV
SMLSQLGETR ATSRALDRGV TYLLNDQEKD GSWYGRWGMN FIYGTWSVLC ALNAAGVDPQ
SPEIRKAVAW LIRIQNPDGG WGEDASSYKL NPEFEPGYST ASQTAWALLA LMAAGEVDDP
AVARGVNYLV RTQGQDGLWS EERYTATGFP RVFYLRYHGY PKFFPLWAMA RFRNLKRGNS
RQVQFGM