Gene Rmet_4148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_4148 
SymbolsqhC 
ID4041006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp722723 
End bp724732 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content68% 
IMG OID637979571 
Productsqualene-hopene cyclase 
Protein accessionYP_586284 
Protein GI94313075 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.146424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.598571 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGG CCGCAACAAT AACCCGCCCC CAGGATGAAA CGCTGACCAC CAGCGCCCGT 
CGCCCTGCCC AGCCGGCCCT GCCCGATCCG CTCGATGCCG GCATCGCCCA TGTCGTCGAA
TCCCTGCTGG CGCAACAGCA GTCCGATGGC CACTGGGTCT ACGAACTCGA AGCCGACGCC
ACGATTCCGG CCGAGTACAT CCTGATGGTC CACTACCTCG GCGAGACGCC GGATCTGGTA
CTGGAAGGCA AGATCGCCAA CTACCTGCGC CGCATCCAGA ACGCCGATGG CGGCTGGCCG
CTGTTCCACG CCGGCGCCTC GGACATCAGT GCCAGCGTGA AGGGCTACTT CGCGCTGAAG
ATGGCCGGCG ACAACCCGGA AGCCGAACAC ATGCGCCGCG CGCGCGCCGC GATCCACGCG
ATGGGCGGCG CCGAGGCCAG CAACGTATTC ACGCGCACGC TGCTCGCGCT GTACGGCGTG
ATGCCGTGGC AGGCGGTGCC GATGATGCCG GTGGAGATCA TGCTGCTGCC CGAGTGGTTT
CCGTTCCATC TGTCGAAGGT GTCGTACTGG GCCCGCACGG TCATCGTGCC GCTGCTGGTG
CTCAACAGCC TGCGTCCGCA GGCGCGCAAT CCGCGCAAGA TCGGCATCGA CGAATTGTTC
GTACGCCCGT GCCAGGCCAC CCGCCTGCCG CGCCGCGCGC CGCACCAGAG CCCGCTATGG
GTCGGCGTGT TCCGCACGCT CGACGCCGTG GTGCGCATGG CCGAGCCACT GTTCCCGCGT
GGCCTGCGCC AGCGGGCCAT CGAGCGCGCG CGTGAATTCA CGGTCGAACG GCTGAACGGC
GAAGACGGCC TCGGCGCGAT CTTCCCGGCG ATGGTCAACT CAGTGCTGAT GTTCGACGTC
CTCGGCGTGC CCGAGAGCGA CCCGAACCGG GCCATCGCCC GCCGCTCGAT CGACAAGCTG
CTCGTGATCA AGGATGACGA GGCTTACTGC CAGCCATGCC TGTCGCCGGT CTGGGATACG
TCGCTGGCCG CCCATGCCTT GCTGGAAGTC GGCGAGCCGC GCACGATCGC GGCTGCGGCA
CGCGGGCTGG ACTGGCTGCT GCCGCTGCAG GAACTCGAAC TGCGCGGCGA CTGGACGGTG
CGCCGTCCCA ACGTCCGCCC GGGCGGCTGG GCTTTCCAGT ACGCCAACCC TCACTACCCC
GACGTGGACG ACACCGCCGT GGTGGCTGCC GCGATGGACC GGGTGGACAA GGGCGACCGC
TCCAACCGTT ATGACGAGGC CGTATCCCGC GCCTGCGAGT GGATCGTCGG CATGCAAAGC
AGCAACGGTG GCTGGGGCGC GTTCGAACCC GAGAACACGC ACCTTTACCT GAACAACATC
CCGTTCGCCG ATCACGGCGC GCTGCTCGAT CCGCCCACGG CCGACGTGTC CGCGCGTTGT
CTGGCGATGC TGTGCCAGCT AGGCCAGATG CCGGCCAACA GCGAGCCGGC CGCGCGCGCG
CTGCGCTACC TGCTCGACGA ACAGGAAGCC GACGGAAGCT GGTTCGGCCG CTGGGGCACC
AACTATATCT ATGGCACGTG GAGCGCGCTG TGCGGGCTGA ACGCCGCCGG CATCGGCACG
GACGCGCCCG AGATGAAGCG CGCGGCCCAA TGGCTGCTGT CGATCCAGAA CGAGGATGGC
GGCTGGGGCG AGTCGGGCGA CAGCTACAAG CTCGAGTATC GGGGCTATGA AAAGGCGCCG
AGCACGGCAT CGCAAACCGC CTGGGCCATG CTCGGCCTGA TGGCCGCCGG CGCAGGCGAC
CACCCCGCCC TGGTGCGCGG CGTCGAGTAC CTGCTGCGCA CGCAGGCCAG CCATGGTTTC
TGGGACGAGC CGTACTTCAC GGCGGTGGGT TTCCCTCGCG TGTTCTATCT GCGCTACCAC
GGCTATTCGC GCTTCTTCCC GCTCTGGGCA CTGGCGCGCT TCCGCAACCT GCTGCGCGAT
GGCAATCGCG CCATCTCCTG GGGGCTCTGA
 
Protein sequence
MNQAATITRP QDETLTTSAR RPAQPALPDP LDAGIAHVVE SLLAQQQSDG HWVYELEADA 
TIPAEYILMV HYLGETPDLV LEGKIANYLR RIQNADGGWP LFHAGASDIS ASVKGYFALK
MAGDNPEAEH MRRARAAIHA MGGAEASNVF TRTLLALYGV MPWQAVPMMP VEIMLLPEWF
PFHLSKVSYW ARTVIVPLLV LNSLRPQARN PRKIGIDELF VRPCQATRLP RRAPHQSPLW
VGVFRTLDAV VRMAEPLFPR GLRQRAIERA REFTVERLNG EDGLGAIFPA MVNSVLMFDV
LGVPESDPNR AIARRSIDKL LVIKDDEAYC QPCLSPVWDT SLAAHALLEV GEPRTIAAAA
RGLDWLLPLQ ELELRGDWTV RRPNVRPGGW AFQYANPHYP DVDDTAVVAA AMDRVDKGDR
SNRYDEAVSR ACEWIVGMQS SNGGWGAFEP ENTHLYLNNI PFADHGALLD PPTADVSARC
LAMLCQLGQM PANSEPAARA LRYLLDEQEA DGSWFGRWGT NYIYGTWSAL CGLNAAGIGT
DAPEMKRAAQ WLLSIQNEDG GWGESGDSYK LEYRGYEKAP STASQTAWAM LGLMAAGAGD
HPALVRGVEY LLRTQASHGF WDEPYFTAVG FPRVFYLRYH GYSRFFPLWA LARFRNLLRD
GNRAISWGL