Gene Bphy_4137 details

Gene Information

Locus tag: Bphy_4137
Symbol:
ID: 6245665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phymatum STM815
Kingdom: Bacteria
Replicon accession: NC_010623
Strand:
Start bp: 1126082
End bp: 1128115
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 66%
IMG OID: 642595897
Product: squalene-hopene cyclase
Protein accession: YP_001860304
Protein GI: 186472962
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
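
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 1126082
END_BP = 1128115
GENE_LENGTH_BP = 2034
PROTEIN_LENGTH_AA = 677


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values against the recorded fields.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: 1128115 - 1126082 + 1 = 2034 bp, and 2034 / 3 - 1 = 677 aa.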


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGATT TATCCATGAC CCAGACGCTG GGCGAAGTAC TGCCTCAAAC GCTGATCGAC 
GATCATGCTC CCGTTGCAGC CGCGCTGGCG ACGGGCGCCG CACCCGTCGA TGCGCTCGAC
GCCGCCGTCA CGCGCGCCAC GGAGGCGATC CTTGCGGTGC AAAAAGACGA CGGCCACTGG
GTCTACGAAC TCGAAGCCGA CGCGACGATT CCCGCCGAAT ACGTGCTGCT CGTCCACTTC
CTGGGCGAAA CGCCGAACCT CGAACTCGAA CAGAAGATCG CGCGCTATCT GCGCCGCATC
CAGTTGCCCA ACGGCGGCTG GCCGCTCTTC ACGGACGGCG CGATGGACGT CAGCGCGAGC
GTCAAGGCGT ACTTCGCGCT GAAGATGATC GGCGATCCGG AAGACGCCGC GCACATGGTC
CGCGCGCGCG AGTGCATTCT CGCGAACGGC GGCGCGGAAG CGGCGAACGT GTTCACGCGC
ATCCTGCTCG CGCTATTCGG CGTGGTCACG TGGTACGCCG TGCCGATGAT GCCCGTCGAA
ATCATGCTGC TGCCCAAGTG GTTCCCGTTC CACCTGTCCA AGGTGTCGTA TTGGGCGCGC
ACGGTGATCG TGCCGCTGCT CGTGCTGAAC GCGAAACGGC CTGTCGCGCG CAATCCGCGC
GGCGTGCGCA TCGACGAGCT GTTCCGCGGC GCGCCCGTCA CGACGGGTCT GCTGCCGCGC
TCGGGTCACC AGAGCAAGAG CTGGTTCGCG TTCTTCCGCG CCGTCGACGG CGTGCTGCGC
GTAACAGATG GCCTGTTCCC GAAAGCATCG CGCGAGCGCG CGATCAAGGC GGCCGTCAGC
TTCGTCGATG AACGCCTGAA CGGCGTGGAC GGCCTCGGCG CGATTTTCCC GGCGATGGCG
AACTCGGTGA TGATGTACGA CGTGCTCGGC TACCCCGCCG ACCACCCGAA TCGCGCGATC
GCCCGCGAGT CGATCGAAAA ACTGCTCGTC GTCCACGAAG ACGAGGCGTA TTGCCAGCCG
TGTCTGTCGC CCGTGTGGGA CACATCCCTT GCGGCGCACG CGCTGCTCGA AACGGGCGAC
GCGCGCGCCG AAGAAGCCGC CGAACGCGGC CTCGCCTGGC TGCGTCCGCT ACAGATCCTC
GATGTGCGCG GCGACTGGAT TTCGCGTCGT CCCGACGTGC GGCCGGGTGG CTGGGCGTTC
CAGTACAACA ACGCACATTA CCCAGATGTC GACGATACGG CCGTCGTCGC GATGGCGATG
CACCGTTCGG CGGCAGTCAC GAACTCGAAC GTCGATGCCA ACGCGATTGC GCGCGCCCGC
GAATGGGTGG TCGGCATGCA AAGCAGCGAC GGCGGCTGGG GCGCGTTCGA GCCGGAAAAC
ACGCAGTACT ACTTGAACAA TATCCCGTTC TCGGATCATG GAGCGCTGCT CGATCCGCCC
ACGGCCGACG TGTCAGGCCG TTGCCTGTCG ATGCTCGCGC AACTCGGCGA GATGCCCGCG
ACCAGTGAGC CCGCCCGCCG CGCATACGAC TATCTGCTGA AGGAGCAGGA AGACGACGGC
AGCTGGTACG GCCGGTGGGG CATGAACTAC ATCTACGGCA CGTGGACGGC GCTGTGCGCG
CTCAACGCAG CGGGCATCTC GCTCGAAGAC GCGCGCATCA AGCGCGCCGC GCAGTGGCTG
GTGTCGATCC AGAACGCGGA CGGCGGCTGG GGTGAAGACG GCACGAGCTA CAAGCTCGAC
TATCGCGGCT ACGAAAAGGC GCCGAGTATT CCGTCGCAAA CCGCTTGGGC GCTGCTGGGC
CTGATGGCGG CCGGTTACGT CGATCATCCC GCCGTCGCGC GCGGTATCGA CTATCTGCAA
CGGGAACAGC GCGACCACGG GCTGTGGGAC GAAGAGCGCT TTTCGGCGAC GGGCTTCCCG
CGCGTCTTCT ATCTGCGCTA CCACGGTTAT CGCAAGTACT TCCCGCTGTG GGCGCTTGCG
CGCTACCGCA ACCTGAAGCG CACGGGCGAA AAGCGCGTCA CCGTCGGCAT GTAA
 
Protein sequence
MNDLSMTQTL GEVLPQTLID DHAPVAAALA TGAAPVDALD AAVTRATEAI LAVQKDDGHW 
VYELEADATI PAEYVLLVHF LGETPNLELE QKIARYLRRI QLPNGGWPLF TDGAMDVSAS
VKAYFALKMI GDPEDAAHMV RARECILANG GAEAANVFTR ILLALFGVVT WYAVPMMPVE
IMLLPKWFPF HLSKVSYWAR TVIVPLLVLN AKRPVARNPR GVRIDELFRG APVTTGLLPR
SGHQSKSWFA FFRAVDGVLR VTDGLFPKAS RERAIKAAVS FVDERLNGVD GLGAIFPAMA
NSVMMYDVLG YPADHPNRAI ARESIEKLLV VHEDEAYCQP CLSPVWDTSL AAHALLETGD
ARAEEAAERG LAWLRPLQIL DVRGDWISRR PDVRPGGWAF QYNNAHYPDV DDTAVVAMAM
HRSAAVTNSN VDANAIARAR EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP
TADVSGRCLS MLAQLGEMPA TSEPARRAYD YLLKEQEDDG SWYGRWGMNY IYGTWTALCA
LNAAGISLED ARIKRAAQWL VSIQNADGGW GEDGTSYKLD YRGYEKAPSI PSQTAWALLG
LMAAGYVDHP AVARGIDYLQ REQRDHGLWD EERFSATGFP RVFYLRYHGY RKYFPLWALA
RYRNLKRTGE KRVTVGM
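
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of how one might do that and recompute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace and uppercasing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence only; paste the full blocked
    # sequence here to compare against the recorded GC content.
    first_line = (
        "ATGAACGATT TATCCATGAC CCAGACGCTG "
        "GGCGAAGTAC TGCCTCAAAC GCTGATCGAC"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```

Splitting on whitespace rather than on single spaces means line breaks and trailing spaces in the pasted text are handled the same way as the block separators.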