Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bmul_3175 |
Symbol | |
ID | 5769427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia multivorans ATCC 17616 |
Kingdom | Bacteria |
Replicon accession | NC_010086 |
Strand | + |
Start bp | 18897 |
End bp | 20870 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641317473 |
Product | squalene-hopene cyclase |
Protein accession | YP_001583151 |
Protein GI | 161519724 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.107081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATC TCACCGATAT GGCTACCCTG TCGGCCGGCA CCGTGCCGGC CGAGCTCGAT GCGGCCGTCG CACGCGCGAC CGACGCGCTG CTCGCCGCGC AGAACGCGGA CGGGCACTGG GTCTACGAAC TCGAAGCCGA TTCGACGATT CCCGCCGAAT ACGTGCTGCT CGTCCACTAT CTCGGCGAGA CGCCGAACCT CGAGCTCGAA CAGAAGATCG GCCGCTATCT GCGCCGGATC CAGCAGGCCG ACGGCGGCTG GCCGCTGTTT ACCGACGGCG CACCGAACAT CAGCGCGAGC GTGAAGGCGT ACTTCGCGCT GAAGGTGATC GGCGACGACG AAAACGCCGA GCACATGCAG CGCGCGCGTC GCGCGATCCA CGCGATGGGC GGCGCCGAGA TGTCGAACGT GTTCACGCGG ATTCAGCTCG CGTTGTACGG CGCGATTCCG TGGCGCGCGG TGCCGATGAT GCCGGTCGAG ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CATCTGTCGA AGGTGTCGTA CTGGGCGCGC ACGGTGATCG TGCCGCTGCT CGTGCTGAAT GCGAAGCGAC CGCTTGCGAA GAATCCGCGC GGCGTCCGCA TCGACGAACT GTTCATCGAT CCGCCCGTCA ACGCGGGCCT GCTGCCGCGC CAGGGTCACC AGAGCGCCGG CTGGTTCGCG TTCTTCCGTG CGGTCGATCA CGTGCTGCGC GCGGTCGACG GGCTGTTTCC GGCTTATACG CGCGAGCGCG CGATTCGTCA GGCCGTCGCG TTCGTCGACG AGCGGCTGAA CGGCGAGGAC GGCCTCGGCG CGATCTATCC GGCGATGGCC AACGCGGTGA TGATGTACGA CGTGCTCGGC TACGCGGAAG ATCATCCGAA TCGCGCGATC GCGCGCAAGT CGATCGAGAA GCTGCTCGTC GTGCACGAAG ACGAAGCGTA TTGCCAGCCG TGTCTGTCGC CCGTCTGGGA TACCTCGCTC GCCGCGCATG CGCTGCTCGA GACGCGCGAT CCGCGCGCCG AGCAGGCGGC CGTGCGCGGC CTCGACTGGC TGCGTCCGCT GCAGATCCTC GACGTGCGCG GCGACTGGAT CTCGCGCCGT CCGCACGTGC GTCCCGGCGG CTGGGCGTTC CAGTACGCGA ACCCGCACTA CCCCGACGTC GACGATACGG CCGTCGTCGC GATGGCGATG GATCGCGCGC AAAAGCTGAA CCAGTCCGAC ACGTATCGCG AGTCGATCGC GCGTGCGCGC GAGTGGGTCG TCGGCATGCA GAGCAGCGAC GGCGGCTGGG GCGCGTTCGA GCCGGAAAAC ACGCAGTACT ACCTGAACAA CATCCCGTTC TCCGATCACG GCGCGCTGCT CGATCCGCCG ACGGCCGACG TGTCGGGCCG CTGTCTGTCG ATGCTGTCGC AGCTCGGCGA AACCGCGCTG AACAGCGACG CGGCGCGCCG CGCGCTCGAC TACATGCTGA AGGAGCAGGA GCCGGACGGC AGCTGGTACG GCCGCTGGGG GATGAACTAC GTGTACGGCA CGTGGACGGC GCTGTGCGCG CTGAATGCGG CTGGCCTCGG GCCCGACGAC GCGCGCGTGA AGCGCGCGGC GCAATGGCTG CTGTCGATCC AGAACAAGGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTGAAC TACCGCGGCT ACGAGCCCGC ACCGAGCACG GCATCGCAGA CGGCCTGGGC GCTGCTCGGC CTGATGGCGG CCGGCGAGGT GAACAACCCG GCGGTCAAGC GCGGGATCGA CTATCTGATC GCCGAACAGA AGGAGCACGG CCTGTGGGAC GAAGCGCGCT TCACTGCGAC CGGCTTCCCG CGCGTGTTCT ATCTGCGCTA TCACGGCTAT CGCAAGTTCT TCCCGCTGTG GGCGCTGGCG CGCTACCGCA ACCTGAAGCG CGACAACACG ACGCGCGTCA CGGTCGGGAT CTGA
|
Protein sequence | MNDLTDMATL SAGTVPAELD AAVARATDAL LAAQNADGHW VYELEADSTI PAEYVLLVHY LGETPNLELE QKIGRYLRRI QQADGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ RARRAIHAMG GAEMSNVFTR IQLALYGAIP WRAVPMMPVE IMLLPQWFPF HLSKVSYWAR TVIVPLLVLN AKRPLAKNPR GVRIDELFID PPVNAGLLPR QGHQSAGWFA FFRAVDHVLR AVDGLFPAYT RERAIRQAVA FVDERLNGED GLGAIYPAMA NAVMMYDVLG YAEDHPNRAI ARKSIEKLLV VHEDEAYCQP CLSPVWDTSL AAHALLETRD PRAEQAAVRG LDWLRPLQIL DVRGDWISRR PHVRPGGWAF QYANPHYPDV DDTAVVAMAM DRAQKLNQSD TYRESIARAR EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLSQLGETAL NSDAARRALD YMLKEQEPDG SWYGRWGMNY VYGTWTALCA LNAAGLGPDD ARVKRAAQWL LSIQNKDGGW GEDGDSYKLN YRGYEPAPST ASQTAWALLG LMAAGEVNNP AVKRGIDYLI AEQKEHGLWD EARFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRDNT TRVTVGI
|
| |