Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcenmc03_4556 |
Symbol | |
ID | 6127378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia MC0-3 |
Kingdom | Bacteria |
Replicon accession | NC_010515 |
Strand | + |
Start bp | 1542008 |
End bp | 1543981 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641651653 |
Product | squalene-hopene cyclase |
Protein accession | YP_001778186 |
Protein GI | 170736926 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.282201 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.128502 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATC TCACCGAAAT GGCTACCTTG TCCGCCGGCG CCGTGCCGGC AGGCGTCGAC ACGGCGGTCG CGCGTGCGAC CGACGCGCTG CTCGCCGCGC AGCAAGCGGA TGGCCACTGG GTCTACGAAC TCGAGGCGGA TTCGACGATT CCCGCCGAAT ACGTGCTGCT CGTCCACTAT CTGGGCGAGA CGCCGAACCT CGAGCTCGAA CAGAAGATCG GCAAGTATCT GCGCCGCATC CAGCAGGCCG ACGGCGGCTG GCCGCTGTTC ACCGACGGCG CGCCGAACAT CAGCGCGAGC GTGAAGGCGT ATTTCGCGCT GAAGGTGATC GGCGATGATG AAAACGCCGA GCACATGCAG CGCGCGCGCC GTGCCATCCA TGCGATGGGC GGCGCCGAGA TGTCGAACGT GTTCACGCGC ATCCAGCTCG CGCTGTACGG GGCGATTCCG TGGCGCGCGG TGCCGATGAT GCCGGTCGAG ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CACCTGTCGA AGGTGTCGTA CTGGGCGCGC ACCGTGATCG TGCCGCTGCT CGTGCTGAAC GCGAAGCGTC CGCTCGCGAA GAACCCGCGC GGCGTGCGCA TCGACGAGCT GTTCATCGAT CCGCCCGTCA ACGCCGGGCT GCTGCCGCGC CAGGGCCATC AGAGCGCCGG CTGGTTCGCG TTTTTCCGCG TGGTCGACCA TGCGCTGCGC GCGGTCGACG GCCTGTTCCC GAGCTATACG CGCGAACGCG CGATCCGTCA GGCTGTGTCG TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGCCTCGGCG CGATCTATCC GGCGATGGCC AACGCGGTGA TGATGTACGA CGTGCTCGGC TACGCCGAAG ATCATCCGAA CCGTGCGATC GCGCGCAAGG CGCTCGAGAA GCTGCTCGTC GTGCACGACG ACGAGGCGTA TTGCCAGCCG TGCCTGTCGC CCGTATGGGA TACGTCGCTC GTCGCGCATG CGCTGCTCGA AACTGGCGAT GCGCGTGCCG AGGAAGCCGT GCTCCGCGGC CTCGAATGGC TGCGCCCGCT GCAGATTCTC GACGTGCGCG GCGACTGGAT CTCGCGCCGC CCGAACGTGC GGCCCGGCGG CTGGGCGTTC CAGTACGCGA ACGCGCACTA CCCTGACGTC GACGATACGG CCGTGGTCGT GATGGCGATG GATCGCGCGC AAAAGCTCAA ACAGTCGGAC ACGTATCGCG AATCGATGGC GCGGGCGCGC GAATGGGTCG TCGGCATGCA GAGCAGCGAC GGCGGCTGGG GCGCGTTCGA ACCGGAAAAC ACGCAGTACT ACCTGAACAA CATCCCGTTC TCCGATCACG GCGCGCTGCT CGATCCGCCG ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTGTCGC AGCTCGGCGA GACGCCGCTG AACAGCGAGC CGGCCCGTCG CGCGCTCGAC TACATGCTGA AGGAACAGGA GCCGGACGGC AGCTGGTACG GTCGCTGGGG GATGAACTAC GTGTACGGCA CGTGGACGGC GCTGTGCTCG CTGAACGCGG CCGGTCTGAC GCCGGACGAC CCGCGCATGA AGCGCGCTGC GCAGTGGCTG CTGTCGATCC AGAACAAGGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTGAAC TACCGGGGTT ACGAGCAGGC GCCGAGCACG GCGTCGCAGA CGGCCTGGGC GTTGCTCGGC CTGATGGCGG CCGGCGAGGT GAACAACCCG GCCGTGGCAC GCGGCGTCGA CTACCTGGTC GCTCAGCAGA ACGAAGAAGG CTTGTGGGAC GAGACGCGCT TCACGGCAAC GGGCTTCCCG CGCGTGTTCT ACCTGCGCTA CCACGGCTAT CGGAAGTTCT TCCCGCTGTG GGCGCTCGCG CGCTACCGCA ACCTGAAGCG CGCGAACGCC ACGCGCGTGA CGGTCGGGAT GTAA
|
Protein sequence | MNDLTEMATL SAGAVPAGVD TAVARATDAL LAAQQADGHW VYELEADSTI PAEYVLLVHY LGETPNLELE QKIGKYLRRI QQADGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ RARRAIHAMG GAEMSNVFTR IQLALYGAIP WRAVPMMPVE IMLLPQWFPF HLSKVSYWAR TVIVPLLVLN AKRPLAKNPR GVRIDELFID PPVNAGLLPR QGHQSAGWFA FFRVVDHALR AVDGLFPSYT RERAIRQAVS FVDERLNGED GLGAIYPAMA NAVMMYDVLG YAEDHPNRAI ARKALEKLLV VHDDEAYCQP CLSPVWDTSL VAHALLETGD ARAEEAVLRG LEWLRPLQIL DVRGDWISRR PNVRPGGWAF QYANAHYPDV DDTAVVVMAM DRAQKLKQSD TYRESMARAR EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLSQLGETPL NSEPARRALD YMLKEQEPDG SWYGRWGMNY VYGTWTALCS LNAAGLTPDD PRMKRAAQWL LSIQNKDGGW GEDGDSYKLN YRGYEQAPST ASQTAWALLG LMAAGEVNNP AVARGVDYLV AQQNEEGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRANA TRVTVGM
|
| |