Gene Information |
Locus tag | Bcep18194_B0019 |
Symbol | |
ID | 3751776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | + |
Start bp | 19213 |
End bp | 21186 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637764865 |
Product | Terpene synthase/squalene cyclase |
Protein accession | YP_370780 |
Protein GI | 78060872 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
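The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 19213
end_bp = 21186

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the results can be compared directly with the Gene Length and Protein Length rows.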
| |
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATC TCACCGAAAT GGCTACCCTG TCCGCCGGCG CCGTGCCGGC CGGCGTCGAT ACGGCCGTCG CGCGTGCGAC CGACGCGCTG CTGGCCGCGC AGAACGCGGA TGGCCACTGG GTCTACGAAC TCGAAGCCGA TTCGACGATT CCCGCCGAAT ACGTGCTGCT CGTCCACTAT CTCGGCGAGA CGCCGAACCT CGAGCTCGAG CAGAAGATCG GCAAGTATCT GCGCCGCATC CAGCAGGCCG ACGGCGGCTG GCCGCTGTTC ACCGACGGTG CGCCGAACAT CAGCGCGAGC GTGAAGGCGT ATTTCGCGCT GAAGGTGATC GGCGACGACG AGAACGCCGA GCACATGCAG CGCGCGCGCC GTGCGATCCA CGCGATGGGC GGCGCCGAGA TGTCGAACGT GTTCACGCGC ATCCAGCTCG CGCTGTACGG TGCGATTCCG TGGCGCGCGG TGCCGATGAT GCCGGTCGAG ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CATCTGTCGA AGGTGTCGTA CTGGGCGCGC ACCGTGATCG TGCCGCTGCT CGTGCTGAAC GCGAAGCGCC CGCTCGCGAA GAACCCGCGC GGCGTGCGCA TCGACGAACT GTTCATCGAT CCGCCCGTCA ACGCCGGGCT GCTGCCGCGC CAGGGCCACC AGAGCGCCGG CTGGTTCGCG TTCTTCCGTG TGGTCGACCA TGCGCTGCGC GCGGTCGACG GCCTGTTCCC GAACTACACG CGCGAACGCG CGATCCGCCA GGCCGTGTCG TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGCCTCGGCG CGATCTATCC GGCGATGGCC AACTCGGTGA TGATGTACGA CGTGCTCGGC TACGCGGAAG ATCATCCGAA TCGCGCAATC GCGCGCAAGT CGATCGAGAA GCTGCTCGTC GTGCAGGAGG ACGAAGCGTA TTGCCAGCCC TGCCTGTCGC CGGTGTGGGA CACGTCGCTC GCCGCGAATG CGCTGCTCGA AACGCGCGAC GCGCGTGCCG AGGATGCGGC GATCCGCGGC CTCGAATGGC TGCGCCCGCT GCAGATCCTC GACGTGCGCG GCGACTGGAT CTCGCGCCGC CCGCACGTGC GGCCCGGCGG CTGGGCGTTC CAGTACGCGA ACGCGCACTA CCCTGACGTC GACGATACGG CGGTGGTCGC GGTGGCGATG GAGCGCGCGC AGCAACTCAA GCAGAACGAT GCGTATCGCG ATTCGATCGC GCGTGCGCGC GAGTGGGTCG TCGGGATGCA GAGCAGCGAC GGCGGCTGGG GTGCGTTCGA ACCGGAAAAC ACGCAGTACT ACCTGAACAA CATCCCGTTC TCGGATCACG GCGCACTGCT CGATCCGCCG ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTGTCGC AGCTCGGCGA GACGCCGCTG AACAGCGAAC CGGCCCGTCG CGCACTCGAC TACATGCTCA AGGAACAGGA ACCGGACGGC AGCTGGTATG GCCGCTGGGG GATGAACTAC GTGTACGGCA CGTGGACGGC ACTGTGCTCG CTGAACGCGG CCGGCCTGAC GCCGGACGAT CCGCGCGTGA AGCGCGGCGC GCAGTGGCTG CTGTCGATCC AGAACAAGGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTCAAC TATCGCGGCT TCGAGCAGGC GCCGAGCACC GCGTCGCAAA CCGCGTGGGC GCTGCTCGGC CTGATGGCGG CCGGCGAAGT GAACAACCCG GCGGTGGCGC GCGGCATCGA CTACCTGATC 
GCCGAGCAGA ACGCAGAGGG CTTGTGGGAC GAAACGCGCT TCACGGCGAC GGGCTTCCCG CGCGTGTTCT ACCTGCGCTA CCACGGCTAT CGCAAGTTCT TCCCGCTGTG GGCGCTCGCC CGCTATCGCA ACCTGAAGCG CGACAACACG ACGCGCGTGA CGGTCGGGCT GTAA
|
Protein sequence | MNDLTEMATL SAGAVPAGVD TAVARATDAL LAAQNADGHW VYELEADSTI PAEYVLLVHY LGETPNLELE QKIGKYLRRI QQADGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ RARRAIHAMG GAEMSNVFTR IQLALYGAIP WRAVPMMPVE IMLLPQWFPF HLSKVSYWAR TVIVPLLVLN AKRPLAKNPR GVRIDELFID PPVNAGLLPR QGHQSAGWFA FFRVVDHALR AVDGLFPNYT RERAIRQAVS FVDERLNGED GLGAIYPAMA NSVMMYDVLG YAEDHPNRAI ARKSIEKLLV VQEDEAYCQP CLSPVWDTSL AANALLETRD ARAEDAAIRG LEWLRPLQIL DVRGDWISRR PHVRPGGWAF QYANAHYPDV DDTAVVAVAM ERAQQLKQND AYRDSIARAR EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLSQLGETPL NSEPARRALD YMLKEQEPDG SWYGRWGMNY VYGTWTALCS LNAAGLTPDD PRVKRGAQWL LSIQNKDGGW GEDGDSYKLN YRGFEQAPST ASQTAWALLG LMAAGEVNNP AVARGIDYLI AEQNAEGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRDNT TRVTVGL
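The GC content and protein sequence fields can in principle be rederived from the gene sequence. The following is a minimal sketch, not the database's own pipeline: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard assignment, which translation table 11 shares for amino acids (table 11 differs mainly in which codons may act as start codons). The demo uses only the first 30 bases of the gene sequence shown above.

```python
# Hypothetical helpers for checking the record; names are illustrative.
BASES = "TCAG"
# Standard amino acid assignments in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGAACGATC TCACCGAAAT GGCTACCCTG"
print(translate(prefix))
print(gc_content(prefix))
```

Passing the full gene sequence (both wrapped lines joined) to the same functions gives values that can be compared against the GC content and Protein sequence rows.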