Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen2424_5679 |
Symbol | |
ID | 4452350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia HI2424 |
Kingdom | Bacteria |
Replicon accession | NC_008543 |
Strand | - |
Start bp | 2800595 |
End bp | 2802568 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639697740 |
Product | squalene-hopene cyclase |
Protein accession | YP_839305 |
Protein GI | 116693772 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.126345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00244558 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGATC TCACCGAAAT GGCTACCTTG TCCGCCGGCG CCGTGCCGGC CGGCGTCGAC GCGGCTGTCG CGCGTGCGAC CGACGCGCTG CTCGCCGCGC AGCAAGCGGA TGGCCACTGG GTCTACGAAC TCGAGGCGGA TTCGACGATT CCCGCCGAAT ACGTGCTGCT CGTCCACTAT CTTGGCGAGA CGCCGAACCT CGAGCTCGAA CAGAAGATCG GCAAGTATCT GCGCCGCATC CAGCAGGCCG ACGGCGGCTG GCCGCTGTTC ACCGACGGCG CGCCGAACAT CAGCGCGAGC GTGAAGGCGT ATTTCGCGCT GAAGGTGATC GGCGACGACG AGAACGCCGA GCACATGCAG CGCGCGCGCC GTGCCATCCA TGCGATGGGC GGCGCCGAGA TGTCGAACGT GTTCACGCGC ATCCAGCTCG CGCTGTACGG TGCGATTCCG TGGCGCGCGG TGCCGATGAT GCCGGTCGAG ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CACCTGTCGA AGGTGTCGTA CTGGGCGCGC ACCGTGATCG TGCCGCTGCT CGTGCTGAAC GCGAAGCGTC CGCTCGCGAA GAACCCGCGC GGCGTGCGCA TCGACGAGCT GTTCATCGAT CCGCCCGTCA ACGCCGGGCT GCTGCCGCGC CAGGGCCATC AGAGCGCGGG CTGGTTCGCG TTTTTCCGCG TGGTCGACCA TGCGCTGCGC GCGGTCGACG GCCTGTTCCC GAGCTATACG CGCGAACGCG CGATCCGTCA GGCCGTGTCG TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGCCTCGGCG CGATCTATCC GGCGATGGCC AACGCGGTGA TGATGTACGA CGTGCTCGGC TATGCGGAAG ATCATCCGAA CCGTGCGATC GCGCGCAAGG CGCTCGAGAA GCTGCTCGTC GTGCACGACG ACGAGGCGTA TTGCCAGCCG TGCCTGTCGC CCGTGTGGGA TACGTCGCTC GTCGCGCATG CGCTGCTCGA GACCGGCGAT GCGCGTGCCG AGGAAGCCGT GCTGCGCGGC CTCGAATGGC TGCGCCCGCT GCAGATCCTC GACGTGCGCG GCGACTGGAT CTCGCGCCGC CCGAACGTGC GGCCCGGCGG CTGGGCGTTC CAGTACGCGA ACGCGCACTA CCCTGACGTC GACGATACGG CCGTGGTCGT GATGGCGATG GATCGCGCGC AAAAGCTCAA GCAATCGGAC ACGTATCGCG AATCGATGGC GCGGGCGCGC GAATGGGTCG TCGGCATGCA GAGCAGCGAC GGCGGCTGGG GCGCGTTCGA ACCGGAAAAC ACGCAGTACT ACCTGAACAA CATCCCGTTC TCCGATCACG GCGCGCTGCT CGATCCGCCG ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTGTCGC AGCTCGGCGA GACGCCGCTG AACAGCGAGC CGGCCCGCCG CGCGCTCGAC TACATGCTGA AGGAACAGGA GCCGGACGGC AGCTGGTACG GCCGTTGGGG GATGAACTAC GTGTACGGCA CGTGGACGGC GCTGTGCTCG CTGAATGCGG CCGGCCTGAC GCCGGACGAC CCGCGCATGA AGCGCGCCGC GCAGTGGCTG CTGTCGATCC AGAACAAGGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTGAAC TACCGCGGTT ACGAGCAGGC GCCGAGCACG GCGTCGCAGA CGGCCTGGGC GCTGCTCGGC CTGATGGCGG CCGGCGAAGT GAACAACCCG GCCGTGGCGC GCGGCGTCGA CTACCTCGTC GCTCAGCAGA ACGAAGAAGG GCTGTGGGAC GAGACGCGCT TCACGGCAAC GGGCTTCCCG CGCGTGTTCT ACCTGCGCTA CCACGGTTAT CGCAAGTTCT TCCCGCTGTG GGCGCTGGCG CGCTACCGCA ACCTGAAGCG CGCGAACGCG ACGCGCGTGA CGGTCGGGAT GTAA
|
Protein sequence | MNDLTEMATL SAGAVPAGVD AAVARATDAL LAAQQADGHW VYELEADSTI PAEYVLLVHY LGETPNLELE QKIGKYLRRI QQADGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ RARRAIHAMG GAEMSNVFTR IQLALYGAIP WRAVPMMPVE IMLLPQWFPF HLSKVSYWAR TVIVPLLVLN AKRPLAKNPR GVRIDELFID PPVNAGLLPR QGHQSAGWFA FFRVVDHALR AVDGLFPSYT RERAIRQAVS FVDERLNGED GLGAIYPAMA NAVMMYDVLG YAEDHPNRAI ARKALEKLLV VHDDEAYCQP CLSPVWDTSL VAHALLETGD ARAEEAVLRG LEWLRPLQIL DVRGDWISRR PNVRPGGWAF QYANAHYPDV DDTAVVVMAM DRAQKLKQSD TYRESMARAR EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLSQLGETPL NSEPARRALD YMLKEQEPDG SWYGRWGMNY VYGTWTALCS LNAAGLTPDD PRMKRAAQWL LSIQNKDGGW GEDGDSYKLN YRGYEQAPST ASQTAWALLG LMAAGEVNNP AVARGVDYLV AQQNEEGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRANA TRVTVGM
|
| |