Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II2359 |
Symbol | shc |
ID | 3846364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 2896143 |
End bp | 2898116 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637839659 |
Product | squalene-hopene cyclase |
Protein accession | YP_440546 |
Protein GI | 83716953 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACACGC TCGACGCAAC CGCTGCGCCC GCCGCGCCCA CCGTGGCGAC CGGCCTCGAC GCCGCCGTCG CGCGCGCGAC CGACGCGCTG CTCGCCGCGC AGAACGCGGA CGGCCACTGG GTCTACGAGC TCGAAGCCGA TTCGACGATT CCCGCCGAAT ACGTGCTGCT CGTCCACTAT CTCGGCGAGG CGCCGAACGT CGAGCTCGAA CGGAAGATCG CGCGCTATCT GCGCCGCATC CAGCTGCCGG ACGGCGGCTG GCCGCTCTTC ACCGACGGCG CGCCGAACAT CAGCGCGAGC GTGAAGGCGT ACTTCGCGCT GAAGGTGATC GGCGACGACG AGAACGCCGA GCACATGCAG CGCGCGCGCC GCGCGATCCA CGCGATGGGC GGCGCGGAAA TGTCGAACGT GTTCACGCGA ATCCAGCTCG CGCTGTACGG CGTCGTGCCG TGGTACGCGG TGCCGATGAT GCCGGTCGAG ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CATCTGTCGA AAGTGTCGTA CTGGGCGCGC ACCGTGATCG TGCCGCTCCT CGTGCTGAAC GCGAAGCGCC CGGTCGCGAA GAATCCGCGC GGCGTGCGCA TCGACGAGCT GTTCAAGAGC GCGCCCGTCA ACACCGGTCT GCTGCCGAAG CAGCCGCACC AGAGCGCCGG CTGGTTTGCG TTCTTCCGCG CGGTGGACGG CGTGCTGCGC CTCACCGACG GCCTGTTCCC TCGCTATACG CGCGAGCGTG CGATCCGCCA GGCGGTTGCG TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGGCTCGGCG CGATCTATCC CGCCATGGCG AACGCGGTGA TGATGTACGC GGCGCTCGGC TATCCCGAGG ATCATCCGAA CCGCGCGATC GCGCGGCAGT CGATCGAGAA GCTGCTCGTC GTCGGCGAGG ACGAGGCGTA TTGCCAGCCG TGCCTGTCGC CGGTGTGGGA TACGTCGCTC GCCGCGCATG CGCTGCTCGA AACGGGCGAC GAGCGCGCGC GCGAAGCAGC CGTGCGCGGC CTCGACTGGC TCGTGCCGCG GCAGATCCTC GACGTGCGCG GCGACTGGAT CTCGCGCCGT CCGCACGTGC GCCCGGGCGG CTGGGCGTTC CAGTACGCGA ATGCGCACTA TCCGGACGTC GACGATACGG CGGTCGTCGC GATGGCGATG GATCGCGTCG CGAAGCTCGA TCGGACGGAC GCGTACCGCG AGTCGATCGC ACGCGCGCGC GAGTGGGTGG TCGGCATGCA GAGCAGCGAC GGCGGCTGGG GCGCGTTCGA GCCGGAAAAC ACGCAGTACT ACCTGAACAA CATCCCGTTC TCCGATCACG GCGCGCTGCT CGATCCGCCG ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTCGCGC AGTTCGGCGA GACGAGCGCG TCGAGCGAGC CCGCGCGCCG CGCGCTCGAT TACATGCTGA AAGAGCAGGA GCCGGACGGC AGCTGGTATG GCCGCTGGGG GATGAACTAC ATCTACGGCA CGTGGACCGC GCTGTGCTCG TTGAACGCCG CCGGCCTCGG CCACGACGAT CCGCGCGTGA AGCGCGCCGC GCAGTGGCTG CTGTCGATCC AGAACCCCGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTCGAC TACCGCGGCT ACGAGCGCGC GCCGAGCACG TCGTCGCAGA CCGCGTGGGC GCTGCTCGGC CTGATGGCGG CGGGCGAGGT CGATCATCCG GCCGTCGCGC GCGGCATCGA TCATCTGCTC GGCACGCAGC GCGAGCACGG CCTGTGGGAC GAGACGCGCT TTACCGCGAC GGGCTTCCCG CGCGTGTTCT ATCTGCGCTA TCACGGCTAC CGCAAGTTCT TCCCGCTGTG GGCGCTCGCC CGCTATCGCA ACCTGAAGCG CGCGAACGCG ACGCGCGTGA CGGTCGGGAT GTAA
|
Protein sequence | MHTLDATAAP AAPTVATGLD AAVARATDAL LAAQNADGHW VYELEADSTI PAEYVLLVHY LGEAPNVELE RKIARYLRRI QLPDGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ RARRAIHAMG GAEMSNVFTR IQLALYGVVP WYAVPMMPVE IMLLPQWFPF HLSKVSYWAR TVIVPLLVLN AKRPVAKNPR GVRIDELFKS APVNTGLLPK QPHQSAGWFA FFRAVDGVLR LTDGLFPRYT RERAIRQAVA FVDERLNGED GLGAIYPAMA NAVMMYAALG YPEDHPNRAI ARQSIEKLLV VGEDEAYCQP CLSPVWDTSL AAHALLETGD ERAREAAVRG LDWLVPRQIL DVRGDWISRR PHVRPGGWAF QYANAHYPDV DDTAVVAMAM DRVAKLDRTD AYRESIARAR EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLAQFGETSA SSEPARRALD YMLKEQEPDG SWYGRWGMNY IYGTWTALCS LNAAGLGHDD PRVKRAAQWL LSIQNPDGGW GEDGDSYKLD YRGYERAPST SSQTAWALLG LMAAGEVDHP AVARGIDHLL GTQREHGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRANA TRVTVGM
|
| |