Gene BTH_II2359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II2359 
Symbolshc 
ID3846364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp2896143 
End bp2898116 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content69% 
IMG OID637839659 
Productsqualene-hopene cyclase 
Protein accessionYP_440546 
Protein GI83716953 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACACGC TCGACGCAAC CGCTGCGCCC GCCGCGCCCA CCGTGGCGAC CGGCCTCGAC 
GCCGCCGTCG CGCGCGCGAC CGACGCGCTG CTCGCCGCGC AGAACGCGGA CGGCCACTGG
GTCTACGAGC TCGAAGCCGA TTCGACGATT CCCGCCGAAT ACGTGCTGCT CGTCCACTAT
CTCGGCGAGG CGCCGAACGT CGAGCTCGAA CGGAAGATCG CGCGCTATCT GCGCCGCATC
CAGCTGCCGG ACGGCGGCTG GCCGCTCTTC ACCGACGGCG CGCCGAACAT CAGCGCGAGC
GTGAAGGCGT ACTTCGCGCT GAAGGTGATC GGCGACGACG AGAACGCCGA GCACATGCAG
CGCGCGCGCC GCGCGATCCA CGCGATGGGC GGCGCGGAAA TGTCGAACGT GTTCACGCGA
ATCCAGCTCG CGCTGTACGG CGTCGTGCCG TGGTACGCGG TGCCGATGAT GCCGGTCGAG
ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CATCTGTCGA AAGTGTCGTA CTGGGCGCGC
ACCGTGATCG TGCCGCTCCT CGTGCTGAAC GCGAAGCGCC CGGTCGCGAA GAATCCGCGC
GGCGTGCGCA TCGACGAGCT GTTCAAGAGC GCGCCCGTCA ACACCGGTCT GCTGCCGAAG
CAGCCGCACC AGAGCGCCGG CTGGTTTGCG TTCTTCCGCG CGGTGGACGG CGTGCTGCGC
CTCACCGACG GCCTGTTCCC TCGCTATACG CGCGAGCGTG CGATCCGCCA GGCGGTTGCG
TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGGCTCGGCG CGATCTATCC CGCCATGGCG
AACGCGGTGA TGATGTACGC GGCGCTCGGC TATCCCGAGG ATCATCCGAA CCGCGCGATC
GCGCGGCAGT CGATCGAGAA GCTGCTCGTC GTCGGCGAGG ACGAGGCGTA TTGCCAGCCG
TGCCTGTCGC CGGTGTGGGA TACGTCGCTC GCCGCGCATG CGCTGCTCGA AACGGGCGAC
GAGCGCGCGC GCGAAGCAGC CGTGCGCGGC CTCGACTGGC TCGTGCCGCG GCAGATCCTC
GACGTGCGCG GCGACTGGAT CTCGCGCCGT CCGCACGTGC GCCCGGGCGG CTGGGCGTTC
CAGTACGCGA ATGCGCACTA TCCGGACGTC GACGATACGG CGGTCGTCGC GATGGCGATG
GATCGCGTCG CGAAGCTCGA TCGGACGGAC GCGTACCGCG AGTCGATCGC ACGCGCGCGC
GAGTGGGTGG TCGGCATGCA GAGCAGCGAC GGCGGCTGGG GCGCGTTCGA GCCGGAAAAC
ACGCAGTACT ACCTGAACAA CATCCCGTTC TCCGATCACG GCGCGCTGCT CGATCCGCCG
ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTCGCGC AGTTCGGCGA GACGAGCGCG
TCGAGCGAGC CCGCGCGCCG CGCGCTCGAT TACATGCTGA AAGAGCAGGA GCCGGACGGC
AGCTGGTATG GCCGCTGGGG GATGAACTAC ATCTACGGCA CGTGGACCGC GCTGTGCTCG
TTGAACGCCG CCGGCCTCGG CCACGACGAT CCGCGCGTGA AGCGCGCCGC GCAGTGGCTG
CTGTCGATCC AGAACCCCGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTCGAC
TACCGCGGCT ACGAGCGCGC GCCGAGCACG TCGTCGCAGA CCGCGTGGGC GCTGCTCGGC
CTGATGGCGG CGGGCGAGGT CGATCATCCG GCCGTCGCGC GCGGCATCGA TCATCTGCTC
GGCACGCAGC GCGAGCACGG CCTGTGGGAC GAGACGCGCT TTACCGCGAC GGGCTTCCCG
CGCGTGTTCT ATCTGCGCTA TCACGGCTAC CGCAAGTTCT TCCCGCTGTG GGCGCTCGCC
CGCTATCGCA ACCTGAAGCG CGCGAACGCG ACGCGCGTGA CGGTCGGGAT GTAA
 
Protein sequence
MHTLDATAAP AAPTVATGLD AAVARATDAL LAAQNADGHW VYELEADSTI PAEYVLLVHY 
LGEAPNVELE RKIARYLRRI QLPDGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ
RARRAIHAMG GAEMSNVFTR IQLALYGVVP WYAVPMMPVE IMLLPQWFPF HLSKVSYWAR
TVIVPLLVLN AKRPVAKNPR GVRIDELFKS APVNTGLLPK QPHQSAGWFA FFRAVDGVLR
LTDGLFPRYT RERAIRQAVA FVDERLNGED GLGAIYPAMA NAVMMYAALG YPEDHPNRAI
ARQSIEKLLV VGEDEAYCQP CLSPVWDTSL AAHALLETGD ERAREAAVRG LDWLVPRQIL
DVRGDWISRR PHVRPGGWAF QYANAHYPDV DDTAVVAMAM DRVAKLDRTD AYRESIARAR
EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLAQFGETSA
SSEPARRALD YMLKEQEPDG SWYGRWGMNY IYGTWTALCS LNAAGLGHDD PRVKRAAQWL
LSIQNPDGGW GEDGDSYKLD YRGYERAPST SSQTAWALLG LMAAGEVDHP AVARGIDHLL
GTQREHGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRANA TRVTVGM