Gene Bcep18194_B0019 details


Gene Information

Locus tag: Bcep18194_B0019
Symbol:
ID: 3751776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007511
Strand:
Start bp: 19213
End bp: 21186
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 68%
IMG OID: 637764865
Product: Terpene synthase/squalene cyclase
Protein accession: YP_370780
Protein GI: 78060872
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases
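
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive, 1-based coordinates (the record does not state the convention) and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 19213
end_bp = 21186
gene_length_bp = 1974
protein_length_aa = 657

# Assumption: start and end are inclusive, 1-based positions,
# so the span covers (end - start + 1) bases.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is a stop
# codon and is assumed not to count toward the protein length.
codons, remainder = divmod(gene_length_bp, 3)

print(span == gene_length_bp)           # True
print(remainder == 0)                   # True
print(codons - 1 == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1974 bases give 658 codons, of which 657 encode amino acids.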


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGATC TCACCGAAAT GGCTACCCTG TCCGCCGGCG CCGTGCCGGC CGGCGTCGAT 
ACGGCCGTCG CGCGTGCGAC CGACGCGCTG CTGGCCGCGC AGAACGCGGA TGGCCACTGG
GTCTACGAAC TCGAAGCCGA TTCGACGATT CCCGCCGAAT ACGTGCTGCT CGTCCACTAT
CTCGGCGAGA CGCCGAACCT CGAGCTCGAG CAGAAGATCG GCAAGTATCT GCGCCGCATC
CAGCAGGCCG ACGGCGGCTG GCCGCTGTTC ACCGACGGTG CGCCGAACAT CAGCGCGAGC
GTGAAGGCGT ATTTCGCGCT GAAGGTGATC GGCGACGACG AGAACGCCGA GCACATGCAG
CGCGCGCGCC GTGCGATCCA CGCGATGGGC GGCGCCGAGA TGTCGAACGT GTTCACGCGC
ATCCAGCTCG CGCTGTACGG TGCGATTCCG TGGCGCGCGG TGCCGATGAT GCCGGTCGAG
ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CATCTGTCGA AGGTGTCGTA CTGGGCGCGC
ACCGTGATCG TGCCGCTGCT CGTGCTGAAC GCGAAGCGCC CGCTCGCGAA GAACCCGCGC
GGCGTGCGCA TCGACGAACT GTTCATCGAT CCGCCCGTCA ACGCCGGGCT GCTGCCGCGC
CAGGGCCACC AGAGCGCCGG CTGGTTCGCG TTCTTCCGTG TGGTCGACCA TGCGCTGCGC
GCGGTCGACG GCCTGTTCCC GAACTACACG CGCGAACGCG CGATCCGCCA GGCCGTGTCG
TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGCCTCGGCG CGATCTATCC GGCGATGGCC
AACTCGGTGA TGATGTACGA CGTGCTCGGC TACGCGGAAG ATCATCCGAA TCGCGCAATC
GCGCGCAAGT CGATCGAGAA GCTGCTCGTC GTGCAGGAGG ACGAAGCGTA TTGCCAGCCC
TGCCTGTCGC CGGTGTGGGA CACGTCGCTC GCCGCGAATG CGCTGCTCGA AACGCGCGAC
GCGCGTGCCG AGGATGCGGC GATCCGCGGC CTCGAATGGC TGCGCCCGCT GCAGATCCTC
GACGTGCGCG GCGACTGGAT CTCGCGCCGC CCGCACGTGC GGCCCGGCGG CTGGGCGTTC
CAGTACGCGA ACGCGCACTA CCCTGACGTC GACGATACGG CGGTGGTCGC GGTGGCGATG
GAGCGCGCGC AGCAACTCAA GCAGAACGAT GCGTATCGCG ATTCGATCGC GCGTGCGCGC
GAGTGGGTCG TCGGGATGCA GAGCAGCGAC GGCGGCTGGG GTGCGTTCGA ACCGGAAAAC
ACGCAGTACT ACCTGAACAA CATCCCGTTC TCGGATCACG GCGCACTGCT CGATCCGCCG
ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTGTCGC AGCTCGGCGA GACGCCGCTG
AACAGCGAAC CGGCCCGTCG CGCACTCGAC TACATGCTCA AGGAACAGGA ACCGGACGGC
AGCTGGTATG GCCGCTGGGG GATGAACTAC GTGTACGGCA CGTGGACGGC ACTGTGCTCG
CTGAACGCGG CCGGCCTGAC GCCGGACGAT CCGCGCGTGA AGCGCGGCGC GCAGTGGCTG
CTGTCGATCC AGAACAAGGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTCAAC
TATCGCGGCT TCGAGCAGGC GCCGAGCACC GCGTCGCAAA CCGCGTGGGC GCTGCTCGGC
CTGATGGCGG CCGGCGAAGT GAACAACCCG GCGGTGGCGC GCGGCATCGA CTACCTGATC
GCCGAGCAGA ACGCAGAGGG CTTGTGGGAC GAAACGCGCT TCACGGCGAC GGGCTTCCCG
CGCGTGTTCT ACCTGCGCTA CCACGGCTAT CGCAAGTTCT TCCCGCTGTG GGCGCTCGCC
CGCTATCGCA ACCTGAAGCG CGACAACACG ACGCGCGTGA CGGTCGGGCT GTAA
 
Protein sequence
MNDLTEMATL SAGAVPAGVD TAVARATDAL LAAQNADGHW VYELEADSTI PAEYVLLVHY 
LGETPNLELE QKIGKYLRRI QQADGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ
RARRAIHAMG GAEMSNVFTR IQLALYGAIP WRAVPMMPVE IMLLPQWFPF HLSKVSYWAR
TVIVPLLVLN AKRPLAKNPR GVRIDELFID PPVNAGLLPR QGHQSAGWFA FFRVVDHALR
AVDGLFPNYT RERAIRQAVS FVDERLNGED GLGAIYPAMA NSVMMYDVLG YAEDHPNRAI
ARKSIEKLLV VQEDEAYCQP CLSPVWDTSL AANALLETRD ARAEDAAIRG LEWLRPLQIL
DVRGDWISRR PHVRPGGWAF QYANAHYPDV DDTAVVAVAM ERAQQLKQND AYRDSIARAR
EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLSQLGETPL
NSEPARRALD YMLKEQEPDG SWYGRWGMNY VYGTWTALCS LNAAGLTPDD PRVKRGAQWL
LSIQNKDGGW GEDGDSYKLN YRGFEQAPST ASQTAWALLG LMAAGEVNNP AVARGIDYLI
AEQNAEGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRDNT TRVTVGL
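
The gene and protein sequences above are displayed in blocks of ten characters. A minimal sketch of how one might strip that layout, compute a GC fraction, and translate the coding sequence is shown below. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The codon table is written out by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as start codons, which does not matter here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ..., GGG.
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used only for display."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their display spacing.
fragment = clean_sequence("ATGAACGATC TCACCGAAAT GGCTACCCTG")
print(translate(fragment))  # MNDLTEMATL
```

The printed peptide matches the first ten residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.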