Gene Bcep18194_C7519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_C7519 
Symbol 
ID3734951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007509 
Strand
Start bp1135811 
End bp1137847 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content68% 
IMG OID637761220 
ProductTerpene synthase/squalene cyclase 
Protein accessionYP_367207 
Protein GI78060632 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCC GCATGACTAC ACCCACCCCC TCCCCCTGGT CCGCGCTCGA CACCGCCATC 
GCCCGTGGAC GCGACGCCCT CGTGCGTCTC CAGCAGCCCG ACGGCAGCTG GTGTTTCGAA
CTCGAATCCG ACGCGACGAT CACCGCGGAA TACATCCTGA TGATGCATTT CATGGACAAG
ATCGACGACC TTCGCCAGGA GAAGATGGCG CGCTACCTGC GCGCGAACCA GCGGCTCGAC
ACGCATGGCG GGTGGGCGCT GTACGTCGAC GGCGATCCGG ACGTGTCGTG CAGCGTGAAG
GCGTACTTCG CGCTGAAGGC GGCCGGTGAC AGCGAACACG CGCCGCACAT GGTCCGCGCG
CGTGACGCGA TCCTGAAGCT CGGCGGCGCG GCCCGCGCGA ACGTATTCAC GCGCATCCTG
CTCGCGACGT TCGGCCAGGT ACCGTGGCGC GCGGCGCCGT TCATGCCGAT CGAATTCGTG
CTGTTCCCGA AGTGGGTGCC GATCTCGATG TACAAGGTCG CGTACTGGGC GCGCACGACG
ATGGTGCCGC TGCTCGTGCT GTGCTCGCTG AAAGCGCGCG CCCGCAATCC GCGCAACATC
TCGATCCGCG AGCTGTTCGT CACGCCGCCG GACGAGGAAC GCCAGTACTT CCCGCCCGCC
CGCGGCATGC GCAAGCTGTT CCTCGCGCTC GACCGCACGG TGCGCCACGT CGAGCCGCTG
ATGCCGAAGG GCCTGCGGCA GCGCGCGATC CGCCATGCCG AGGCGTGGTG CGCGGAGCGC
ATGAACGGCG AGGACGGGCT CGGCGGCATC TTCCCGCCGA TCGTGTACTG CTATCAGATG
ATGGAAGTGC TCGGCTACCC GGACGACCAT CCGCTGCGGC GCGATTGCGA GAACGCGCTG
GAGAAGTTGC TGGTCACGCG GCCGGACGGC AGCATGTATT GCCAGCCGTG CCTGTCGCCG
GTGTGGGATA CCGCGTGGAG CACGATGGCG CTCGAGCAGG CGCGCGGCGT GGCAGTGGCG
GAAGACGGCG AGCCGGGCGA CGCACGGCGC GCACTCGACG AACGCATCAC GCGCGCATAC
GACTGGCTGG CCGAACGCCA GGTGAACGAC CTGCGCGGCG ACTGGATCGA GAACGCGCCG
GCCGACGTCC AGCCGGGCGG ATGGGCGTTC CAGTACGCGA ACCCGTACTA CCCCGACATC
GACGACACGG CGGTCGTCAC CGCGATGCTC GATCGCCGCG GCCGCACGCA TGCCAACGCG
GACGGCACGA ACCCGTATGC GACGCGTGTC GCGCGCGCGC TCGACTGGAT GCGCGGGCTG
CAATCGCGCA ACGGCGGCTT CGGCGCATTC GACGCCGACT GCGACCGCCT GTACCTGAAC
GCGATTCCGT TCGCCGATCA CGGCGCACTG CTCGATCCGC CGACCGAGGA CGTGTCGGGC
CGCGTGCTGC TGTGCTTCGG CGTGACGAAA CGCGCGGACG AACACGCGTC GCTCGCGCGC
TGCATCGACT ACGTGAAGCG CACGCAGCAG CCCGACGGCA GTTGGTGGGG CCGCTGGGGC
ACGAACTACA TCTACGGTAC GTGGAGCGTG CTGGCCGGCC TCGCGCTCGC CGGCGAGGAC
AAGTCGCAGC CGTACATCGC CCGCGCGATC GAATGGCTGC GCGCACGGCA GCATGCGGAC
GGCGGCTGGG GCGAGACGAA CGACAGCTAT ATCGACCCGA AGCTGGGCGG CACCAATGGC
GGCGAAAGCA CGTCGAACTT CACCGCATGG GCGCTGCTCG CGCAGATGGC GTTCGGCGAC
TGCGAATCCG ATTCGGTGAA GCGCGGCATC GCGTATCTGC AGTCGGTGCA GCAGGAAGAC
GGCTTCTGGT GGCACCGCTC GCACAATGCG CCGGGCTTTC CGCGCATCTT CTACCTGAAG
TATCACGGCT ATACCGCGTA CTTCCCGCTG TGGGCGCTCG CGCGCTATCG GCGGCTGGCC
GGAGTGGCAA ACAAGCGCGT ATCGACTGCG GACAAGACAG CGGACGCAAT GGCGTAA
 
Protein sequence
MIRRMTTPTP SPWSALDTAI ARGRDALVRL QQPDGSWCFE LESDATITAE YILMMHFMDK 
IDDLRQEKMA RYLRANQRLD THGGWALYVD GDPDVSCSVK AYFALKAAGD SEHAPHMVRA
RDAILKLGGA ARANVFTRIL LATFGQVPWR AAPFMPIEFV LFPKWVPISM YKVAYWARTT
MVPLLVLCSL KARARNPRNI SIRELFVTPP DEERQYFPPA RGMRKLFLAL DRTVRHVEPL
MPKGLRQRAI RHAEAWCAER MNGEDGLGGI FPPIVYCYQM MEVLGYPDDH PLRRDCENAL
EKLLVTRPDG SMYCQPCLSP VWDTAWSTMA LEQARGVAVA EDGEPGDARR ALDERITRAY
DWLAERQVND LRGDWIENAP ADVQPGGWAF QYANPYYPDI DDTAVVTAML DRRGRTHANA
DGTNPYATRV ARALDWMRGL QSRNGGFGAF DADCDRLYLN AIPFADHGAL LDPPTEDVSG
RVLLCFGVTK RADEHASLAR CIDYVKRTQQ PDGSWWGRWG TNYIYGTWSV LAGLALAGED
KSQPYIARAI EWLRARQHAD GGWGETNDSY IDPKLGGTNG GESTSNFTAW ALLAQMAFGD
CESDSVKRGI AYLQSVQQED GFWWHRSHNA PGFPRIFYLK YHGYTAYFPL WALARYRRLA
GVANKRVSTA DKTADAMA