Gene Bcen2424_5679 details

Gene Information

Locus tag: Bcen2424_5679
Symbol:
ID: 4452350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia cenocepacia HI2424
Kingdom: Bacteria
Replicon accession: NC_008543
Strand:
Start bp: 2800595
End bp: 2802568
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 68%
IMG OID: 639697740
Product: squalene-hopene cyclase
Protein accession: YP_839305
Protein GI: 116693772
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
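
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the record does not state either explicitly). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2800595, 2802568)
print(length, protein_length_aa(length))  # 1974 657
```

Both results match the Gene Length (1974 bp) and Protein Length (657 aa) fields.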


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.126345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00244558
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACGATC TCACCGAAAT GGCTACCTTG TCCGCCGGCG CCGTGCCGGC CGGCGTCGAC 
GCGGCTGTCG CGCGTGCGAC CGACGCGCTG CTCGCCGCGC AGCAAGCGGA TGGCCACTGG
GTCTACGAAC TCGAGGCGGA TTCGACGATT CCCGCCGAAT ACGTGCTGCT CGTCCACTAT
CTTGGCGAGA CGCCGAACCT CGAGCTCGAA CAGAAGATCG GCAAGTATCT GCGCCGCATC
CAGCAGGCCG ACGGCGGCTG GCCGCTGTTC ACCGACGGCG CGCCGAACAT CAGCGCGAGC
GTGAAGGCGT ATTTCGCGCT GAAGGTGATC GGCGACGACG AGAACGCCGA GCACATGCAG
CGCGCGCGCC GTGCCATCCA TGCGATGGGC GGCGCCGAGA TGTCGAACGT GTTCACGCGC
ATCCAGCTCG CGCTGTACGG TGCGATTCCG TGGCGCGCGG TGCCGATGAT GCCGGTCGAG
ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CACCTGTCGA AGGTGTCGTA CTGGGCGCGC
ACCGTGATCG TGCCGCTGCT CGTGCTGAAC GCGAAGCGTC CGCTCGCGAA GAACCCGCGC
GGCGTGCGCA TCGACGAGCT GTTCATCGAT CCGCCCGTCA ACGCCGGGCT GCTGCCGCGC
CAGGGCCATC AGAGCGCGGG CTGGTTCGCG TTTTTCCGCG TGGTCGACCA TGCGCTGCGC
GCGGTCGACG GCCTGTTCCC GAGCTATACG CGCGAACGCG CGATCCGTCA GGCCGTGTCG
TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGCCTCGGCG CGATCTATCC GGCGATGGCC
AACGCGGTGA TGATGTACGA CGTGCTCGGC TATGCGGAAG ATCATCCGAA CCGTGCGATC
GCGCGCAAGG CGCTCGAGAA GCTGCTCGTC GTGCACGACG ACGAGGCGTA TTGCCAGCCG
TGCCTGTCGC CCGTGTGGGA TACGTCGCTC GTCGCGCATG CGCTGCTCGA GACCGGCGAT
GCGCGTGCCG AGGAAGCCGT GCTGCGCGGC CTCGAATGGC TGCGCCCGCT GCAGATCCTC
GACGTGCGCG GCGACTGGAT CTCGCGCCGC CCGAACGTGC GGCCCGGCGG CTGGGCGTTC
CAGTACGCGA ACGCGCACTA CCCTGACGTC GACGATACGG CCGTGGTCGT GATGGCGATG
GATCGCGCGC AAAAGCTCAA GCAATCGGAC ACGTATCGCG AATCGATGGC GCGGGCGCGC
GAATGGGTCG TCGGCATGCA GAGCAGCGAC GGCGGCTGGG GCGCGTTCGA ACCGGAAAAC
ACGCAGTACT ACCTGAACAA CATCCCGTTC TCCGATCACG GCGCGCTGCT CGATCCGCCG
ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTGTCGC AGCTCGGCGA GACGCCGCTG
AACAGCGAGC CGGCCCGCCG CGCGCTCGAC TACATGCTGA AGGAACAGGA GCCGGACGGC
AGCTGGTACG GCCGTTGGGG GATGAACTAC GTGTACGGCA CGTGGACGGC GCTGTGCTCG
CTGAATGCGG CCGGCCTGAC GCCGGACGAC CCGCGCATGA AGCGCGCCGC GCAGTGGCTG
CTGTCGATCC AGAACAAGGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTGAAC
TACCGCGGTT ACGAGCAGGC GCCGAGCACG GCGTCGCAGA CGGCCTGGGC GCTGCTCGGC
CTGATGGCGG CCGGCGAAGT GAACAACCCG GCCGTGGCGC GCGGCGTCGA CTACCTCGTC
GCTCAGCAGA ACGAAGAAGG GCTGTGGGAC GAGACGCGCT TCACGGCAAC GGGCTTCCCG
CGCGTGTTCT ACCTGCGCTA CCACGGTTAT CGCAAGTTCT TCCCGCTGTG GGCGCTGGCG
CGCTACCGCA ACCTGAAGCG CGCGAACGCG ACGCGCGTGA CGGTCGGGAT GTAA
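
The gene sequence is printed in blocks of ten bases, so the whitespace has to be stripped before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction. It is a hedged illustration: the helper names are hypothetical, and only the first line of the sequence is used as input to keep the example short, so its GC value is not the whole-gene figure reported in the GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 60 bases of the gene sequence, copied from the listing.
first_line = "ATGAACGATC TCACCGAAAT GGCTACCTTG TCCGCCGGCG CCGTGCCGGC CGGCGTCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq) * 100, 1))  # 60 66.7
```

Passing the full 1974-base listing through the same two functions gives the whole-gene value to compare against the record.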
 
Protein sequence
MNDLTEMATL SAGAVPAGVD AAVARATDAL LAAQQADGHW VYELEADSTI PAEYVLLVHY 
LGETPNLELE QKIGKYLRRI QQADGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ
RARRAIHAMG GAEMSNVFTR IQLALYGAIP WRAVPMMPVE IMLLPQWFPF HLSKVSYWAR
TVIVPLLVLN AKRPLAKNPR GVRIDELFID PPVNAGLLPR QGHQSAGWFA FFRVVDHALR
AVDGLFPSYT RERAIRQAVS FVDERLNGED GLGAIYPAMA NAVMMYDVLG YAEDHPNRAI
ARKALEKLLV VHDDEAYCQP CLSPVWDTSL VAHALLETGD ARAEEAVLRG LEWLRPLQIL
DVRGDWISRR PNVRPGGWAF QYANAHYPDV DDTAVVVMAM DRAQKLKQSD TYRESMARAR
EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLSQLGETPL
NSEPARRALD YMLKEQEPDG SWYGRWGMNY VYGTWTALCS LNAAGLTPDD PRMKRAAQWL
LSIQNKDGGW GEDGDSYKLN YRGYEQAPST ASQTAWALLG LMAAGEVNNP AVARGVDYLV
AQQNEEGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRANA TRVTVGM
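
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own procedure. It assumes the sequence is given in the coding orientation and uses the standard codon assignments, which translation table 11 shares for elongation codons. Table 11 differs mainly in allowing alternative start codons, which this sketch does not handle (this gene starts with ATG, so it does not matter here). The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the ten-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the listing.
first_line = "ATGAACGATC TCACCGAAAT GGCTACCTTG TCCGCCGGCG CCGTGCCGGC CGGCGTCGAC"
print(translate(first_line))  # MNDLTEMATLSAGAVPAGVD
```

The output matches the first 20 residues of the protein sequence. Translating the last line of the gene sequence the same way gives RYRNLKRANATRVTVGM, the end of the protein, with the final TAA read as the stop codon.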