Gene Bamb_6104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBamb_6104 
Symbol 
ID4315008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia ambifaria AMMD 
KingdomBacteria 
Replicon accessionNC_008392 
Strand
Start bp662542 
End bp664590 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content70% 
IMG OID638153948 
Productsqualene-hopene cyclase 
Protein accessionYP_777982 
Protein GI115360845 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.640777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.788384 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCC GCATGAACAA ATCCGGCCCC TCCCCTTGGT CCGCGCTCGA CGCCGCGATC 
GCCCGCGGAC GCGACGCGCT GATGCGCCTT CAGCAGCCTG ACGGCAGCTG GTGTTTCGAA
CTCGAATCCG ACGCGACGAT CACCGCGGAA TACATCCTGA TGATGCATTT CATGGACAAG
ATCGACGACG CGCGCCAGGA GAAGATGGCG CGCTACCTGC GCGCGATCCA GCGGCTCGAC
ACGCACGGCG GGTGGGACCT GTATGTCGAC GGCGACCCGG ACGTGTCGTG CAGCGTGAAG
GCGTACTTCG CGCTGAAGGC CGCCGGCGAC AGCGAGCATG CGCCGCACAT GGTTCGCGCG
CGCGACGCGA TCCTCGAGCT CGGCGGCGCG GCACGCTCGA ACGTGTTCAC GCGCATCCTG
CTCGCGACGT TCGGCCAGGT GCCGTGGCGC GCGACGCCGT TCATGCCGAT CGAATTCGTG
CTGTTTCCGA AGTGGGTGCC GATCTCGATG TACAAGGTCG CGTACTGGGC CCGCACGACG
ATGGTGCCGC TGCTCGTGCT GTGCTCGCTG AAAGCGCGTG CGCGCAACCC GCGCAACATC
GCGATTCCCG AGCTGTTCGT CACGCCGCCC GACCAGGAAC GCCAGTACTT CCCGCCCGCG
CGCGGGATGC GCCGCGCATT CCTCGCGCTC GACCGCGTGG TGCGCCATGT CGAGCCGCTG
CTGCCGAAAC GCCTGCGGCA GCGCGCGATC CGGCATGCGC AAGCATGGTG CGCGGAGCGC
ATGAACGGCG AGGACGGCCT CGGCGGGATC TTTCCGCCGA TCGTGTACAG CTATCAGATG
ATGGACGTGC TCGGCTACCC GGACGATCAT CCGCTGCGCC GCGACTGCGA GAACGCGCTG
GAGAAGCTGC TGGTCACGCG GCCCGACGGC AGCATGTACT GCCAGCCGTG CCTGTCGCCG
GTGTGGGACA CCGCGTGGAG CACGATGGCG CTCGAGCAGG CGCGCGGCGT GGCCGTGCCG
GAAGCCGGCG CGCCCGCGAG CGCACTGGAC GAACTCGACG CACGCATCGC CCGCGCGTAC
GACTGGCTGG CCGAGCGCCA GGTGAACGAC CTGCGCGGCG ACTGGATCGA GAACGCGCCC
GCCGACACGC AACCGGGCGG CTGGGCGTTC CAGTACGCGA ACCCGTACTA CCCCGACATC
GACGACAGCG CGGTCGTCAC CGCGATGCTC GACCGCCGCG GCCGCACGCA TCGCAACGCG
GACGGCTCGC ATCCGTATGC GGCGCGCGTC GCGCGCGCGC TCGACTGGAT GCGCGGGCTG
CAATCGCGCA ACGGCGGCTT CGCGGCCTTC GACGCCGACT GCGACCGCCT GTACCTGAAC
GCGATTCCGT TCGCCGATCA CGGCGCGCTG CTCGATCCGC CGACCGAGGA CGTGTCGGGC
CGCGTGCTGC TGTGCTTCGG CGTCACGAAG CGCGCGGACG ACCGCGCGTC GCTCGCGCGC
GCGATCGACT ACGTGAAGCG CACGCAGCAG CCCGACGGCA GCTGGTGGGG CCGCTGGGGC
ACGAACTACC TGTACGGCAC GTGGAGCGTG CTGGCCGGGC TCGCGCTCGC GGGCGAGGAC
CCGTCGCAGC CGTACATCGC CCGCGCGCTC GCGTGGCTGC GCGCCCGTCA GCACGCGGAC
GGCGGCTGGG GCGAGACGAA CGACAGCTAC ATCGACCCGG CGCTCGCCGG CACCAATGCG
GGCGAAAGCA CGTCGAACTG CACCGCGTGG GCGCTGCTCG CGCAGATGGC GTTCGGCGAC
GGCGAATCGG AATCGGTCAG GCGCGGCATC GCGTATCTGC AATCCGTGCA GCAGGACGAC
GGCTTCTGGT GGCACCGGTC GCACAACGCG CCGGGCTTTC CGCGCATCTT CTACCTGAAG
TATCACGGCT ACACGGCGTA CTTCCCGCTG TGGGCGCTCG CGCGCTATCG GCGGTTGGCT
GGCGGCGTGT CGGCAGCGGG CGCGCACGCG GTGCCGGCGT CCACGGGCGC GGACGCCGCG
CTCGCCTGA
 
Protein sequence
MIRRMNKSGP SPWSALDAAI ARGRDALMRL QQPDGSWCFE LESDATITAE YILMMHFMDK 
IDDARQEKMA RYLRAIQRLD THGGWDLYVD GDPDVSCSVK AYFALKAAGD SEHAPHMVRA
RDAILELGGA ARSNVFTRIL LATFGQVPWR ATPFMPIEFV LFPKWVPISM YKVAYWARTT
MVPLLVLCSL KARARNPRNI AIPELFVTPP DQERQYFPPA RGMRRAFLAL DRVVRHVEPL
LPKRLRQRAI RHAQAWCAER MNGEDGLGGI FPPIVYSYQM MDVLGYPDDH PLRRDCENAL
EKLLVTRPDG SMYCQPCLSP VWDTAWSTMA LEQARGVAVP EAGAPASALD ELDARIARAY
DWLAERQVND LRGDWIENAP ADTQPGGWAF QYANPYYPDI DDSAVVTAML DRRGRTHRNA
DGSHPYAARV ARALDWMRGL QSRNGGFAAF DADCDRLYLN AIPFADHGAL LDPPTEDVSG
RVLLCFGVTK RADDRASLAR AIDYVKRTQQ PDGSWWGRWG TNYLYGTWSV LAGLALAGED
PSQPYIARAL AWLRARQHAD GGWGETNDSY IDPALAGTNA GESTSNCTAW ALLAQMAFGD
GESESVRRGI AYLQSVQQDD GFWWHRSHNA PGFPRIFYLK YHGYTAYFPL WALARYRRLA
GGVSAAGAHA VPASTGADAA LA