Gene Bphyt_7029 details

Gene Information

Locus tag: Bphyt_7029
Symbol:
ID: 6280355
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phytofirmans PsJN
Kingdom: Bacteria
Replicon accession: NC_010676
Strand:
Start bp: 3409083
End bp: 3411113
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 66%
IMG OID: 642618052
Product: squalene-hopene cyclase
Protein accession: YP_001890688
Protein GI: 187921656
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases
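
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. Both are assumptions that happen to reproduce the listed values, not something the record states, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(3409083, 3411113)
print(length)                     # 2031
print(protein_length_aa(length))  # 676
```
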


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00000008735
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACGATT TATCTCAAGC CCAGCCACTG GACGCCATCC TGCCCGATTT CGCCGACGCA 
GCGCCGAGCG CGCCCGCGCC GGCCGTTACG GGCGAGGCGC CCACGGCATC GCTCGACGCC
GCGATCACGC GCGCGACCGA GGCGATTCTC GCCGCGCAGA AGCCCGACGG CCACTGGGTC
TACGAACTCG AAGCCGACGC GACGATTCCC GCCGAATACG TGCTGCTGGT TCACTATCTC
GGCGAAACGC CGAACCTCGA ACTGGAACAG AAGATCGCGC GCTATCTGCG GCGCATTCAG
TTGCCCGACG GCGGCTGGCC GCTGTTCACC GACGGCGCGC TCGACATCAG CGCCAGCGTG
AAAGCGTATT TCGCGCTGAA GATGATCGGC GACCCGGCCG ACGCCGAGCA CATGGTTCGC
GCGCGCGAGG CGATTCTCGC CCACGGCGGC GCCGAAACCG TGAACGTCTT CACGCGGATT
CTGCTCGCGC TGTTCGGCGT GGTGTCGTGG CGCGCGGTGC CGATGATGCC CGTCGAGATC
ATGCTGTTGC CCATGTGGTT TCCGTTTCAT CTGTCGAAGG TGTCGTACTG GGCGCGTACC
GTGATCGTGC CGCTGCTCGT GCTCAATGCA AAGCGGCCGG TGGCGCGCAA CCCGCGCCGC
GTGCGCATCG ACGAACTGTT CCGCGGCGCG CCGGTCAACA CCGGCCCGCG CGACCGGGCG
CCGCATCAGC ACGCCGGCTG GTTCCGGTTT TTCAGCGGCG TGGACGTGTT GCTGCGCGCC
GTGGATGGTC TGTTCCCGAA GTCCACGCGC GAGCGCGCGG TGCGGCAGGC GGTGGCTTTC
GTCGATGAAC GGCTGAATGG CGAAGACGGC CTGGGCGCGA TTTTCCCCGC GATGGCGAAC
TCGGTGATGA TGTACGACGT GCTCGGTTAT CCGGCCGATC ATCCGAATCG CGCGATTGCT
CGCCAGTCGA TCGACAAGCT CCTCGTCATC AAGGATGACG AAGCATATTG CCAGCCGTGT
CTGTCGCCGG TGTGGGATAC GTCGCTCGCG GCTCATGCGT TGCTCGAAAC CGGCGAGGCG
CATGCCGAAC AGGCCGCCGA ACGCGGGCTC GCGTGGCTGC GTCCGCTGCA GATTCTCGAC
GTGCGCGGCG ACTGGATTTC GCGCCGTCCG AATGTGCGGC CGGGCGGCTG GGCGTTCCAG
TACAACAACG CGCATTATCC GGACGTCGAC GATACGGCGG TGGTCGCCAT GGCGATGCAG
CGCTCGGCAA CGGTGACGCA ATCCGATGTA GATCGCGACG CCATTGCGCG TGCGCGTGAA
TGGGTGGTCG GCATGCAGAG CAGCGACGGC GGCTGGGGCG CGTTCGAGCC GGAAAACACG
CAGTACTACC TGAACAACAT TCCGTTCTCC GATCACGGCG CCCTGCTCGA TCCGCCGACC
GCGGACGTCT CGGGCCGCTG CCTGTCGATG CTCGCGCAAC TCGGCGAACT GCCGCAGAAC
AGCGAGCCGG CGCAACGTGC GTTCGACTAC ATGCTGAAGG AGCAGGAATC GGATGGCAGC
TGGTACGGCC GCTGGGGCTT GAACTACATC TATGGCACGT GGACCGCGCT GTGTTCGCTG
AACGCTGCCG GCCTGCCACA CGACGACCCG CGCATGAAGC GCGCAGCGCA GTGGCTCCTG
TCGATCCAGA ACGAAGACGG CGGCTGGGGC GAGGGCGGCG AGAGCTACAA GCTCGACTAC
CACGGCTACG AGCGTGCGCC GAGCACGGCT TCGCAAACCG CGTGGGCTCT CATGGGCCTG
ATGGCGGCCG GTGAGGTCAA TCACGAAGCG GTGGCGCGCG GCGTCGCGTA CCTGGAGCGT
GAACAGCGTG AGCACGGTCT CTGGGACGAA ACACGTTTCA CCGCGACCGG TTTCCCGCGT
GTGTTCTATC TGCGTTATCA CGGCTATCGC AAGTTCTTCC CGCTGTGGGC GCTGGCGCGC
TTCCGTCATC TGAAGCGCAA CGGCCTCACG CGCGTCGCGG TCGGGATGTA A
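
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is measured. The following is a minimal sketch of how a GC percentage such as the one listed in the gene information could be recomputed from the pasted text; the helper names are hypothetical, and how the database rounds the figure is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full gene sequence, blocks and line breaks included.
# gene_text = """ATGAACGATT TATCTCAAGC ..."""
# print(len(clean_sequence(gene_text)), round(gc_percent(gene_text)))
```
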
 
Protein sequence
MNDLSQAQPL DAILPDFADA APSAPAPAVT GEAPTASLDA AITRATEAIL AAQKPDGHWV 
YELEADATIP AEYVLLVHYL GETPNLELEQ KIARYLRRIQ LPDGGWPLFT DGALDISASV
KAYFALKMIG DPADAEHMVR AREAILAHGG AETVNVFTRI LLALFGVVSW RAVPMMPVEI
MLLPMWFPFH LSKVSYWART VIVPLLVLNA KRPVARNPRR VRIDELFRGA PVNTGPRDRA
PHQHAGWFRF FSGVDVLLRA VDGLFPKSTR ERAVRQAVAF VDERLNGEDG LGAIFPAMAN
SVMMYDVLGY PADHPNRAIA RQSIDKLLVI KDDEAYCQPC LSPVWDTSLA AHALLETGEA
HAEQAAERGL AWLRPLQILD VRGDWISRRP NVRPGGWAFQ YNNAHYPDVD DTAVVAMAMQ
RSATVTQSDV DRDAIARARE WVVGMQSSDG GWGAFEPENT QYYLNNIPFS DHGALLDPPT
ADVSGRCLSM LAQLGELPQN SEPAQRAFDY MLKEQESDGS WYGRWGLNYI YGTWTALCSL
NAAGLPHDDP RMKRAAQWLL SIQNEDGGWG EGGESYKLDY HGYERAPSTA SQTAWALMGL
MAAGEVNHEA VARGVAYLER EQREHGLWDE TRFTATGFPR VFYLRYHGYR KFFPLWALAR
FRHLKRNGLT RVAVGM
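
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a rough sketch, the code below builds the codon table from the standard amino acid string (table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits) and translates until the first stop codon. The function name and the choice to mark unrecognised codons as "X" are assumptions made for illustration.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Codons enumerated in T, C, A, G order, matching the amino acid string above.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# The first thirty bases of the gene give the first ten residues of the protein.
print(translate("ATGAACGATT TATCTCAAGC CCAGCCACTG"))  # MNDLSQAQPL
```
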