Gene BamMC406_5865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBamMC406_5865 
Symbol 
ID6182764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia ambifaria MC40-6 
KingdomBacteria 
Replicon accessionNC_010557 
Strand
Start bp395877 
End bp397913 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content70% 
IMG OID641688997 
Productsqualene-hopene cyclase 
Protein accessionYP_001815856 
Protein GI172065144 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAC CCGCCCCCTC CCCCTGGTCC GCGCTCGACA CCGCGATCGC CCGTGGACGC 
GACGCGCTGA TGCGCCTTCA GCAGCCCGAC GGCAGCTGGT GTTTCGAACT CGAATCCGAC
GCGACGATCA CCGCGGAATA CATCCTGATG ATGCATTTCA TGGACAAGAT CGACGACGTG
CGCCAGGAGA AGATGGCGCG CTACCTGCGC GCGATCCAGC GGCTCGACAC GCACGGCGGG
TGGGACCTGT ACGTCGACGG CGACCCGGAC GTGTCGTGCA GCGTGAAGGC GTACTTCGCG
CTGAAGGCCG CCGGCGACAG CGAGCATGCG CCGCACATGG TCCGCGCGCG CGACGCGATC
CTCGCGCTCG GCGGCGCGGC ACGCTCGAAC GTGTTCACGC GGATCCTGCT CGCGACGTTC
GGCCAGGTGC CGTGGCGCGC GACGCCGTTC ATGCCGATCG AATTCGTGCT GTTCCCGAAG
TGGGTACCGA TCTCGATGTA CAAGGTCGCG TACTGGGCCC GCACGACGAT GGTGCCGCTG
CTCGTGCTGT GCTCGCTGAA AGCGCGTGCG CGCAATCCGC GCAACATCGC GATCCCCGAG
CTGTTCGTCA CCCCGCCCGA CGAGGAGCGT CACTACTTCC CGCCCGCGCG CGGGATGCGC
CGAGCGTTCC TCGCGCTCGA CCGCGTGGTG CGCCATGTCG AGCCGCTGCT GCCGAAACGC
CTGCGGCAGC GCGCGATCCG GCATGCGCAA GCGTGGTGCG CGGAGCGCAT GAACGGCGAA
GACGGCCTCG GCGGAATCTT TCCGCCGATC GTGTACAGCT ATCAGATGAT GGACGTGCTC
GGCTACCCGG ACGATCATCC GCGCCGCCGC GACTGCGAGA ACGCGCTGGA GAAGCTGCTG
GTCACGCGGA CGGACGGCAG CATGTACTGC CAGCCGTGCC TGTCGCCGGT ATGGGACACC
GCGTGGAGCA CGATGGCGCT CGAGCAGGCC CGTGCCGTGG CCGTGCCGGA AGCCGGCGCG
CGCGCGAGCG CACTGGACGA ACTCGACGCA CGCATCGCGC GCGCGTACGA CTGGCTGGCC
GAGCGCCAGG TGAACGACCT GCGCGGCGAC TGGATCGAGA ACGCGCCCGC CGATACGCAA
CCGGGCGGCT GGGCATTCCA GTACGCGAAC CCGTACTACC CCGACATCGA CGACACTGCG
GTCGTCACCG CGATGCTCGA CCGCCGCGGC CGCACGCATC GCAACGCGGA CGGCTCGCAT
CCGTATGCGG CGCGTGTCGC GCGCGCGCTC GACTGGATGC GCGGGCTGCA ATCGCGCAAC
GGCGGCTTCG CGGCCTTCGA CGCCGACTGC GACCGCATGT ACCTGAACGC GATTCCGTTC
GCCGATCACG GCGCGCTGCT CGATCCGCCG ACCGAGGACG TGTCGGGCCG CGTGCTGCTG
TGCTTCGGCG TCACGAAGCG CGCGGCCGAC CGCGCGTCGC TCGCGCGCGC GATCGACTAC
GTGAAGCGCA CGCAGCAGCC CGACGGCAGC TGGTGGGGCC GCTGGGGCAC GAACTACCTG
TACGGTACGT GGAGCGTGCT GGCCGGGCTC GCGCTTGCGG GCGAGGACCC GTCGCAGCCG
TACATCGCCC GCGCGCTCGC GTGGCTGCGC GCACGTCAGC ATGCGGACGG CGGCTGGGGC
GAGACGAACG ACAGCTACAT CGACCCTACG CTCGCCGGCA CCAATGCGGG CGAAAGCACG
TCGAACTGCA CCGCGTGGGC GCTGCTCGCG CAGATGGCGT TCGGCGACTG CGAATCGGAA
TCGGTCAGGC GCGGCATCGC GTATCTGCAA TCCGTGCAGC AGGACGACGG CTTCTGGTGG
CACCGCTCGC ACAACGCGCC GGGCTTTCCG CGCATCTTCT ACCTGAAGTA TCACGGCTAT
ACCGCGTACT TCCCGCTGTG GGCGCTTGCG CGCTATCGGC GGTTGGCTAG CGGCGTGTCG
TCGGCCGGCG TGCACGCGGT GCCCGCGTCC ACGGGCGCGG ACGCGGCGCT CGCCTGA
 
Protein sequence
MNKPAPSPWS ALDTAIARGR DALMRLQQPD GSWCFELESD ATITAEYILM MHFMDKIDDV 
RQEKMARYLR AIQRLDTHGG WDLYVDGDPD VSCSVKAYFA LKAAGDSEHA PHMVRARDAI
LALGGAARSN VFTRILLATF GQVPWRATPF MPIEFVLFPK WVPISMYKVA YWARTTMVPL
LVLCSLKARA RNPRNIAIPE LFVTPPDEER HYFPPARGMR RAFLALDRVV RHVEPLLPKR
LRQRAIRHAQ AWCAERMNGE DGLGGIFPPI VYSYQMMDVL GYPDDHPRRR DCENALEKLL
VTRTDGSMYC QPCLSPVWDT AWSTMALEQA RAVAVPEAGA RASALDELDA RIARAYDWLA
ERQVNDLRGD WIENAPADTQ PGGWAFQYAN PYYPDIDDTA VVTAMLDRRG RTHRNADGSH
PYAARVARAL DWMRGLQSRN GGFAAFDADC DRMYLNAIPF ADHGALLDPP TEDVSGRVLL
CFGVTKRAAD RASLARAIDY VKRTQQPDGS WWGRWGTNYL YGTWSVLAGL ALAGEDPSQP
YIARALAWLR ARQHADGGWG ETNDSYIDPT LAGTNAGEST SNCTAWALLA QMAFGDCESE
SVRRGIAYLQ SVQQDDGFWW HRSHNAPGFP RIFYLKYHGY TAYFPLWALA RYRRLASGVS
SAGVHAVPAS TGADAALA