Gene BURPS668_A3275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A3275 
Symbolshc 
ID4888521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp3108747 
End bp3110720 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content70% 
IMG OID640133211 
Productsqualene-hopene cyclase 
Protein accessionYP_001064266 
Protein GI126442500 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.140167 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACA TGACCGAAAT GCATACGCTC GACGCAACCG CCGCGCCGGC CGGCCTCGAC 
GCCGCCGTCG CGCGCGCGAC CGACGCGCTG CTCGCCGCGC AGCAAGCGGA CGGCCACTGG
GTCTACGAGC TCGAAGCCGA TTCGACGATC CCGGCCGAAT ACGTGCTGCT CGTCCACTAT
CTCGGCGAGG CGCCGAATGT CGAGCTCGAG CAGAAGATCG CGCGCTATCT GCGCCGGATT
CAGCAGCCGG ACGGCGGCTG GCCGCTCTTC ACCGACGGTG CGCCGAACAT TAGCGCGAGC
GTGAAGGCGT ACTTCGCGCT GAAGGTGATC GGCGACGACG AGAACGCCGA GCACATGCAG
CGCGCGCGCC GCGCGATCCA CGCGATGGGC GGCGCGGAGA TGTCGAACGT GTTCACGCGG
ATTCAGCTCG CGCTGTACGG CGTCGTGCCG TGGTACGCGG TGCCGATGAT GCCGGTCGAG
ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CATCTGTCGA AGGTGTCGTA CTGGGCGCGC
ACCGTGATCG TGCCGCTGCT CGTGCTGAAC GCGAAGCGCC CGGTCGCGAA GAATCCGCGC
GGCGTGCGCA TCGACGAGCT GTTCAAGGGC GCACCCGTCA GCACCGGCCT GCTGCCGAAG
CAGCCGCACC AGAGCGCCGG CTGGTTTGCG TTCTTCCGCG CGGTCGACGG GGTGCTGCGT
CTCGTCGACG GCCTCTTCCC GCGCTATACG CGCGAGCGCG CGATCCGCCA GGCGGTCGCG
TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGGCTCGGCG CGATCTATCC CGCGATGGCC
AACGCGGTGA TGATGTACGC GGCGCTCGGC TATCCCGAAG ATCATCCGAA CCGCGCGATC
GCGCGCCGCT CGATCGAGAA GCTGCTCGTC GTCGGCGAGC AAGAGGCGTA TTGCCAGCCG
TGCCTGTCGC CGGTATGGGA CACGTCGCTT GCCGCGCACG CGCTGCTCGA GACGGGCGAC
GCGCGCGCGC GCGAAGCGGC GGTGCGCGGC CTCGACTGGC TCGTGCCGCG GCAGATCCTC
GACGTGCGCG GCGACTGGAT CTCGCGCCGT CCGCACGTGC GCCCCGGCGG CTGGGCGTTC
CAGTACGCGA ATGCGCACTA TCCGGACGTC GACGACACGG CGGTCGTCGC GATGGCGATG
GACCGCGTCG CGAAGCTCGA CCGGACCGAC GCGTATCGCG AGTCGATCGC GCGCGCGCGC
GAGTGGGTTG TCGGCATGCA GAGCAGCGAC GGCGGCTGGG GCGCGTTCGA GCCGGAAAAC
ACGCAGTACT ACCTGAACAA CATTCCGTTC TCCGATCACG GCGCGCTGCT CGATCCGCCG
ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTCGCGC AGTTCGGCGA GACGAGCGCG
TCGAGCGAGC CCGCGCGCCG CGCGCTCGAC TACATGCTCA AGGAGCAGGA GCCGGACGGC
AGTTGGTACG GCCGCTGGGG GATGAACTAC ATCTACGGCA CGTGGACCGC GCTGTGCTCG
CTGAACGCGG CGGGCCTCGG CCACGACGAT CCGCGCGTGA AGCGCGCCGC GCAATGGCTG
CTGTCGATCC AGAACGCCGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTCGAC
TACCGCGGCT ACGAGCGCGC GCCGAGCACG TCGTCGCAGA CCGCGTGGGC GCTGCTCGGC
CTGATGGCGG CGGGCGAAGT CGACAATCCC GCCGTCGCGC GCGGCGTCGA TTACCTGCTC
GGCACGCAGC GCGAGCACGG CCTGTGGGAC GAGACGCGCT TCACCGCGAC GGGCTTCCCG
CGTGTGTTCT ATCTGCGCTA CCACGGCTAC CGCAAGTTCT TCCCGCTGTG GGCGCTCGCC
CGCTATCGCA ACCTGAAGCG CGCGAACGCG ACGCGCGTGA CGGTCGGGAT GTAA
 
Protein sequence
MNDMTEMHTL DATAAPAGLD AAVARATDAL LAAQQADGHW VYELEADSTI PAEYVLLVHY 
LGEAPNVELE QKIARYLRRI QQPDGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ
RARRAIHAMG GAEMSNVFTR IQLALYGVVP WYAVPMMPVE IMLLPQWFPF HLSKVSYWAR
TVIVPLLVLN AKRPVAKNPR GVRIDELFKG APVSTGLLPK QPHQSAGWFA FFRAVDGVLR
LVDGLFPRYT RERAIRQAVA FVDERLNGED GLGAIYPAMA NAVMMYAALG YPEDHPNRAI
ARRSIEKLLV VGEQEAYCQP CLSPVWDTSL AAHALLETGD ARAREAAVRG LDWLVPRQIL
DVRGDWISRR PHVRPGGWAF QYANAHYPDV DDTAVVAMAM DRVAKLDRTD AYRESIARAR
EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLAQFGETSA
SSEPARRALD YMLKEQEPDG SWYGRWGMNY IYGTWTALCS LNAAGLGHDD PRVKRAAQWL
LSIQNADGGW GEDGDSYKLD YRGYERAPST SSQTAWALLG LMAAGEVDNP AVARGVDYLL
GTQREHGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRANA TRVTVGM