Gene BMASAVP1_1129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_1129 
Symbolshc 
ID4676950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008784 
Strand
Start bp1146659 
End bp1148632 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content69% 
IMG OID639843648 
Productsqualene-hopene cyclase 
Protein accessionYP_990728 
Protein GI121597515 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACA TGACCGAAAT GCATACGCTC GACGCAACCG CCGCGCCGGC CGGCCTCGAC 
GCCGCCGTCG CGCGCGCGAC CGACGCGCTG CTCGCCGCGC AGCAAGCGGA CGGCCACTGG
GTCTACGAGC TCGAAGCCGA TTCGACGATC CCGGCCGAAT ACGTGCTGCT CGTCCACTAT
CTCGGCGAGG CGCCGAATGT CGAGCTCGAG CAGAAGATCG CGCGCTATCT GCGCCGGATT
CAGCAGCCGG ACGGCGGCTG GCCGCTCTTC ACCGACGGTG CGCCGAACAT TAGCGCGAGC
GTGAAGGCGT ACTTCGCGCT GAAGGTGATC GGCGACGACG AGAACGCCGA GCACATGCAG
CGCGCGCGCC GCGCGATCCA CGCGATGGGC GGCGCGGAGA TGTCGAACGT GTTCACGCGG
ATTCAGCTCG CGCTGTACGG CGTCGTGCCG TGGTACGCGG TGCCGATGAT GCCGGTCGAG
ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CATCTATCGA AGGTGTCGTA CTGGGCGCGC
ACCGTGATCG TGCCGCTGCT CGTGCTGAAC GCGAAGCGCC CGGTCGCGAA GAATCCGCGC
GGCGTGCGCA TCGACGAGCT GTTCAAGGGC GCACCCGTCA GCACCGGCCT GCTGCCGAAG
CAGCCGCACC AGAGCGCCGG CTGGTTTGCG TTCTTCCGCG CGGTCGACGG GGTGCTGCGT
CTCGTCGACG GCCTCTTCCC GCGCTATACG CGCGAGCGCG CGATCCGCCA GGCGGTCGCG
TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGGCTCGGCG CGATCTATCC CGCGATGGCC
AACGCGGTGA TGATGTACGC GGCGCTCGGC TATCCCGAAG ATCATCCGAA CCGCGCGATC
GCGCGCCGCT CGATCGAGAA GCTGCTCGTC GTCGGCGAGC AAGAGGCGTA TTGCCAGCCG
TGCCTGTCGC CGGTATGGGA CACGTCGCTT GCCGCGCATG CGCTGCTCGA GACGGGCGAC
GCGCGCGCGC GCGAAGCGGC GGTGCGCGGC CTCGACTGGC TCGTGCCGCG GCAGATCCTC
GACGTGCGCG GCGACTGGAT CTCGCGCCGT CCGCACGTGC GCCCCGGCGG CTGGGCGTTC
CAGTACGCGA ATGCGCACTA TCCGGACGTC GACGACACGG CGGTCGTCGC GATGGCGATG
GACCGCGTCG CGAAGCTCGA CCGGACCGAC GCGTATCGCG AGTCGATCGC GCGCGCGCGC
GAGTGGGTTG TCGGCATGCA GAGCAGCGAC GGCGGCTGGG GCGCGTTCGA GCCGGAAAAC
ACGCAGTACT ACCTGAACAA CATTCCGTTC TCCGATCACG GCGCGCTGCT CGATCCGCCG
ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTCGCGC AGTTCGGCGA GACGAGCGCG
TCGAGCGAGC CCGCGCGCCG CGCGCTCGAC TACATGCTCA AGGAGCAGGA GCCGGACGGC
AGCTGGTACG GCCGCTGGGG GATGAACTAC ATCTACGGCA CGTGGACCGC GCTGTGCTCG
CTGAACGCGG CGGGCCTCGG CCACGACGAT CCGCGCGTGA AGCGCGCCGC GCAATGGCTG
CTGTCGATCC AGAACGCCGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTCGAC
TACCGCGGCT ACGAGCGCGC GCCGAGCACG TCGTCGCAGA CCGCGTGGGC GCTGCTCGGC
CTGATGGCGG CGGGCGAAGT CGACAATCCC GCCGTCGCGC GCGGCGTCGA TTACCTGCTC
GGCACGCAGC GCGAGCACGG CCTGTGGGAC GAGACGCGCT TCACCGCGAC GGGCTTCCCG
CGCGTGTTCT ATCTGCGCTA CCACGGCTAC CGCAAGTTCT TCCCGCTGTG GGCGCTCGCC
CGCTATCGCA ACCTGAAGCG CGCGAACGCG ATGCGCGTGA CGGTCGGGAT GTAA
 
Protein sequence
MNDMTEMHTL DATAAPAGLD AAVARATDAL LAAQQADGHW VYELEADSTI PAEYVLLVHY 
LGEAPNVELE QKIARYLRRI QQPDGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ
RARRAIHAMG GAEMSNVFTR IQLALYGVVP WYAVPMMPVE IMLLPQWFPF HLSKVSYWAR
TVIVPLLVLN AKRPVAKNPR GVRIDELFKG APVSTGLLPK QPHQSAGWFA FFRAVDGVLR
LVDGLFPRYT RERAIRQAVA FVDERLNGED GLGAIYPAMA NAVMMYAALG YPEDHPNRAI
ARRSIEKLLV VGEQEAYCQP CLSPVWDTSL AAHALLETGD ARAREAAVRG LDWLVPRQIL
DVRGDWISRR PHVRPGGWAF QYANAHYPDV DDTAVVAMAM DRVAKLDRTD AYRESIARAR
EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLAQFGETSA
SSEPARRALD YMLKEQEPDG SWYGRWGMNY IYGTWTALCS LNAAGLGHDD PRVKRAAQWL
LSIQNADGGW GEDGDSYKLD YRGYERAPST SSQTAWALLG LMAAGEVDNP AVARGVDYLL
GTQREHGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRANA MRVTVGM