Gene BMASAVP1_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_0234 
Symbol 
ID4677648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008784 
Strand
Start bp261012 
End bp264407 
Gene Length3396 bp 
Protein Length1131 aa 
Translation table11 
GC content72% 
IMG OID639842762 
Productserine protease 
Protein accessionYP_989845 
Protein GI121597692 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain
[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.411978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGATGA CCCGGCACAA AAAACGGAAA ACGATGAAGC GCAGCGGAGC GAAGCTGCTT 
GCGCCCGTGG TCGTCGCGGC CGCCGCCGCG GTCGCGGCCC GCCCCGGCTG GGCGCAGGCC
GCGCCGTACC CGGATCCGGG CCGGCGCGGC GATCCGGCGA GTTGGCGCAC GCCGGAATTC
ACGAACGCGT GGGGGCTCGG CGCGATGCAC GCCGAGTATG CATACGCGGC CGGCTATACC
GGCGCGAACG TCGCGATCGG CGTGCTGGAC TCCGGCTACT ACGCGCAGCA TCCGGAACTG
CCCGACAGCC GCTTCGTTCC GGTGACGGCC GCGGGCGTGT CCGGCGTGCT GAATCCGAAC
AACAACAATC ACGGCACGCT GGTGAGCGGC GTCGTCGGCG GCGCGCGCGA CGGTGTCGGC
ATGCACGGCG TCGCACCCGA CGCGACGGTG TACGAAGGCA ACACGAACGC GACCGACGGC
TTCCGGTTCG GCGTATCGGA TCCGAAGTTT CCCGCGTCGG ATGCGAAGTA TTTCGCCGAG
GCCTACGATG CGCTCGCCGC AAAGGGCGTG CGGATCATCA GCAACAGCTG GGGCTCGCAG
CCGGCCAACG AGAACTACAG CACCCTGAAC AAACTCACCG ATGCCTACAA GCTGCACGAG
GCGGTGCGCA CGGCGACCGG CCGGGGCACG TGGCTCGACG CGGCGGCGAA GGTGTCGCGC
GACGGCGTGA TCAACAACTT CAGTTCGGGC AACACCGGCT ACGACAACGC GAGCTTGCGC
GGCGCGTATG CGTACTTCCA CCCCGAACTC GAAGGGCACT GGATGACGAC GACGGGCTAC
GACCAGTTGA GCGGCCAGGT CTACAACCAA TGCGGGATCG CGAAGTGGTG GTGCGTGATG
GCGCCCACGG GCGTGCCGTC GACGTCATAT TCGGGCGGCG CGGCGGCGCC GACCGGGGCG
ACCTACGCGA ACTTCAACGG CACGTCGGCC GCCGCGCCGC ACGCGTCGGC GGCGCTCGCG
CTGATCATGG AGCGCTTTCC GTACATGACG AGCGAGCAGG CGCTGTCGGT GCTGTTCACG
ACCGCGCAGA ACATGGAGCC GGACCCGAGC CGGCCGGACT ACACGAACAA CGGGCTGTTC
TCGACCGTGC ATCCGGCGAA GCCCGGCGCG TCGGGCGTGC CGAACGCGTT CGGCGGCTGG
GGGCTCGTCG ACCTGCGCCG GGCGATGAAC GGCCCGGGCC AACTGCTCGG CACGTTCGAC
GCGGCGCTGC CTGCGGGCAC CGCCGACGTG TGGTCGAACG ACATCTCCGA CGTCGCGCTC
GCCGCGCGCA AGCGCGAAGA CGACGCCGAG CACCGCGCGT GGCTCGACAC GCTGAGGACG
AAGGGATGGG AGCACGGGCT GCCCGCCGGC GCGAGCAATG GCGACCGGAT CGACTATGCG
CTCGGCGTCG CGCGCGAAAC CGCGTATCAG GCGCGCGAGT ATCAGGGCAG TCTCGTGAAA
TCGGGCGGCG GCACGCTGAC GCTCGCCGGC GCGAACACCT ATCGCGGGCC CACGACGGTC
GACGGCGGCG AATTGAGGAT CGACGGCTCG ATCGCCGCGC GCGCCGTCGT CAATCCGGCG
GGCCGGCTCA CGGTGAACGG CCGCGCGGCC GACATCGCGG TCAACGGCGG CGTGGCGACG
ATCGCCGGGA CGAGCGCGAA CCTGTCGATC GACCGGCAGG GCCGGGCCGC CGTGACAGGG
ACGACGGCGG ACGTGCGCGT CGCGAGCGGC TTTGCATCGC TCGGCGGCAC GAGCGGCAAC
GTCGCGGTGG GGGCGCTCGG CGTCGCCGCG ATCACGGGCC GCACGGCCGA CGTGGCGGTC
GACGGCGGCC GCGCGTCGCT CGACGGCGCG AGCGGCAACG TCGCGGTCGG CAACGGCGGC
GTCGTGAGCG GCAGCGGCAC GGTGCGCACG CTCACGGCGG CCGCGAACGG CACGGTCGCG
CCCGGCCATT CGGTCGGTAC GCTGACGGTG TCGGGCGACG TGCGCTTCGC GCCGGGTTCG
ATCTACGCGG TCGAGGTGTC GCCGGGCGGC GCGGGCGACC GGATCGTCGC GGGCGGCCGC
GCGCAAATTG ACGGCGGCGC GTTGGCGCTC GCGCTCGAGA ACACGCCGCC GCCGCTCACG
CCCGAGCAGT CGCGCTCGGT GCTCGGCCGT CGCTTCGAGA TCCTGAATGC GGCGGGCGGC
GTGGCCGGCC GCTTCGATGC GCCGAGCGGC TATCTGTTCG TCAATCCGGT GCTCGCCTAT
GGCCCGACGA CCGTGAGCCT CACGATCGAT CGTAACGCGA CGCCGTTCGC AAGCGTCGCG
CGGACCGCGA ACGAGCGCGG CGTCGCCGAT GCGCTCGAAA CGGCGGACCC GGGCAGCGCG
GTTTACAACA GCGTGCTGTT CGCGGCGTCC GCGCAGGCGC CGCAGGCGAC GCTCGCGCAA
CTGACGGGCG AGATCTATCC GGCCGCCTAC GCGGCGCTCG TCAACGAAAG CCGGCAAGTG
CGCGAAACGG CGCTCGAGCG CCTGTGGACG GCGCGCGGCG CGCCGGGCCG CGCCGGCGCC
TGGGCGCGGC TGCTCGGCGC GTGGGGCAGC GCGCGCGGCG GCGACGTGAA CGGCTACACG
AGCTCGACGG GCGGCTTCCT CGCGGGCGCG GACGCGGCGC TGCTCGACAG CGTGCGGGCG
GGCGGCTTCG CCGGCTACAG CCACACCGGC GTGAACCTGA GGAATCAGCC GTCGTCCGCG
TCGTTCGACA GCTTCCATCT CGGCGCATAC GCGGGGTGGC AGCCCGGCGC ACTCGGCGTG
CGAATCGGCG CGGCGCATGC GTGGCATCGC GGCGGTGTCG ATCGCGCGGT GCAATATGGC
GCGGTTGCCG AGAACGAAAC GACGGCGCTG CACGCGGAAA CGACGCAGGT GTTCGGCGAG
GCCGGCTATC GGTTCGCGCT CGATGGCGCC GCGACGCTCG AGCCGTTCTT CGGCGTCGCG
TATGTGCATC TGAAGAACCA GGGGACGACG GAAACCGGCG GCGCGGCGGC GTTGCGCGTG
CGGCAAGGCA ATCACGACGT GACGTTCTCG ACGCTCGGCG TGCGCGGCGA AACGCGGCTT
GGCCTGACGT CGCGACTGCA GTTGACGCTG CAGGGCAGCG CGGGCTGGCA GCATGCGCTG
ACGGACGGGC AGCCGAGCGG CACGCTCGCG TTCGCGACGG GGAGCGACAC GTTCACCGTG
TCGAGCGTGC CGGTTGCGAA GGATGCGGCG GTGCTGAACG TGGGCGCCGG GCTCGAGCTC
GGCAAGAACG GATGGCTGCG CGTCGGCTAT TCCGGCTCGC TCGCGAGCCG TCAGTCCGAG
CACGCGGTGC AAGGCAGCCT GCACTGGAAG TTCTGA
 
Protein sequence
MLMTRHKKRK TMKRSGAKLL APVVVAAAAA VAARPGWAQA APYPDPGRRG DPASWRTPEF 
TNAWGLGAMH AEYAYAAGYT GANVAIGVLD SGYYAQHPEL PDSRFVPVTA AGVSGVLNPN
NNNHGTLVSG VVGGARDGVG MHGVAPDATV YEGNTNATDG FRFGVSDPKF PASDAKYFAE
AYDALAAKGV RIISNSWGSQ PANENYSTLN KLTDAYKLHE AVRTATGRGT WLDAAAKVSR
DGVINNFSSG NTGYDNASLR GAYAYFHPEL EGHWMTTTGY DQLSGQVYNQ CGIAKWWCVM
APTGVPSTSY SGGAAAPTGA TYANFNGTSA AAPHASAALA LIMERFPYMT SEQALSVLFT
TAQNMEPDPS RPDYTNNGLF STVHPAKPGA SGVPNAFGGW GLVDLRRAMN GPGQLLGTFD
AALPAGTADV WSNDISDVAL AARKREDDAE HRAWLDTLRT KGWEHGLPAG ASNGDRIDYA
LGVARETAYQ AREYQGSLVK SGGGTLTLAG ANTYRGPTTV DGGELRIDGS IAARAVVNPA
GRLTVNGRAA DIAVNGGVAT IAGTSANLSI DRQGRAAVTG TTADVRVASG FASLGGTSGN
VAVGALGVAA ITGRTADVAV DGGRASLDGA SGNVAVGNGG VVSGSGTVRT LTAAANGTVA
PGHSVGTLTV SGDVRFAPGS IYAVEVSPGG AGDRIVAGGR AQIDGGALAL ALENTPPPLT
PEQSRSVLGR RFEILNAAGG VAGRFDAPSG YLFVNPVLAY GPTTVSLTID RNATPFASVA
RTANERGVAD ALETADPGSA VYNSVLFAAS AQAPQATLAQ LTGEIYPAAY AALVNESRQV
RETALERLWT ARGAPGRAGA WARLLGAWGS ARGGDVNGYT SSTGGFLAGA DAALLDSVRA
GGFAGYSHTG VNLRNQPSSA SFDSFHLGAY AGWQPGALGV RIGAAHAWHR GGVDRAVQYG
AVAENETTAL HAETTQVFGE AGYRFALDGA ATLEPFFGVA YVHLKNQGTT ETGGAAALRV
RQGNHDVTFS TLGVRGETRL GLTSRLQLTL QGSAGWQHAL TDGQPSGTLA FATGSDTFTV
SSVPVAKDAA VLNVGAGLEL GKNGWLRVGY SGSLASRQSE HAVQGSLHWK F