Gene BMASAVP1_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_0221 
Symbol 
ID4677090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008784 
Strand
Start bp243006 
End bp244625 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content71% 
IMG OID639842749 
Productserine metalloprotease MrpA 
Protein accessionYP_989832 
Protein GI121597334 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGTCC TCGTTCCGCT CGCCGGTTGC GGCGGCGGTG GCGACGGAGG CGGAAGCGGC 
ACGCCGTCGG CCGCCGCGCA GCCGACCCCC GCGCCGGCAC CGGCGCCGGC CCCGGCACCC
GCGCCGAGCT CGGGTTCGTC GCAATCCACC AATTCGTCGA CCTCGACGGC GGCCTGCCCC
GTCACGCAGG CCGCCTCGAC CGCCGCCGGC GAAACGCTCG TCACCCGCAC CGTTTCGCAC
GAAGCACCCG TCGACCATCT GATCGTCAAG CTGCAACGCA CGGCGGCGGC GAGCGCATCC
GGCGCGCGCA TCATGGCCGC GGCGAACGAC GCGGCCCGAC TCGATTCGGT GATCCAGCGC
GTGATGTCGC AATGGAGCGC GAAGAGCGGC GCCGTTCGCT CGTATGCGCA GAACATCGCG
CCGACGAACG CGGTGCAGGT GGAACGGACG ATGTCGGACG GTGCCGCGCT GCTCGCGCTC
GGACAAAAGA TGAGCGCGGA TAATGCCGGC GCTCTCGCGC AAACGTTCGC GGCCGATCCG
GACGTCGCCT ATGCGGAGCC CGACCGGCGC GTGTTCGCCC GCACGGTGGC GACCGACCCG
GACTACGCGC AGCAGTGGAA CTACTTCGAT CCGGCGGCCG GCATCAATCT GCCGGACGCA
TGGAACGTGA CGAACGGCCT GCCGAGCGTC GTCACCGCGG TGCTCGACAC CGGCTATCGC
CCGCATCCGG ACATCATCGC GAACCTGCTG CCGGGCTACG ATTTCATCTC CGACATCAAC
ACCGGCAACA ACGGCCACGG CCGCGGCCCG GACGCGACCG ACCCGGGCGA CTGGGTCACG
CAGCAGGAAC TGACCGATCC GTCGAGCCCG TTCTACCAAT GCGCGAGCGC GCCGTCGAAC
AGCAGCTGGC ACGGCACGCA GGTCGCCGGC ATCATCGGCG CCGCCGCGAA CAACGGCATC
GGCATCGCGG GCGTCAGCTG GTACGGCAAG ATCCTGCCCG TGCGCGTGCT CGGCAAGTGC
GGCGGCACGA CGAGCGACAT CGCCGACGCG ATGCGCTGGG CGGCGGGCAT TCCCGTCGCG
GGCGCGCCGA CGAACCTCAC GCCGGCGAAG GTGATCAACC TGAGCCTCGG CGGCAGCGGC
CCGTGCGGCG ACACGTTCCA GCAGGCGATC AACGACGTGA TCGCGCGCGG CACGACCGTC
GTCGTCTCGG CCGGCAACGA CGGCCAGGCG ACGACGCTGG ACCGCCCGGC CAACTGCAAG
GGCGTGATCT CGGTCGGCGC GACCGACAGC ACCGGCCAGC GCGCGTGGTA CAGCAACTTC
GGCTCGGACA TCACGCTGAG CGCGCCGGGC TCGAACATCC TGTCGACGAG CAATGCGGGC
ACCACGGTGC CGACCACCGA CGCGTACGGC ACGCACAGCG GCACGAGCCT CGCCGCGCCG
CAGGTGGCGG GCGTCGCCTC GCTGATGCTC GCGGTCAACC CGAACCTCAC GCCCGCGCAG
ATCGCGCAGA AGCTCGCGAG CACCGCGCGG CCGTCGCCGG CCACCGCATC CTGCCTCGCG
CGCGCGCCGG GCGCGGGCAT CGTCGACGCC GGCACGGTGG TTGCGTCCGC AACGAAATAG
 
Protein sequence
MSVLVPLAGC GGGGDGGGSG TPSAAAQPTP APAPAPAPAP APSSGSSQST NSSTSTAACP 
VTQAASTAAG ETLVTRTVSH EAPVDHLIVK LQRTAAASAS GARIMAAAND AARLDSVIQR
VMSQWSAKSG AVRSYAQNIA PTNAVQVERT MSDGAALLAL GQKMSADNAG ALAQTFAADP
DVAYAEPDRR VFARTVATDP DYAQQWNYFD PAAGINLPDA WNVTNGLPSV VTAVLDTGYR
PHPDIIANLL PGYDFISDIN TGNNGHGRGP DATDPGDWVT QQELTDPSSP FYQCASAPSN
SSWHGTQVAG IIGAAANNGI GIAGVSWYGK ILPVRVLGKC GGTTSDIADA MRWAAGIPVA
GAPTNLTPAK VINLSLGGSG PCGDTFQQAI NDVIARGTTV VVSAGNDGQA TTLDRPANCK
GVISVGATDS TGQRAWYSNF GSDITLSAPG SNILSTSNAG TTVPTTDAYG THSGTSLAAP
QVAGVASLML AVNPNLTPAQ IAQKLASTAR PSPATASCLA RAPGAGIVDA GTVVASATK