Gene Information |
Locus tag | BMASAVP1_0221 |
Symbol | |
ID | 4677090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008784 |
Strand | + |
Start bp | 243006 |
End bp | 244625 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639842749 |
Product | serine metalloprotease MrpA |
Protein accession | YP_989832 |
Protein GI | 121597334 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
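The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that contributes no residue. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 243006
end_bp = 244625

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed
# to be the stop codon, which adds no amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp")      # 1620 bp
print(protein_length, "aa")   # 539 aa
```

Both results agree with the Gene Length (1620 bp) and Protein Length (539 aa) rows.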
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGTCC TCGTTCCGCT CGCCGGTTGC GGCGGCGGTG GCGACGGAGG CGGAAGCGGC ACGCCGTCGG CCGCCGCGCA GCCGACCCCC GCGCCGGCAC CGGCGCCGGC CCCGGCACCC GCGCCGAGCT CGGGTTCGTC GCAATCCACC AATTCGTCGA CCTCGACGGC GGCCTGCCCC GTCACGCAGG CCGCCTCGAC CGCCGCCGGC GAAACGCTCG TCACCCGCAC CGTTTCGCAC GAAGCACCCG TCGACCATCT GATCGTCAAG CTGCAACGCA CGGCGGCGGC GAGCGCATCC GGCGCGCGCA TCATGGCCGC GGCGAACGAC GCGGCCCGAC TCGATTCGGT GATCCAGCGC GTGATGTCGC AATGGAGCGC GAAGAGCGGC GCCGTTCGCT CGTATGCGCA GAACATCGCG CCGACGAACG CGGTGCAGGT GGAACGGACG ATGTCGGACG GTGCCGCGCT GCTCGCGCTC GGACAAAAGA TGAGCGCGGA TAATGCCGGC GCTCTCGCGC AAACGTTCGC GGCCGATCCG GACGTCGCCT ATGCGGAGCC CGACCGGCGC GTGTTCGCCC GCACGGTGGC GACCGACCCG GACTACGCGC AGCAGTGGAA CTACTTCGAT CCGGCGGCCG GCATCAATCT GCCGGACGCA TGGAACGTGA CGAACGGCCT GCCGAGCGTC GTCACCGCGG TGCTCGACAC CGGCTATCGC CCGCATCCGG ACATCATCGC GAACCTGCTG CCGGGCTACG ATTTCATCTC CGACATCAAC ACCGGCAACA ACGGCCACGG CCGCGGCCCG GACGCGACCG ACCCGGGCGA CTGGGTCACG CAGCAGGAAC TGACCGATCC GTCGAGCCCG TTCTACCAAT GCGCGAGCGC GCCGTCGAAC AGCAGCTGGC ACGGCACGCA GGTCGCCGGC ATCATCGGCG CCGCCGCGAA CAACGGCATC GGCATCGCGG GCGTCAGCTG GTACGGCAAG ATCCTGCCCG TGCGCGTGCT CGGCAAGTGC GGCGGCACGA CGAGCGACAT CGCCGACGCG ATGCGCTGGG CGGCGGGCAT TCCCGTCGCG GGCGCGCCGA CGAACCTCAC GCCGGCGAAG GTGATCAACC TGAGCCTCGG CGGCAGCGGC CCGTGCGGCG ACACGTTCCA GCAGGCGATC AACGACGTGA TCGCGCGCGG CACGACCGTC GTCGTCTCGG CCGGCAACGA CGGCCAGGCG ACGACGCTGG ACCGCCCGGC CAACTGCAAG GGCGTGATCT CGGTCGGCGC GACCGACAGC ACCGGCCAGC GCGCGTGGTA CAGCAACTTC GGCTCGGACA TCACGCTGAG CGCGCCGGGC TCGAACATCC TGTCGACGAG CAATGCGGGC ACCACGGTGC CGACCACCGA CGCGTACGGC ACGCACAGCG GCACGAGCCT CGCCGCGCCG CAGGTGGCGG GCGTCGCCTC GCTGATGCTC GCGGTCAACC CGAACCTCAC GCCCGCGCAG ATCGCGCAGA AGCTCGCGAG CACCGCGCGG CCGTCGCCGG CCACCGCATC CTGCCTCGCG CGCGCGCCGG GCGCGGGCAT CGTCGACGCC GGCACGGTGG TTGCGTCCGC AACGAAATAG
|
Protein sequence | MSVLVPLAGC GGGGDGGGSG TPSAAAQPTP APAPAPAPAP APSSGSSQST NSSTSTAACP VTQAASTAAG ETLVTRTVSH EAPVDHLIVK LQRTAAASAS GARIMAAAND AARLDSVIQR VMSQWSAKSG AVRSYAQNIA PTNAVQVERT MSDGAALLAL GQKMSADNAG ALAQTFAADP DVAYAEPDRR VFARTVATDP DYAQQWNYFD PAAGINLPDA WNVTNGLPSV VTAVLDTGYR PHPDIIANLL PGYDFISDIN TGNNGHGRGP DATDPGDWVT QQELTDPSSP FYQCASAPSN SSWHGTQVAG IIGAAANNGI GIAGVSWYGK ILPVRVLGKC GGTTSDIADA MRWAAGIPVA GAPTNLTPAK VINLSLGGSG PCGDTFQQAI NDVIARGTTV VVSAGNDGQA TTLDRPANCK GVISVGATDS TGQRAWYSNF GSDITLSAPG SNILSTSNAG TTVPTTDAYG THSGTSLAAP QVAGVASLML AVNPNLTPAQ IAQKLASTAR PSPATASCLA RAPGAGIVDA GTVVASATK
|
| |
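The protein sequence can be derived from the gene sequence with translation table 11, and the GC content can be recomputed from the same string. The following is a minimal sketch using only the Python standard library. It relies on the fact that table 11 assigns the same amino acid to every codon as the standard code (the two tables differ only in which codons may act as start codons), and it does not model alternative start codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and are not part of any database API.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTCCGTCC TCGTTCCGCT CGCCGGTTGC"
print(translate(first_blocks))  # MSVLVPLAGC
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the Protein sequence and GC content fields to be checked in the same way.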