Gene BMA10247_A1078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10247_A1078 
Symbol 
ID4889223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10247 
KingdomBacteria 
Replicon accessionNC_009079 
Strand
Start bp1020135 
End bp1021754 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content71% 
IMG OID640147351 
Productserine metalloprotease MrpA 
Protein accessionYP_001078270 
Protein GI126446775 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.673556 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGTCC TCGTTCCGCT CGCCGGTTGC GGCGGCGGTG GCGACGGAGG CGGAAGCGGC 
ACGCCGTCGG CCGCCGCGCA GCCGACCCCC GCGCCGGCAC CGGCGCCGGC CCCGGCACCC
GCGCCGAGCT CGGGTTCGTC GCAATCCACC AATTCGTCGA CCTCGACGGC GGCCTGCCCC
GTCACGCAGG CCGCCTCGAC CGCCGCCGGC GAAACGCTCG TCACCCGCAC CGTTTCGCAC
GAAGCACCCG TCGACCATCT GATCGTCAAG CTGCAACGCA CGGCGGCGGC GAGCGCATCC
GGCGCGCGCA TCATGGCCGC GGCGAACGAC GCGGCCCGAC TCGATTCGGT GATCCAGCGC
GTGATGTCGC AATGGAGCGC GAAGAGCGGC GCCGTTCGCT CGTATGCGCA GAACATCGCG
CCGACGAACG CGGTGCAGGT GGAACGGACG ATGTCGGACG GTGCCGCGCT GCTCGCGCTC
GGACAAAAGA TGAGCGCGGA TAATGCCGGC GCTCTCGCGC AAACGTTCGC GGCCGATCCG
GACGTCGCCT ATGCGGAGCC CGACCGGCGC GTGTTCGCCC GCACGGTGGC GACCGACCCG
GACTACGCGC AGCAGTGGAA CTACTTCGAT CCGGCGGCCG GCATCAATCT GCCGGACGCA
TGGAACGTGA CGAACGGCCT GCCGAGCGTC GTCACCGCGG TGCTCGACAC CGGCTATCGC
CCGCATCCGG ACATCATCGC GAACCTGCTG CCGGGCTACG ATTTCATCTC CGACATCAAC
ACCGGCAACA ACGGCCACGG CCGCGGCCCG GACGCGACCG ACCCGGGCGA CTGGGTCACG
CAGCAGGAAC TGACCGATCC GTCGAGCCCG TTCTACCAAT GCGCGAGCGC GCCGTCGAAC
AGCAGCTGGC ACGGCACGCA GGTCGCCGGC ATCATCGGCG CCGCCGCGAA CAACGGCATC
GGCATCGCGG GCGTCAGCTG GTACGGCAAG ATCCTGCCCG TGCGCGTGCT CGGCAAGTGC
GGCGGCACGA CGAGCGACAT CGCCGACGCG ATGCGCTGGG CGGCGGGCAT TCCCGTCGCG
GGCGCGCCGA CGAACCTCAC GCCGGCGAAG GTGATCAACC TGAGCCTCGG CGGCAGCGGC
CCGTGCGGCG ACACGTTCCA GCAGGCGATC AACGACGTGA TCGCGCGCGG CACGACCGTC
GTCGTCTCGG CCGGCAACGA CGGCCAGGCG ACGACGCTGG ACCGCCCGGC CAACTGCAAG
GGCGTGATCT CGGTCGGCGC GACCGACAGC ACCGGCCAGC GCGCGTGGTA CAGCAACTTC
GGCTCGGACA TCACGCTGAG CGCGCCGGGC TCGAACATCC TGTCGACGAG CAATGCGGGC
ACCACGGTGC CGACCACCGA CGCGTACGGC ACGCACAGCG GCACGAGCCT CGCCGCGCCG
CAGGTGGCGG GCGTCGCCTC GCTGATGCTC GCGGTCAACC CGAACCTCAC GCCCGCGCAG
ATCGCGCAGA AGCTCGCGAG CACCGCGCGG CCGTCGCCGG CCACCGCATC CTGCCTCGCG
CGCGCGCCGG GCGCGGGCAT CGTCGACGCC GGCACGGTGG TTGCGTCCGC AACGAAATAG
 
Protein sequence
MSVLVPLAGC GGGGDGGGSG TPSAAAQPTP APAPAPAPAP APSSGSSQST NSSTSTAACP 
VTQAASTAAG ETLVTRTVSH EAPVDHLIVK LQRTAAASAS GARIMAAAND AARLDSVIQR
VMSQWSAKSG AVRSYAQNIA PTNAVQVERT MSDGAALLAL GQKMSADNAG ALAQTFAADP
DVAYAEPDRR VFARTVATDP DYAQQWNYFD PAAGINLPDA WNVTNGLPSV VTAVLDTGYR
PHPDIIANLL PGYDFISDIN TGNNGHGRGP DATDPGDWVT QQELTDPSSP FYQCASAPSN
SSWHGTQVAG IIGAAANNGI GIAGVSWYGK ILPVRVLGKC GGTTSDIADA MRWAAGIPVA
GAPTNLTPAK VINLSLGGSG PCGDTFQQAI NDVIARGTTV VVSAGNDGQA TTLDRPANCK
GVISVGATDS TGQRAWYSNF GSDITLSAPG SNILSTSNAG TTVPTTDAYG THSGTSLAAP
QVAGVASLML AVNPNLTPAQ IAQKLASTAR PSPATASCLA RAPGAGIVDA GTVVASATK