Gene BMA0539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA0539 
Symbol 
ID3089279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006348 
Strand
Start bp564138 
End bp565523 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content69% 
IMG OID637561371 
Productserine protease, MucD 
Protein accessionYP_102335 
Protein GI53725361 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.794345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCGCGG CGCGCGCGGG CCTGCCCGAT TTCGCGGACC TCGTCGAGCG GGTCGGCCCG 
GCGGTCGTCA ACATCCGGAC GACGGCGAAC GTGCCGGCCG ATACGCGCGG CGCGCTGCCG
CCCGGCCTCG ACAACGGCGA CATGTCGGAA TTCTTCCGCC GCTTCTTCGG CATTCCGTTG
CCGCAGGGGC CGGGCGGGCA GAAGAACGCG CCGAGCACGC CCGATGCGCC CGACACCGAA
CAGAACCGCG GCGTGGGCTC GGGCTTCATC CTGTCGCCGG ACGGCTATGT GATGACGAAC
GCGCACGTCG TCGACGACGC GGACACGATC TACGTGACGC TCACCGACAA GCGCGAATTC
AAGGCAAAGC TCATCGGCGT CGACGAGCGC ACGGACGTCG CGATCGTGAA GATCAACGCG
TCGAGCCTGC CGACCGTCGC GATCGGCGAT TCGAACCGCG TGCGCGTCGG CGAATGGGTC
GTCGCGATCG GTTCGCCGTT CGGCCTCGAC AACACCGTCA CGGCCGGCAT CGTCAGCGCA
AAGGGCCGCA ACACCGGCGA CTATCTGCCG TTCATCCAGA CGGACGTCGC GGTCAACCCC
GGCAACTCGG GCGGCCCGCT CATCAACATG CAGGGCGAGG TGATCGGCAT CAACTCGCAG
ATCTACAGCC GCACGGGCGG CTTCATGGGC ATTTCGTTCG CGATTCCGAT CGACGAGGCG
ATGCGCGTCG CCGAGCAGCT GAAGGCATCG GGCAAGGTCA CGCGCGGCAG GATCGCGGTC
GCGATCGGCG AGGTGACGAA GGAAGTGGCG GATTCGATCG GCCTGCCGAA GGCCGAAGGC
GCGCTCGTCA GCAGCGTCGA GCCAGGCGGC CCGGCCGACA AGGCGGGCCT GCAGCCGGGT
GACATCATCC TGAAGTTCAA CGGCCGTCCG GTGGAGGCGG CGTCGGATCT GCCGCGCATG
GTCGGCGACA CGAAGCCGGG CGCGAAGGCG ACGGTGACGG TGTGGCGCAA GGGGCAATCG
CGCGATCTGC CGATCACGAT CGCGGAATTC CCGGCCGACA AGGCCGCGAA GGCCGACAGC
CGTCAGGCGC CGCAGCAGAA GCCGCGCAGC AGCGCGCTCG GCCTGACGGT CAGCGACCTG
TCGCCCGAGC AGTTGAAGAC GCTCAAGCTG CGCAACGGCG TGCAGATCGA CGCGGTCGAC
GGCCCGGCCT CGCGCGCGGG GCTGCAGCGC GGCGACATCG TGCTGCGCGT CGGCGACGTC
GACATCACGA GCGCGAAGCA GTTCGTCGAC GTGACGTCGA AGCTCGATCC GCAGCGCGCG
GTCGCGGTGC TCGTGCGGCG CGGCGAGAAC ACGCAGTTCA TCCCGATCCG GCCGCGTCAG
AAGTGA
 
Protein sequence
MPAARAGLPD FADLVERVGP AVVNIRTTAN VPADTRGALP PGLDNGDMSE FFRRFFGIPL 
PQGPGGQKNA PSTPDAPDTE QNRGVGSGFI LSPDGYVMTN AHVVDDADTI YVTLTDKREF
KAKLIGVDER TDVAIVKINA SSLPTVAIGD SNRVRVGEWV VAIGSPFGLD NTVTAGIVSA
KGRNTGDYLP FIQTDVAVNP GNSGGPLINM QGEVIGINSQ IYSRTGGFMG ISFAIPIDEA
MRVAEQLKAS GKVTRGRIAV AIGEVTKEVA DSIGLPKAEG ALVSSVEPGG PADKAGLQPG
DIILKFNGRP VEAASDLPRM VGDTKPGAKA TVTVWRKGQS RDLPITIAEF PADKAAKADS
RQAPQQKPRS SALGLTVSDL SPEQLKTLKL RNGVQIDAVD GPASRAGLQR GDIVLRVGDV
DITSAKQFVD VTSKLDPQRA VAVLVRRGEN TQFIPIRPRQ K