Gene BURPS1106A_A2709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2709 
SymbolmprA 
ID4904771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2648489 
End bp2649982 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content70% 
IMG OID640145812 
Productserine metalloprotease 
Protein accessionYP_001076739 
Protein GI126455665 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTCGTA CTGCTTCGTT CAAGGCGACC GTCCTGTGTG CCGCGCTGGC CGGCCTCGTT 
TCGGCCGCGC AAGCGGAAAC CGCGGCCGCG CCCCAGGTGC CGGGGCCCGC CGACGCGGTC
AATCAGTTGA TCGTCAAGTT GCGCGCGGTG AAGACGCCGC CCGGTGCGAC GGCCGCGAAG
GCCGAGCGCG CGGACGTTCA GGCCGTCATC GATCGCGTGC TCGCCGCGCG CAATGCGCGG
GCGGCGGGGC GTGCGTTCGG CGCGGCCGCC GCATCCGCGC CCGGCAATCC GGACGACCCC
GCCGCGGGCA TTCGCATCAA GCGCGACATG TCGGGCGGCG CGACCGTGCT GTCGCTGCAG
CGCCACGTGT CGCTCGCGCA GGCCGAGGCG CTCGCGCGCG ACTTCGCGGC GGACGGCGCG
ATCGAATATG CGGAGCCCGA TGCGCGGATG CATCCGTTCG TCGTGCCGAA CGATACGCGC
TATTCGGAGC AATGGGGCTA CTTCAATCCG ACCGCCGGCG CGAATCTGCC GAAGGCTTGG
GATCGCACGA CCGGCTCCGC GCGCGTCGTC GTCGCCGTCA TCGATACCGG CTACCGTCCG
CATGCGGATC TCGCCGCGAA CCTGCTGCCG GGCTACGACT TCATCTCCGA TATCCCGAGC
GCGAACGACG GCAATGGCCG CGACAGCGAC GCATCGGATC CCGGCGACTG GGTGAGCGCG
CAGGAAGACG GCGATCCGAG CGGCCCATTC TATGGCTGCG GCGCGAGCGA CAGCTCATGG
CACGGCACGC ACGTCGCGGG CACGATCGGC GCGGTGACGA ACAACGGCGT CGGCGTGGCG
GGCATCTCGT GGGTCGGCAA GGTGCTGCCC GTGCGCGTGC TCGGCAAGTG CGGCGGGATG
CTGAGCGACA TCGCCGACGG CATGCGCTGG GCGGCGGGCC TGCCGGTGCC GGGCGCGCCG
TCGAATCCGA ACCCGGCGAA GGTGCTGAAC CTGAGCCTCG GCGGATACGG CCGCACATGC
AGCTCGACGT ACCAGAACGC GATCAACGAA ATCACGTCGC GCGGCGCGAA CGTCGTTGTC
GCCGCGGGCA ATAACGGCGG CTCGGTGTCG ACGACTCAGC CGGCGAATTG CCGGGGCGTG
ATCGCGGTCG GCGCGATCGA CAGCCGCGGT GTGCGCGCGA GCTTCAGCAA CACCGGCGCC
GCGGTGAAGA TCTCCGCGCC GGGCGTCGGC ATTCTGTCGA CGCTCAATGC GGGCAAGACC
TCGCCGGGCG CGGACAGCTA CGCGAGCTAT AGCGGCACGA GCATGGCAAC GCCGCATGTC
GCGGGCACGG TCGCGCTGAT GCTCGCCGTC AACTCGACGC TGTCGCCTTC GCAGATCTTG
CAGCGGCTGC AATCGAGCGC GCGGCCGTTC TCGAGCGGAT CGAGCTGCTC GACGAGCACG
TGCGGCGCAG GGCTGCTCGA CGCAGGCAAC GCGGTCGACG CCGCCGCGCA GTGA
 
Protein sequence
MIRTASFKAT VLCAALAGLV SAAQAETAAA PQVPGPADAV NQLIVKLRAV KTPPGATAAK 
AERADVQAVI DRVLAARNAR AAGRAFGAAA ASAPGNPDDP AAGIRIKRDM SGGATVLSLQ
RHVSLAQAEA LARDFAADGA IEYAEPDARM HPFVVPNDTR YSEQWGYFNP TAGANLPKAW
DRTTGSARVV VAVIDTGYRP HADLAANLLP GYDFISDIPS ANDGNGRDSD ASDPGDWVSA
QEDGDPSGPF YGCGASDSSW HGTHVAGTIG AVTNNGVGVA GISWVGKVLP VRVLGKCGGM
LSDIADGMRW AAGLPVPGAP SNPNPAKVLN LSLGGYGRTC SSTYQNAINE ITSRGANVVV
AAGNNGGSVS TTQPANCRGV IAVGAIDSRG VRASFSNTGA AVKISAPGVG ILSTLNAGKT
SPGADSYASY SGTSMATPHV AGTVALMLAV NSTLSPSQIL QRLQSSARPF SSGSSCSTST
CGAGLLDAGN AVDAAAQ