Gene BURPS1710b_A0367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0367 
SymbolptrB 
ID3693993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp516368 
End bp518716 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content68% 
IMG OID637730621 
Productprolyl oligopeptidase family protein 
Protein accessionYP_335526 
Protein GI76817831 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCGGC GCGCGTGGGT GATAGTCGGA ATCGCGCATG GATACGGGCG AAGCGGAAAA 
CGGCACGGCG TCGGCACCGG CTTCGGACAG CCCCGACACG ACGAAGGCAC GCTTGCGACG
TGTACGCGGC GCGCCACGCG CGCGCCGCGA ATGCCTGCGG CGGACGCGTC GCAATCGACG
AGCCGGCAGC CAAGCGCGAG CGCCGCTCGT CTTACGCTAC CATTACACGC CATGCCCCTT
GCAAAGAACG CCATGCCTCA TGCTTCCTGG CCCGAACAGG CCGATCCGCA TCAATTCCTC
GAAGAACTGG ACAGCGCCGC GAGCGTCGGC TGGGTCGACG CGCAAAACGC CCGCACGCAC
GATGCGCCCT GGCTCGACGA AGCGCACTAT CGCGCGCTGG TCGAGCGCTT CACCCGGGCG
CTGCTGCCGC GCGAGCGCCC GGTGATTCCG CAGCGCTGGC AGGACTGGGC GTACGACGTC
TGGCAGGACG AACAGCATCC GAAGGGCCTG TGGCGGCGCA CGCGATGGAC GAGCTGGCGC
AGCGGCCACG CGGACTGGCA GACGTTGATC GACCTCGATG CGCTTGGTGA AGCGCAAGGC
GTGCAGTGGG TGTTCGACGA TCAGCTCATC CTCGAGCCGG ACGGCGATCG TGCGCTGATC
GTGCTGTCCG ACGGCGGCGC CGACGCGGTC GTCGTCCGCG AGTTCGACAT CGCGCAATGC
CGGTTCGTCG ACGACGGCTT CTCGATCGAA GCGGCCGGCA AGCATTCGGT CGAATGGATC
GATCGCGACA CGATCTACGT CGGCTGGGAC GACGGCGGCG CCACCGTCAC GCGCTCCGGC
TATCCGCGCG AAGTGCGGCG CTGGACGCGC GGCACGCCGC TGTCCAGCGC GCCCGTGGTG
TTTCGCGGCG CGCGCGGCGA CATCTCGGTC GATGCGCAAT ACGATCCGCT CGACCGGCAT
CACGCGATCG AGCAGGCGAT CAATTTCTAC GACGCGAACA CGTATCGCCT CGCCGAGGAC
GGCGCGTGGG CGCGCTACGA CGTGCCACCG CACGTCGAAG TCGGTTACTG GAGCGGGTGG
CTGCTGCTCC AGCCGCGGCT CGACTGGACT TGCGGCGGCG CGCGCTACGC GGGCGGCAGC
CTGCTCGCGA TCCGCGAGGA CGCGTTCGTC GCCGGTGAGC GCGCGTTCGC CGCGCTGTTC
GAGCCGAACG AGCGCACGTC CGCATGCGGC TGGACGCACA CGCGCCGCTA CGTGCTGGTG
TCGTGGCTCG ACGACGTGCT CACGCGCACG ATGCTCTGGC TTCCCGAACG TCAGGATGAC
GGAGCATGGC GCTGGCATGC TCGTCCGTTC CCCGCGCGAG GGCTCGCGCA AGTGGACGTG
TCGCCCGTCG AGCCCACGTT CGACGACGAG GTGTACGTGA GCGTCGACGA TTACCTGAAG
CCGCCCGAGT ATTCGCTCGC GAATCTCGCC AGCGACGACC TGTCCGCCTG GACGCTGCTC
GACCGCTGGC CGACGCAGTT CGACGCGTCC GAACTGACGG TGCGGCGCGA ACACGCGCGC
TCGCGCGACG GCACGCTCGT GCCTTATACG CTGGTCGGGC CGCGCGACGT GCTGGACAAT
GCGGCGCGCG CGCCGCGCCC CTGCCTGTTG AACGGCTACG GCGGCTTCGC GATTGCGCTC
ACGCCCGATT ACGATCCGTT GCTCGGCATC GGCTGGCTCG AGAAAGGCGG CATCGCGGTG
TTCGCCCATA TTCGCGGCGG CGGCGAGTTC GGCACGCAGT GGCACGAATC GGCGCGGCAA
ACGCAACGGC AGCGATCGTT CGACGATTTC ATCGCGGTCG CCGAAAAACT CGTCGCGGAC
GGCGTGACGA GCGCCGCGCA ACTGGGTATT CGCGGCGGCA GCAACGGCGG GCTGCTGGTC
GCGGCATGCA TGATTCAGCG CCCGGACCTG TTCGGCGCGG TGGTGAGCGA CGTGCCGCTT
CTCGACATGC AGCGCTATGC GCTGCTGCAC GCGGGCGCAT CGTGGCTGGA CGAATTCGGC
GATCCCGACG ATCCGGCGCA TGCGTCGGCG CTCGCGGCCT ACTCGCCGTA TCACCGGGTC
GCGCGCGACA TCGCGTATCC GCCCGCGCTG TTCACGACAT CGACGAGCGA CGACCGCGTG
CATCCCGCCC ATGCGAGAAA AATGGTCGCG CGCATGCAGG CGCAAGGGCA CCGAAACGTA
TGGCTGATCG AGAAAACCGA TGGCGGCCAC GGCAGCGCGG ACGCGATCGA TACCGCCGAG
CACGAAGCGA TCGGCTATGT GTTTCTGTGG ACTCACTTGT CCCGCGGCGC GCATGACGCG
CGCGAGTGA
 
Protein sequence
MTRRAWVIVG IAHGYGRSGK RHGVGTGFGQ PRHDEGTLAT CTRRATRAPR MPAADASQST 
SRQPSASAAR LTLPLHAMPL AKNAMPHASW PEQADPHQFL EELDSAASVG WVDAQNARTH
DAPWLDEAHY RALVERFTRA LLPRERPVIP QRWQDWAYDV WQDEQHPKGL WRRTRWTSWR
SGHADWQTLI DLDALGEAQG VQWVFDDQLI LEPDGDRALI VLSDGGADAV VVREFDIAQC
RFVDDGFSIE AAGKHSVEWI DRDTIYVGWD DGGATVTRSG YPREVRRWTR GTPLSSAPVV
FRGARGDISV DAQYDPLDRH HAIEQAINFY DANTYRLAED GAWARYDVPP HVEVGYWSGW
LLLQPRLDWT CGGARYAGGS LLAIREDAFV AGERAFAALF EPNERTSACG WTHTRRYVLV
SWLDDVLTRT MLWLPERQDD GAWRWHARPF PARGLAQVDV SPVEPTFDDE VYVSVDDYLK
PPEYSLANLA SDDLSAWTLL DRWPTQFDAS ELTVRREHAR SRDGTLVPYT LVGPRDVLDN
AARAPRPCLL NGYGGFAIAL TPDYDPLLGI GWLEKGGIAV FAHIRGGGEF GTQWHESARQ
TQRQRSFDDF IAVAEKLVAD GVTSAAQLGI RGGSNGGLLV AACMIQRPDL FGAVVSDVPL
LDMQRYALLH AGASWLDEFG DPDDPAHASA LAAYSPYHRV ARDIAYPPAL FTTSTSDDRV
HPAHARKMVA RMQAQGHRNV WLIEKTDGGH GSADAIDTAE HEAIGYVFLW THLSRGAHDA
RE