Gene BURPS1106A_0496 details

Gene Information

Locus tag: BURPS1106A_0496
Symbol:
ID: 4900921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 462180
End bp: 463772
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 66%
IMG OID: 640133726
Product: carboxyl-terminal protease
Protein accession: YP_001064779
Protein GI: 126454825
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
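
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The helper names `span_length` and `protein_length` are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 462180
end_bp = 463772
listed_gene_length_bp = 1593
listed_protein_length_aa = 530


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


computed_length = span_length(start_bp, end_bp)
computed_protein = protein_length(computed_length)

print(computed_length == listed_gene_length_bp)
print(computed_protein == listed_protein_length_aa)
```

Under these assumptions both comparisons hold for the values listed above.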


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTATGA AATTGAAGAA CATCGGCCTG ATTGCCGCGG GCCTCGCGAC TGGCGTCTTC 
GCGACGCTGC AAATCTCCGC GTCGGCCCAG CAGGCCGTCA CGACGGCCGC CGCGCCGCTG
CCGCTCGACC AGTTGCGGCT CTTCGCCGAA GTGTTCGGGC AGATCAAGCG CGAATACGTC
GAGCCCGTCG ACGACAAGAA GCTGCTGACC GCGGCGATCA AGGGCATGGT GTCGAGCCTC
GATCCGCACT CGTCGTACCT CGACAAGACC GATTACCAGG AACTGCAGGA GCAGACGAAG
GGCCGCTTCG CCGGCCTCGG CATCGAGATT TCGCAGGAAG ACGGCCTCGT CAAGGTGATC
TCGCCGATCG AGGACACGCC CGCGTTCCGC GCCGGCATCC GTCCGGGCGA CCTGATCACC
CGCATCAACG ATCGCCCGGT GCGCGGCATG ACGCTCGACA AGGCGGTCAA GCAGATGCGC
GGCGAGCCCG GCACGAAGGT CACGCTGACG ATCTTCCGCA AGAGCGACGA CCGCACGTTC
CCCGTCACGG TCACGCGCGC GGTGATCCGC GTGCAGAGCG TGAAGATGAA GCTGCTCGAT
CCGGGCTACG CGTACATCCG CATCACGAGC TTCCAGGAGC GCACGACGCC CGATCTCGCC
GCGAAGCTGC AGGACATCGC GCGCCAGCAG CCGAACCTGA AGGGCCTGAT CCTCGATCTG
CGCAACAACG GCGGCGGCCT GCTGCAAAGC GCCGTCGGCG TCGCGGGCGC GTTCCTGCCT
CCGGATTCCG TCGTCGTGTC GACGAACGGC CAGATCCCCG ATTCGAAGCA GATCTACCGC
GACAACTACG AGAACTACCG CCTGCCGTCG TTCGACTCCG ATCCGCTGAA GAACCTGCCC
GCCGTCTTCA AGACGGTGCC GATGATCGTG CTGACGAACG CGTATTCGGC GTCGGCCTCG
GAGATCGTCG CGGGCGCGCT GCAGGATTCG CACCGTGCGG TGATCATGGG CAAGGCGACG
TTCGGCAAGG GCTCGGTGCA GACGGTGCGG CCGATGACGG CCGATTCCGC GCTGCGCCTG
ACGACCGCGT ACTACTACAC GCCGAGCGGC CGCTCGATCC AGAACAAGGG CATCCTGCCC
GACATTCCGG TCGATCAGTA CGCGGACGGC GATCCGGACG ACGTGCTCGT CACGCGCGAG
GTCGATTACA CGAACCACCT CGCGAACACG CAGGATCCGA ACGAGAAGAA GGAGCTCGAG
GAACGCGAGC AGCGCCGGAT GGAGCAGTTG CGCATCCTCG AGGAGCAGAA CGACAAGAAG
ACGCCCGAGC AGCGTCAGAA GGATCGCGAG CGCAAGCCGA TCGAATTCGG CAGCGCCGAC
GATTTCATGA TGCAGCAGGC GCTCAACAAG CTCGAAGGCA AGCCGGTCGA GCAGTCGAAG
ATGATCGCCG CCGACAGCAC CGCGAAGAGC GCCGCCGCCA AGGCGGGCAC CGCCTCGGCG
GCGAAGGGCG CGTCGGGCGC GGCGGCCAAG CCCGCGTCGG CTGCCAAGCC CGCGTCGGCA
GCCAAGCCGG TGTCGGCGCC GCAACCGCAG TAA
 
Protein sequence
MRMKLKNIGL IAAGLATGVF ATLQISASAQ QAVTTAAAPL PLDQLRLFAE VFGQIKREYV 
EPVDDKKLLT AAIKGMVSSL DPHSSYLDKT DYQELQEQTK GRFAGLGIEI SQEDGLVKVI
SPIEDTPAFR AGIRPGDLIT RINDRPVRGM TLDKAVKQMR GEPGTKVTLT IFRKSDDRTF
PVTVTRAVIR VQSVKMKLLD PGYAYIRITS FQERTTPDLA AKLQDIARQQ PNLKGLILDL
RNNGGGLLQS AVGVAGAFLP PDSVVVSTNG QIPDSKQIYR DNYENYRLPS FDSDPLKNLP
AVFKTVPMIV LTNAYSASAS EIVAGALQDS HRAVIMGKAT FGKGSVQTVR PMTADSALRL
TTAYYYTPSG RSIQNKGILP DIPVDQYADG DPDDVLVTRE VDYTNHLANT QDPNEKKELE
EREQRRMEQL RILEEQNDKK TPEQRQKDRE RKPIEFGSAD DFMMQQALNK LEGKPVEQSK
MIAADSTAKS AAAKAGTASA AKGASGAAAK PASAAKPASA AKPVSAPQPQ
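
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts), and it uses only the first 60 bases of the gene sequence listed above. The function names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in NCBI order: bases indexed T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in reading frame 1; stop codons become '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        residues.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(residues)


# First line of the gene sequence in this record (60 bases).
first_block = "ATGCGTATGA AATTGAAGAA CATCGGCCTG ATTGCCGCGG GCCTCGCGAC TGGCGTCTTC"

peptide = translate(first_block)
print(peptide)
print(round(gc_fraction(first_block), 3))
```

The translated peptide corresponds to the first 20 residues of the protein sequence listed above (MRMKLKNIGL IAAGLATGVF). The GC fraction printed here covers only these 60 bases, so it is not expected to equal the 66% reported for the whole gene.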