Gene BURPS1106A_1664 details

Gene Information

Locus tag: BURPS1106A_1664
Symbol:
ID: 4902450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1617844
End bp: 1619400
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 72%
IMG OID: 640134893
Product: peptidase s1, chymotrypsin:pdz/dhr/glgf
Protein accession: YP_001065934
Protein GI: 126455207
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
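
The coordinate and length fields in this record are consistent with one another, which a little arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1617844, 1619400)
print(length_bp, protein_length(length_bp))  # 1557 518
```

Both results match the Gene Length and Protein Length fields reported for this gene.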


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.442424
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCTCATC AAACGCTTGG CCGCGCCCTG ATTCACGTTG CAGCGCTCGG GGCGGTGCTT 
GTCGGCTTCG CTTGCCTGCA GCCGACGCCA CTCGCCGCGG GTACGCTGCA ATCGCCGGCG
AAGGCGAAGC GAATCGCGCA GATCGGCGCC GCGGGGCCGG TCGATTTTCC CACGCTCGTC
GAGCGATACG GGCCCGCCGT CGTCAGCGTG AGCGTGCCTG CGCAGGATCC GCAGATGTCG
GCGTCCGGCC TCGAGGCGCT CGATCCCGAC GATCCGTTCT TCGCCTACTT CAAATCCGCC
GCCACGCAGC CCGCGCTGTC GCCGGAGAAC GGGCCGCGCG CGATGGCGGG CGCCGGATCC
GGTTTTATCG TCGGCGCCGA CGGGATCATC CTGACGACCG CGTACGTGGT CGGGCAGGCG
AGCGAGGCGA CGGTTCGCCT GATCGACCGG CGCGAATTCA AGGCGCGGGT GCTGGCCGTC
GATGATTCGA GCGATGTGGC CGTGTTGCAG ATCGACGCGA CGAAGCTGCC GACGGTGCGG
CTCGGCGATT CGTCCCGGGT GCGCACGGGC GAGCCGGTGC TGACGATCGG CACGCCGGAC
GGCTCGGCGA ACACGGTGAC GACGGGCATC GTCAGCGCGA CGGCGCGCAT GTTGCCCGAC
GGCGGCCGCT TTCCGTTCTT TCAGACCGAC GTGACCGGCA ACCTCGACAA CTCGGGCGGC
CCGGTGTTCA ACCGCGCGGG CGAGGTGATC GGCATCGATG TGCAGATCTA CGGCAGCGGC
GAGCGCAATC CGGGCGTGAC GTTTGCGATT CCGATCGACA TGGCGATGAA GGTGCGTGCG
CAGGTGCTGC AGGCGCAGCG CCAGGCGCGG CAGCAGGCGC AGCCGCCGAT GCAACAGGCG
CAACAGGCGC AACAAGCGCC GCCCGCGGCG GCGCAGAACG CGCTGGGCGT CGACGCGCAG
GACGTCGGTC CGGGGCTCGC GGCGGCGTTC GGCCTGCCGC GGCCGGCGGG CGCGCTTGTC
AATGCGGTGG AGCCGGGGTC GCCGGCGGCG GCGGTCGGGC TGAAGCCGGG CGACGTGATC
GTGCAGATCG GCGATCGGCC GCTCGGCCGC TCGGCGGAAC TGGCCGGCGA CCTCGCGGCG
CTGCCGCCCG GGGCGAGCGC GCCGATCACG CTGATCCGCA ACCGGATGCC GATGACGGTG
ATGCTCGGCT CCGGCGCGGC CGCGAGCGCG CCGACAGGCG CGACCGCATC GCCGGGCAAT
GCGGCCGCCG GCCGCAGCGA GACGGGCGGC GCGGACCGCC TGGGCCTGAC GATGCATCCG
CTGACGGACG ACGAGCGGCG CTCGACGGGA TTGCCCGTCG GCATGGTGGT CGATGCGGTG
CGCGGGCCGG CGGCGAACGC GGGGATTCGG CCGGGCGACG TCGTGCTGGA GCTCGACGAT
ACGCTGATCG AGACGCCGGA CATGGTGCCG GCGCTGGAGG CGAAGGCGGG GAAGGTGGTC
GCGGTGCTGA TTCAGCGGGG GAGCGAGCGC AGGTTCGTGT CGGTGAAGGC GCGGTGA
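
The GC content reported for this gene is the fraction of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted in the space-separated blocks of ten used on this page. The helper names are hypothetical, and the demonstration uses only the first ten bases rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block of the gene sequence, used only as a small demonstration.
print(gc_content("GTGTCTCATC"))  # 0.5
```

Applying `gc_content` to the complete 1557 bp sequence would give the gene-wide value to compare against the GC content field.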
 
Protein sequence
MSHQTLGRAL IHVAALGAVL VGFACLQPTP LAAGTLQSPA KAKRIAQIGA AGPVDFPTLV 
ERYGPAVVSV SVPAQDPQMS ASGLEALDPD DPFFAYFKSA ATQPALSPEN GPRAMAGAGS
GFIVGADGII LTTAYVVGQA SEATVRLIDR REFKARVLAV DDSSDVAVLQ IDATKLPTVR
LGDSSRVRTG EPVLTIGTPD GSANTVTTGI VSATARMLPD GGRFPFFQTD VTGNLDNSGG
PVFNRAGEVI GIDVQIYGSG ERNPGVTFAI PIDMAMKVRA QVLQAQRQAR QQAQPPMQQA
QQAQQAPPAA AQNALGVDAQ DVGPGLAAAF GLPRPAGALV NAVEPGSPAA AVGLKPGDVI
VQIGDRPLGR SAELAGDLAA LPPGASAPIT LIRNRMPMTV MLGSGAAASA PTGATASPGN
AAAGRSETGG ADRLGLTMHP LTDDERRSTG LPVGMVVDAV RGPAANAGIR PGDVVLELDD
TLIETPDMVP ALEAKAGKVV AVLIQRGSER RFVSVKAR