Gene BURPS668_1641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1641 
Symbol 
ID4884144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1609357 
End bp1610913 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content72% 
IMG OID640127568 
Productpeptidase s1, chymotrypsin:pdz/dhr/glgf 
Protein accessionYP_001058681 
Protein GI126438778 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.554784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTCGTC AAACGCTTGG CCGCGCCCTG ATTCACGTTG CAGCGCTCGG GGCGGTGCTT 
GTCGGCTTCG CTTGCCTGCA GCCGACGCCA CTCGCCGCGG GTACGCTGCA ATCGCCGGCG
AAGGCGAAGC GAATCGCGCA GATCGGCGCC GCGGGGCCGG TCGATTTTCC CACGCTCGTC
GAGCGATACG GGCCCGCCGT CGTCAGCGTG AGCGTGCCTG CGCAGGATCC GCAGATGTCG
GCGTCCGGCC TCGAGGCGCT CGATCCCGAC GATCCGTTCT TCGCCTACTT CAAATCCGCC
GCCACGCAGC CCGCGCTGTC GCCGGAGAGC GGGCCGCGCG CGATGGCGGG CGCCGGATCC
GGTTTTATCG TCGGCGCGGA CGGGATCATC CTGACGACCG CGTACGTGGT CGGGCAGGCG
AGCGAGGCGA CGGTTCGCCT GATCGACCGG CGCGAATTCA AGGCGCGGGT GCTGGCCGTC
GATGATTCGA GCGATGTGGC CGTGTTGCAG ATCGACGCGA CGAAGCTGCC GACGGTGCGG
CTCGGCGATT CGTCCCGGGT GCGCACGGGC GAGCCGGTGC TGACGATCGG CACGCCGGAC
GGCTCGGCGA ACACGGTGAC GACGGGCATC GTCAGCGCGA CGGCGCGCAT GTTGCCCGAC
GGCGGCCGCT TTCCGTTCTT TCAGACCGAC GTGACCGGCA ACCTCGACAA CTCGGGCGGC
CCGGTGTTCA ACCGCGCGGG CGAGGTGATC GGCATCGACG TGCAGATCTA CGGCAGCGGC
GAGCGCAATC CGGGCGTGAC GTTTGCGATT CCGATCGACA TGGCGATGAA GGTGCGTGCG
CAGGTGCTGC AGGCGCAGCG CCAGGCGCGA CAGCAGGCGC AGCCGCCGAT GCAACAGGCG
CAACAGGCGC AACAAGCGCC GCCCGCGGCG GCGCAGAACG CGCTGGGCGT CGACGCGCAG
GACGTCGGTC CGGGGCTCGC GGCGGCGTTC GGCCTGCCGC GGCCGGCGGG CGCGCTTGTC
AATGCAGTGG AGCCGGGGTC GCCGGCGGCG GCGGTCGGGC TGAAGCCGGG CGACGTGATC
GTGCAGATCG GCGATCGGCC GCTCGGCCGC TCGGCGGAAC TGGCCGGCGA CCTCGCGGCG
CTGCCGCCCG GGGCGAGCGC GCCGATCACG CTGATCCGCA ACCGGATGCC GATGACGGTG
ATGCTCGGCT CCGGCGCGGC CGCGAGCGCG CCGACAGGCG CGACCGCATC GCCGGGCAAT
GCGGCCGCCG GCCGCAGCGA GACGGGCGGC GCGGACCGCC TGGGCCTGAC GATGCATCCG
CTGACGGACG ACGAGCGGCG CTCGACGGGA TTGCCCGTCG GCATGGTGGT CGATGCGGTG
CGCGGGCCGG CGGCGAACGC GGGGATTCGG CCGGGCGACG TCGTGCTGGA GCTCGACGAT
ACGCTGATCG AGACGCCGGA CATGGTGCCG GCGCTGGAGG CGAAGGCGGG GAAGGTGGTT
GCGGTGCTGA TTCAGCGGGG GAGCGAGCGC AGGTTCGTGT CGGTGAAGGC GCGGTGA
 
Protein sequence
MSRQTLGRAL IHVAALGAVL VGFACLQPTP LAAGTLQSPA KAKRIAQIGA AGPVDFPTLV 
ERYGPAVVSV SVPAQDPQMS ASGLEALDPD DPFFAYFKSA ATQPALSPES GPRAMAGAGS
GFIVGADGII LTTAYVVGQA SEATVRLIDR REFKARVLAV DDSSDVAVLQ IDATKLPTVR
LGDSSRVRTG EPVLTIGTPD GSANTVTTGI VSATARMLPD GGRFPFFQTD VTGNLDNSGG
PVFNRAGEVI GIDVQIYGSG ERNPGVTFAI PIDMAMKVRA QVLQAQRQAR QQAQPPMQQA
QQAQQAPPAA AQNALGVDAQ DVGPGLAAAF GLPRPAGALV NAVEPGSPAA AVGLKPGDVI
VQIGDRPLGR SAELAGDLAA LPPGASAPIT LIRNRMPMTV MLGSGAAASA PTGATASPGN
AAAGRSETGG ADRLGLTMHP LTDDERRSTG LPVGMVVDAV RGPAANAGIR PGDVVLELDD
TLIETPDMVP ALEAKAGKVV AVLIQRGSER RFVSVKAR