Gene BURPS1106A_0853 details


Gene Information

Locus tag: BURPS1106A_0853
Symbol:
ID: 4902410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 833854
End bp: 835341
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 68%
IMG OID: 640134083
Product: serine protease
Protein accession: YP_001065134
Protein GI: 126453300
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
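
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are made up for this example, and it assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon, which is consistent with the values listed here.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 833854
end_bp = 835341
gene_length_bp = 1488
protein_length_aa = 495

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is assumed
# to be the stop codon and does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 1488 0 495
```

Under these assumptions the computed span matches the Gene Length field and the codon count minus one matches the Protein Length field.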


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTACCC GAATCCTTGC ACGTGGCGCA GTTGCCGTGG CTGTCGCCGC GGCGTTGTCG 
GCAGGCTATG TGGCGGGCAC CCGCCGGGCG GAGCCGCAGA TCATCACGCC GGCGGTCGCC
GCGCTGATGC CGGCCGAGGC GGCCGCGAAG ACGGGCATCC CCGATTTTTC CGGGCTGGTC
GAGACCTACG GGCCGGCCGT CGTGAACATC AGCGCGAAGC ACGTCGTGCA GCGCGCCGCG
CAGCGTCGCG CGGCACCGCA GTTGCCGATC GACCCGGACG ATCCGTTCTA TCAATTCTTC
CGACATTTCT ACGGGCAGAT TCCCGGGATG GGCGGCGGCC GCCAGCCGCA GCCGGACGAC
CAGCCGAGCA CGAGCCTCGG CTCCGGCTTC ATCATCAGCG CCGACGGGTA TATCCTGACT
AACGCGCACG TGATCGACGG TGCGAACGTC GTGACCGTGA AGCTCACCGA CAAGCGCGAG
TACAAGGCGA AGGTCGTCGG CGCCGACAAG CAGTCCGACG TCGCGGTGCT GAAGATCGAC
GCTTCGGGCC TGCCGATCGT GAAGATCGGC GATCCGGCGC AGAGCAAGGT CGGCCAGTGG
GTCGTCGCGA TCGGCTCGCC GTACGGGTTC GACAACACGG TCACCTCGGG CATCATCAGC
GCGAAGTCGC GTGCGTTGCC CGACGAGAAC TACACGCCGT TCATCCAGAC CGACGTGCCC
GTGAACCCCG GCAACTCGGG CGGCCCGCTG TTCAACCTGA ACGGCGAGGT GATCGGCATC
AACTCGATGA TCTACTCGCA GACGGGCGGC TTCCAGGGGC TGTCGTTCGC GATCCCGATC
AACGAGGCGA TGAAGGTGAA GGACGAGCTC GTGAAGACGG GCCACGTGAG CCGCGGCCGG
CTCGGCGTCG CCGTGCAGGG GCTCAATCAG ACGCTCGCGA GTTCGTTCGG CTTGCAAAAG
CCCGACGGCG CGCTCGTCAG CTCGGTCGAT CCGAAGGGGC CGGCCGCGAA GGCCGGGCTG
CAGCCGGGCG ACGTGATCCT CGCGGTCGAC GGCGTGCCGG TTCAGGATTC GTCGACGCTG
CCCGCGCAGA TCGCGGGCAT GAAGCCGGGC ACGAAGGCCG ATCTGCAGAT CTGGCGCGAC
AAGTCGAGGA AGACGGTATC GGTGACGCTC GCGTCGCTCG CCGACGATCA GGCGAAGGCG
GGCGCCGACG AGCCCGTCGA GCAGGGGCGG CTCGGCGTCG CGGTGCGCCC GCTGTCGCCG
CGCGAGCGCA ACGGCTCGTC TCTCACGCAC GGTCTGGTCG TCCAGCAATC GGCGGGGCCC
GCCGCGAGCG CGGGCATCCA GCCCGGCGAC GTGATTCTCG CGGTGAACGG GCGGCCCGTC
ACGAGCGCCG AACAATTGCG CGACGCGGTC AAGCGCGCGG GCAACAGTCT TGCGCTGCTG
ATCCAGCGTG ACGATGCCCA GATTTTCGTG CCGGTCGATC TGGGCTGA
 
Protein sequence
MTTRILARGA VAVAVAAALS AGYVAGTRRA EPQIITPAVA ALMPAEAAAK TGIPDFSGLV 
ETYGPAVVNI SAKHVVQRAA QRRAAPQLPI DPDDPFYQFF RHFYGQIPGM GGGRQPQPDD
QPSTSLGSGF IISADGYILT NAHVIDGANV VTVKLTDKRE YKAKVVGADK QSDVAVLKID
ASGLPIVKIG DPAQSKVGQW VVAIGSPYGF DNTVTSGIIS AKSRALPDEN YTPFIQTDVP
VNPGNSGGPL FNLNGEVIGI NSMIYSQTGG FQGLSFAIPI NEAMKVKDEL VKTGHVSRGR
LGVAVQGLNQ TLASSFGLQK PDGALVSSVD PKGPAAKAGL QPGDVILAVD GVPVQDSSTL
PAQIAGMKPG TKADLQIWRD KSRKTVSVTL ASLADDQAKA GADEPVEQGR LGVAVRPLSP
RERNGSSLTH GLVVQQSAGP AASAGIQPGD VILAVNGRPV TSAEQLRDAV KRAGNSLALL
IQRDDAQIFV PVDLG
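
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done in Python; `clean_blocks` and `gc_percent` are hypothetical helper names introduced for this example, not part of any database tool, and only the first and last lines of the gene sequence are used as an excerpt.

```python
def clean_blocks(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence above, used as a short excerpt.
first_line = "ATGACTACCC GAATCCTTGC ACGTGGCGCA GTTGCCGTGG CTGTCGCCGC GGCGTTGTCG"
last_line = "ATCCAGCGTG ACGATGCCCA GATTTTCGTG CCGGTCGATC TGGGCTGA"

first = clean_blocks(first_line)
last = clean_blocks(last_line)

print(first[:3])              # first codon of the excerpt
print(last[-3:])              # last codon of the excerpt
print(len(first), len(last))  # number of bases on each line after joining
```

Passing the full joined gene sequence to `gc_percent` would give a figure that can be compared with the GC content field of the record.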