Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0853 |
Symbol | |
ID | 4902410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 833854 |
End bp | 835341 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640134083 |
Product | serine protease |
Protein accession | YP_001065134 |
Protein GI | 126453300 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACCC GAATCCTTGC ACGTGGCGCA GTTGCCGTGG CTGTCGCCGC GGCGTTGTCG GCAGGCTATG TGGCGGGCAC CCGCCGGGCG GAGCCGCAGA TCATCACGCC GGCGGTCGCC GCGCTGATGC CGGCCGAGGC GGCCGCGAAG ACGGGCATCC CCGATTTTTC CGGGCTGGTC GAGACCTACG GGCCGGCCGT CGTGAACATC AGCGCGAAGC ACGTCGTGCA GCGCGCCGCG CAGCGTCGCG CGGCACCGCA GTTGCCGATC GACCCGGACG ATCCGTTCTA TCAATTCTTC CGACATTTCT ACGGGCAGAT TCCCGGGATG GGCGGCGGCC GCCAGCCGCA GCCGGACGAC CAGCCGAGCA CGAGCCTCGG CTCCGGCTTC ATCATCAGCG CCGACGGGTA TATCCTGACT AACGCGCACG TGATCGACGG TGCGAACGTC GTGACCGTGA AGCTCACCGA CAAGCGCGAG TACAAGGCGA AGGTCGTCGG CGCCGACAAG CAGTCCGACG TCGCGGTGCT GAAGATCGAC GCTTCGGGCC TGCCGATCGT GAAGATCGGC GATCCGGCGC AGAGCAAGGT CGGCCAGTGG GTCGTCGCGA TCGGCTCGCC GTACGGGTTC GACAACACGG TCACCTCGGG CATCATCAGC GCGAAGTCGC GTGCGTTGCC CGACGAGAAC TACACGCCGT TCATCCAGAC CGACGTGCCC GTGAACCCCG GCAACTCGGG CGGCCCGCTG TTCAACCTGA ACGGCGAGGT GATCGGCATC AACTCGATGA TCTACTCGCA GACGGGCGGC TTCCAGGGGC TGTCGTTCGC GATCCCGATC AACGAGGCGA TGAAGGTGAA GGACGAGCTC GTGAAGACGG GCCACGTGAG CCGCGGCCGG CTCGGCGTCG CCGTGCAGGG GCTCAATCAG ACGCTCGCGA GTTCGTTCGG CTTGCAAAAG CCCGACGGCG CGCTCGTCAG CTCGGTCGAT CCGAAGGGGC CGGCCGCGAA GGCCGGGCTG CAGCCGGGCG ACGTGATCCT CGCGGTCGAC GGCGTGCCGG TTCAGGATTC GTCGACGCTG CCCGCGCAGA TCGCGGGCAT GAAGCCGGGC ACGAAGGCCG ATCTGCAGAT CTGGCGCGAC AAGTCGAGGA AGACGGTATC GGTGACGCTC GCGTCGCTCG CCGACGATCA GGCGAAGGCG GGCGCCGACG AGCCCGTCGA GCAGGGGCGG CTCGGCGTCG CGGTGCGCCC GCTGTCGCCG CGCGAGCGCA ACGGCTCGTC TCTCACGCAC GGTCTGGTCG TCCAGCAATC GGCGGGGCCC GCCGCGAGCG CGGGCATCCA GCCCGGCGAC GTGATTCTCG CGGTGAACGG GCGGCCCGTC ACGAGCGCCG AACAATTGCG CGACGCGGTC AAGCGCGCGG GCAACAGTCT TGCGCTGCTG ATCCAGCGTG ACGATGCCCA GATTTTCGTG CCGGTCGATC TGGGCTGA
|
Protein sequence | MTTRILARGA VAVAVAAALS AGYVAGTRRA EPQIITPAVA ALMPAEAAAK TGIPDFSGLV ETYGPAVVNI SAKHVVQRAA QRRAAPQLPI DPDDPFYQFF RHFYGQIPGM GGGRQPQPDD QPSTSLGSGF IISADGYILT NAHVIDGANV VTVKLTDKRE YKAKVVGADK QSDVAVLKID ASGLPIVKIG DPAQSKVGQW VVAIGSPYGF DNTVTSGIIS AKSRALPDEN YTPFIQTDVP VNPGNSGGPL FNLNGEVIGI NSMIYSQTGG FQGLSFAIPI NEAMKVKDEL VKTGHVSRGR LGVAVQGLNQ TLASSFGLQK PDGALVSSVD PKGPAAKAGL QPGDVILAVD GVPVQDSSTL PAQIAGMKPG TKADLQIWRD KSRKTVSVTL ASLADDQAKA GADEPVEQGR LGVAVRPLSP RERNGSSLTH GLVVQQSAGP AASAGIQPGD VILAVNGRPV TSAEQLRDAV KRAGNSLALL IQRDDAQIFV PVDLG
|
| |