Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I0675 |
Symbol | |
ID | 3850109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 775210 |
End bp | 776697 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637840348 |
Product | serine protease |
Protein accession | YP_441231 |
Protein GI | 83720454 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACCC GAATCCTTGC GCGTGGCGCA GTTGCCGTGG CCGTCGCCGC GGCGTTGTCG GCGGGGTATG TGGCGGGCAC CCGCCATGCG GAACCGCAGA TCATCACACC GGCCGTCGCC GCGCTGATGC CGGCCGAGGC AGCCGCGAAG ACGGGCATCC CCGACTTTTC CGGCCTGGTC GAGACCTACG GGCCGGCCGT CGTGAACATC AGCGCGAAGC ACGTCGTGCA GCGCGTCGCG CAGCGCCGCG CAGCGCCGCA ATTGCCGATC GATCCGGAAG ATCCGTTCTA CCAATTCTTC CGACATTTCT ACGGGCAGGT TCCCGGGATG GGCGGTGGCC GCCAGCCGCA GCCGGACGAC CAGCCGAGCA CGAGCCTGGG CTCCGGGTTC ATCATCAGCG CGGACGGGTA TATCCTGACC AACGCGCACG TGATCGACGG CGCGAACGTC GTCACGGTGA AGCTCACCGA CAAGCGCGAG TACAAGGCGA AGGTCGTCGG CACCGACAAG CAATCCGACG TCGCGGTGCT GAAGATTGAC GCGTCGGGCC TGCCGACCGT GAAGATCGGC GATCCGGCGC AGAGCAAGGT CGGCCAGTGG GTCGTCGCGA TCGGTTCGCC GTACGGGTTC GACAACACGG TCACGTCGGG CATCATCAGC GCGAAGTCGC GCGCGCTGCC CGACGAGAAC TACACGCCGT TCATCCAGAC CGACGTGCCG GTGAACCCCG GCAACTCGGG CGGTCCGCTC TTCAACCTGA ACGGCGAGGT GATCGGCATC AACTCGATGA TCTACTCGCA GACGGGCGGC TTCCAGGGCC TGTCGTTCGC GATCCCGATC AACGAGGCGA TGAAGGTGAA GGACGAGCTC GTGAAGACGG GCCACGTGAG CCGCGGCCGG CTCGGCGTCG CGGTGCAGGG GCTCAACCAG ACGCTCGCGA GCTCGTTCGG CCTGCAAAAG CCCGACGGCG CGCTCGTCAG CTCGGTCGAC CCGAAGGGGC CGGCCGCGAA GGCGGGGCTG CAGCCGGGCG ACGTGATCCT CGCGGTCGAC GGCGTGCCGG TTCAGGACTC GACGACGTTG CCCGCGCAGA TCGCGAGCAT GAAGCCGGGC ACGAAGGCCG ACCTGCAGAT CTGGCGAGAC AAGTCGAAGA AGACAGTGTC GGTGACGCTT GCGTCGCTGG CCGACGATCA GGCGAAGGCG GGCGCCGACG AGCCCGTCGA GCAGGGACGC CTGGGCGTCG CGGTGCGGCC GCTTTTGCCG CGCGAGCGCA ACGGCACGTC GCTCACGCAC GGCCTCGTCG TCCAGCAGTC GACGGGCCCC GCCGCAAGCG CGGGCATCCA GCCGGGCGAC GTGATCCTCG CGGTGAACGG GCGGCCCGTC ACGAGCGCCG AACAATTGCG CGACGCGGTC AAGCGCGCGG GCAACAGCCT TGCGCTGCTG ATCCAGCGCG ACGACGCCCA GATTTTCGTG CCGGTCGATC TGGGCTGA
|
Protein sequence | MTTRILARGA VAVAVAAALS AGYVAGTRHA EPQIITPAVA ALMPAEAAAK TGIPDFSGLV ETYGPAVVNI SAKHVVQRVA QRRAAPQLPI DPEDPFYQFF RHFYGQVPGM GGGRQPQPDD QPSTSLGSGF IISADGYILT NAHVIDGANV VTVKLTDKRE YKAKVVGTDK QSDVAVLKID ASGLPTVKIG DPAQSKVGQW VVAIGSPYGF DNTVTSGIIS AKSRALPDEN YTPFIQTDVP VNPGNSGGPL FNLNGEVIGI NSMIYSQTGG FQGLSFAIPI NEAMKVKDEL VKTGHVSRGR LGVAVQGLNQ TLASSFGLQK PDGALVSSVD PKGPAAKAGL QPGDVILAVD GVPVQDSTTL PAQIASMKPG TKADLQIWRD KSKKTVSVTL ASLADDQAKA GADEPVEQGR LGVAVRPLLP RERNGTSLTH GLVVQQSTGP AASAGIQPGD VILAVNGRPV TSAEQLRDAV KRAGNSLALL IQRDDAQIFV PVDLG
|
| |