Gene Information |
Locus tag | BTH_II0646 |
Symbol | |
ID | 3844957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 754925 |
End bp | 756613 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637837951 |
Product | serine protease |
Protein accession | YP_438845 |
Protein GI | 83717912 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.23311 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGGCCGC TCGCGCTCGC CGCAGGCGTC GCACACGGCG CGACGGACTG GGTCGATACG CACACCAAAG CTTTCCTGAA TCATGCGCAG ATCGAGACGC TCGCCCGCAG CGCGAACACC GCGTCGCTCG AGCTCGCCTC CGGCGAAGCC ACGCACGTCG TGGTCAGCCT GAAGCTGCGC AACGCCGAGC AGTTGAAAGC CGTCGCGCGC AACGTCAGCG ATCCGCATAG CTCTCAGTAC CGGCAATACA TCACGAGCGC GCAGTTCCTG TCGAACTACG CGCCGACCAA AGCGCAGGTG AAGCAGGTTG TCGACTATCT GCGCAAGAAC GGCTTCGTCG ACATCCGCGT CGCGCCGAAC CGCATGCTCG TCTCCGCGCG CGGGAGCGCG GGCACGGTCA AGCAGGCCTT CAACACGTCT CTCGTCCACT TCGAATACGC GGGCCGCGCG GGCTTCGCCA ACGCGTCGAC GGCGCAAGTG CCGCGCGCGC TCGGCGACAT CGTCGGCTCG GTGCTCGGCC TGCAGAACGT CGCGCGCGCA CGGCCGCTCA CGAAGATCGG CGCGATCGCG AAACCCCTCT CGCTCGCGTC CGGCACGGCG ACGGGTCACT ATCCGTCCGA GTTTCCGGCG CTCTATAACG CGACGGGCGT GCCGAACGCG GCGAACACGA CGGTCGGCAT CATGACGATC GGCGGCGTGT CGCAGGCGCT GTCGGATCTG CAGCAGTTCA CGAGCGCGAA CGGCTATCCG GACGTGTCGA CGCAGACCGT GCAGACCAAC GGTTCCGGCG GCGACTACAG CGACGATCAG GAAGGCCAGG GCGAATGGGA TCTCGACAGC CAGTCGATCG TCGGCGCCGC GGGCGGCCAG GTCGGACAAC TGATCTTCTA CATGGCCGAC CTCAGCGCGT CGGGCAACAC CGGCCTCACG CAGGCGTTCA ACCAGGCCGT ATCGGACAAC ACCGCGAAAG TGATCAACGT GTCGCTCGGC TGGTGCGAAA CCGACGCGAA CGCGGACGGC ACGCTTTCGG CCGAAGAGCA GATCTTCACG CAGGCGGTCG CGCAAGGCCA GACGTTCGCG GTGTCGTCGG GCGACGAGGG AGTCTATGAG TGCAACAACC GCGGCTATCC CGATGGTTCG AACTACTCGG TATCGTGGCC GGCGTCTTCG CCGCATGTGC TCGCGATCGG CGGCACGACG CTCTACACGT CGTCGTCGGG CGCATTCTCG AACGAAACGG TATGGAACGA AGGGCTCGAC GGCAACGGCA AGCTGTGGGC GACGGGCGGC GGCGTCAGCA CGATTCTGCC GAATCCGTCA TGGCAATCGG GCAGCAACCG CAAGCTGCCC GACGTGTCGT TCGACGCCGC GCAAAGCACG GGCGCGTATA TCTACAACTA CGGCCAGTTG CAGCAGATTG GCGGCACGAG CCTGTCGGCG CCGATCTTCA CCGGCTTCTG GGCCCGGCTC CTGTCGGCGA ACGGCGCGAG CCTCGGCTTC CCGGCCGCGC GCTTTTACCA TTCGATTCCG ACGCATGCGT CGCTCGTGCG CTATGACGTC ACGTCCGGCA ACAACGGCTA TCAGGGTTAC GGCTTCACCG CGTCGAAGGG CTGGGACTAC CCGACCGGCT GGGGCAGCAT CAACATCTCG AACCTGAATC AGTTGATCCA GTCGGGCGGC TTCAATTGA
|
Protein sequence | MWPLALAAGV AHGATDWVDT HTKAFLNHAQ IETLARSANT ASLELASGEA THVVVSLKLR NAEQLKAVAR NVSDPHSSQY RQYITSAQFL SNYAPTKAQV KQVVDYLRKN GFVDIRVAPN RMLVSARGSA GTVKQAFNTS LVHFEYAGRA GFANASTAQV PRALGDIVGS VLGLQNVARA RPLTKIGAIA KPLSLASGTA TGHYPSEFPA LYNATGVPNA ANTTVGIMTI GGVSQALSDL QQFTSANGYP DVSTQTVQTN GSGGDYSDDQ EGQGEWDLDS QSIVGAAGGQ VGQLIFYMAD LSASGNTGLT QAFNQAVSDN TAKVINVSLG WCETDANADG TLSAEEQIFT QAVAQGQTFA VSSGDEGVYE CNNRGYPDGS NYSVSWPASS PHVLAIGGTT LYTSSSGAFS NETVWNEGLD GNGKLWATGG GVSTILPNPS WQSGSNRKLP DVSFDAAQST GAYIYNYGQL QQIGGTSLSA PIFTGFWARL LSANGASLGF PAARFYHSIP THASLVRYDV TSGNNGYQGY GFTASKGWDY PTGWGSINIS NLNQLIQSGG FN
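
The length fields in this record can be cross-checked against the coordinates and sequences. The following is a minimal sketch, not part of the database's own tooling. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence ends with a stop codon (here TGA), so the protein is one residue shorter than the codon count. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace between blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    length = gene_length(754925, 756613)
    print(length)                  # Gene Length field: 1689 bp
    print(protein_length(length))  # Protein Length field: 562 aa
    # First two 10-base blocks of the gene sequence only, not the whole gene.
    print(gc_percent("GTGTGGCCGC TCGCGCTCGC"))
```

Passing the complete gene sequence to gc_percent gives a value to compare with the GC content field. The coordinate and length checks do not depend on the strand, since the minus strand changes only the direction in which the gene is read.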