Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0811 |
Symbol | |
ID | 3845429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 949294 |
End bp | 951162 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637838114 |
Product | serine protease, kumamolysin |
Protein accession | YP_439008 |
Protein GI | 83716631 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.566172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGTGAAT CGTCGCGTCT CACGCGCCTT GCGCGCTGCG CGCGCCACCC GTGCGCATGC GCGTCTCGCG CGGCGCGCCG AACGTCGCGA ACGGCCGCCG CGAGCGCTGG AGACGCCGAC GCAGGATCGC CGCGCGTGCT TTACACTTGC TTTGCCGTAC TGTCGCACGG TCCCCGTGGC GGCGTGTGTG CGCCGTCCGA CATACCGCCC GGAACTGCGA TCGCGCGACC GTGTCCGAAT CCCGGCAACC CGTCATCACG GCTTCTGGAG GATCCGAATA TGGCAAGGCA TCTTCACGCC GGCAATGAAT CGCGTATCGT CGCCGAATCC ACGTGCATCG GTCCGTGCGA TCCGGCCGAG ACGATTCGCG TGATGGTGAT GTTGCGGCGA CAGGAAGAGC AGCACCTCGA TTCGCTGTTG CAAGGCCTCG CGAGCGGCGA TCCGAACGCG AAGCCGGTCT CGCGCGAGGC GTTCGCGCGG CGTTTCTGCG CGCATCCCGA CGACGTCAGG AAAGTCGAGG CGTTCGCGCA GCAGCGCGGC CTCGCGGTCG CGCGCGTCGA TCCGGTCGAA AGCCTCGTCG TGCTGACGGG CACGATCGCG CAGTTCGAGG CGGCGTTCGG CGTGAAGCTC GAGCGCTTCG AGCACCGGAC GGTCGGCCGG TATCGCGGCC GCACGGGCGA CATCACGCTG CCGGACGAGT TGCGCGACGT GGTCACCGCG GTGCTCGGGC TCGACGATCG CCCGCAGGCG CGGCCGCACT TCCGGCTGCG GCCGACCTTC CAGCCCGCGC GCGGCGCGGC CGTCACCTAC ACGCCGCCGC AGCTCGCGGC GCTCTACGAT TTTCCGCCCG GCGACGGCGC GGGCCAGTGC ATCGCGATCG TCGAGCTCGG CGGCGGCTAC CGGCCGGCCG AGATCCAGCG GTATTTCAGC GGCCTCGGGC TCGCGCAGCA GCCGAAGCTC GTCGACGTGA ACGTCGGCGC CGGCCGCAAC GCGCCGACGG GCGATCCGAG CGGGCCGGAC GGTGAAGTCG CGCTCGATAT CGAGATCGCG GGCGCGATCG CGCCCGGCGC GACGCTCGCC GTCTATTTCG CGCAGAACAG CGACGCCGGC TTCATCCAGG CGGTCAACCA GGCCGTGCAC GACACGACGA ACCGGCCGTC GGTCGTGTCG ATCAGCTGGG GCGCGGCGGA GGCGAGCTGG ACGTCGCAAT CGATTCAGGC GTTCAACCGC GTGCTGCAAT CGGCCGCGGC GCTCGGCGTG ACGGTGTGCG TGGCGTCCGG CGACGACGGC TCGAACGACG GCCTGCAGGA CGGCGCGAAC CACGTCGATT TCCCGGCGTC GAGCCCGTAT GCGCTCGCGT GCGGCGGCAC GCGGCTCGAC GCGCTGCCGG GGCAGGGCAT CCGCAGCGAG GTCGTGTGGA ACGACGAAGC GGCGGGCGGC GGCGCGACGG GCGGCGGCGT CAGCACCGTG TTCGATGCGC CGCAGTGGCA GAGCGGCCTG AGCGCGACGC TCGCGCGGGG CGGCGGCGCG GCGCCGCTCG CGAAGCGCGG CGTGCCGGAC GTCGCGGGGG ACGCGTCGCC CGCGACGGGC TACGAGGTGC TCGTCGCGGG CACGTCGACG GTGATGGGCG GCACGAGCGC CGTCGCGCCG CTGTGGGCCG CGCTCGTCGC GCGGATCAAC GCGGCGGCGG GCAGCCCGGC GGGCTGGATC AATCCGAAGC TGTACCGGAA CGCGGGAGCG TTGCACGACG TCTCGGTGGG CGAGAACGGC GCATATGCGG CGACGCCGGG CTGGGACGCG TGCACGGGGC TCGGCAGCCC GGACGGCGCG AAGGTCGCGG CGGCGCTGAA GAGCGGCGCG GCGGCCTGA
|
Protein sequence | MRESSRLTRL ARCARHPCAC ASRAARRTSR TAAASAGDAD AGSPRVLYTC FAVLSHGPRG GVCAPSDIPP GTAIARPCPN PGNPSSRLLE DPNMARHLHA GNESRIVAES TCIGPCDPAE TIRVMVMLRR QEEQHLDSLL QGLASGDPNA KPVSREAFAR RFCAHPDDVR KVEAFAQQRG LAVARVDPVE SLVVLTGTIA QFEAAFGVKL ERFEHRTVGR YRGRTGDITL PDELRDVVTA VLGLDDRPQA RPHFRLRPTF QPARGAAVTY TPPQLAALYD FPPGDGAGQC IAIVELGGGY RPAEIQRYFS GLGLAQQPKL VDVNVGAGRN APTGDPSGPD GEVALDIEIA GAIAPGATLA VYFAQNSDAG FIQAVNQAVH DTTNRPSVVS ISWGAAEASW TSQSIQAFNR VLQSAAALGV TVCVASGDDG SNDGLQDGAN HVDFPASSPY ALACGGTRLD ALPGQGIRSE VVWNDEAAGG GATGGGVSTV FDAPQWQSGL SATLARGGGA APLAKRGVPD VAGDASPATG YEVLVAGTST VMGGTSAVAP LWAALVARIN AAAGSPAGWI NPKLYRNAGA LHDVSVGENG AYAATPGWDA CTGLGSPDGA KVAAALKSGA AA
|
| |