Gene Information |
Locus tag | BTH_I2658 |
Symbol | |
ID | 3849575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 3029475 |
End bp | 3030983 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637842327 |
Product | protease signal peptide protein |
Protein accession | YP_443173 |
Protein GI | 83719609 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
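The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the Gene Information table (assumed 1-based, inclusive).
START_BP = 3029475
END_BP = 3030983


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints "1509 502"
```

Both results match the Gene Length (1509 bp) and Protein Length (502 aa) rows.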
| |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTCGTC AAACGTTTGG CCGCGCCCTG ATTCATGCCG CAGTGCTCGG GGCGGCGCTC GCCGGCTTTG CGTGCCTGCG GCCGGCGGAG GTCGCAGCGG GCACGCTGTC TCCGTCCGCG AAGGCGAAGC GGATCGCGCA GCCCGGCCCC GCAGGGCCGA TCGATTTCCC CACGCTCGTC GAGCGATACG GACCCGCGGT GGTCAGCGTG AGCGTGCCCG CCCAGGACCC GCAGATGTCG GCGTCCGGTC TCGAGGCGCT CGATCCCGAC GATCCGTTCT TCGCGTACTT CAAGTCCGCC GCGATGCCGC CGCCCGCGTC GCAGGACAAC GTACCGCGCG CGATGGCGGG CGCCGGGTCC GGTTTCATCG TCAGCGCCGA CGGGCTCATC CTGACGACCG CCTACGTGGT CGGGCAGGCG AGCGAAGCGA CGGTCCGGCT GATCGACCGG CGCGAATTCA AGGCGCGGGT GCTCGCGGTC GACGACCAGA GCGACGTGGC CGTGCTGCAG ATCGACGCGA CGAAGTTGCC GACCGTGCGG CTCGGCGATT CGTCGCGCGT GCGCGTCGGC GAGCCCGTGC TGACGATCGG CACGCCCGAC GGCTCGGCGA ACACTGTGAC GACGGGCATC GTCAGCGCGA CGTCGCGCAC GCTGCCCGAC GGCAGCCGTT TCCCGTTCTT CCAGACCGAC GTGACCGGCA ACCTCGACAA CTCGGGCGGT CCGGTGTTCA ACCGCGCGGG CGAGGTGATC GGCATCGACG TGCAGATCTA CGGAAGCGGC GATCGCAATC CGGACGTGAC GTTCGCGATC CCGATCGGCA TGGCGGCCAA GGTGCGCGCG CAGGTGCTGC AGGCGCAGCC GCCTGCTCAG CCGGCGCAGC GCGCGGCTGC GCAAAACGGG CTTGGCGTCG ACGTGCAGGA CGTGGGCCCC GGGCTCGCGG CCGCGTTCGG CTTGCCGCGG CCGGCGGGCG CGCTCGTCAA CGCGGTCGAG CCGGGGTCGC CCGCGGCCGC GGTCGGCCTG AAGCCGGGCG ACGTGATCGT GCAGGTGGGC GACCGGCCGC TCGGCCGCTC GTCCGAGCTC GCCGCCGACG TGGCGGCGCT GCCGCCTGCG GCGAGCGTGC CGATCACGCT GGTCCGCAAC CGGATGCCGA TGACGGTGAT GCTCGGCGCG GGCGCGGCGG CGGGCGCATC GCCGGCGGCG GCCTCGGCGA ACGCGGGCGC GGGCAGCAGC GAGGCGGGCG GCGCGGACCG CTTCGGCCTG ACGATGCATC CGCTGACGGA CGACGAGCGC CGCTCGACGG GGCTGCCCGT CGGGATGATG GTCGATGCGG TGCGCGGGCC GGCGGAGAAT GCCGGCATCC GGCCGGGCGA CGTCGTGCTG GAGTTCGACG ACACGCTGAT CGAGACGCCG GACATGGTGC CCGCGTTGGA GGCGAAGGCC GGCAAGGCGG TCGCGGTGCT GATTCAGCGG GGAAACGAGC GGAAGTTCGT GTCGGTGCGC TCGAGGTAG
|
Protein sequence | MSRQTFGRAL IHAAVLGAAL AGFACLRPAE VAAGTLSPSA KAKRIAQPGP AGPIDFPTLV ERYGPAVVSV SVPAQDPQMS ASGLEALDPD DPFFAYFKSA AMPPPASQDN VPRAMAGAGS GFIVSADGLI LTTAYVVGQA SEATVRLIDR REFKARVLAV DDQSDVAVLQ IDATKLPTVR LGDSSRVRVG EPVLTIGTPD GSANTVTTGI VSATSRTLPD GSRFPFFQTD VTGNLDNSGG PVFNRAGEVI GIDVQIYGSG DRNPDVTFAI PIGMAAKVRA QVLQAQPPAQ PAQRAAAQNG LGVDVQDVGP GLAAAFGLPR PAGALVNAVE PGSPAAAVGL KPGDVIVQVG DRPLGRSSEL AADVAALPPA ASVPITLVRN RMPMTVMLGA GAAAGASPAA ASANAGAGSS EAGGADRFGL TMHPLTDDER RSTGLPVGMM VDAVRGPAEN AGIRPGDVVL EFDDTLIETP DMVPALEAKA GKAVAVLIQR GNERKFVSVR SR
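The gene sequence begins with GTG while the protein begins with M. That is expected under translation table 11, where GTG is an accepted initiation codon and is read as methionine at the first position. The sketch below is a minimal, dependency-free illustration of that rule plus a GC-fraction helper. The start-codon set is an assumption based on the NCBI definition of table 11, and the helper names are illustrative.

```python
from itertools import product

# Standard codon layout in TCAG order; table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiation codons for table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon always yields methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    print(translate("GTGTCTCGTC AAACGTTTGG CCGCGCCCTG"))  # prints "MSRQTFGRAL"
```

Passing the full Gene sequence field to `translate` and `gc_fraction` gives values to compare against the Protein sequence and GC content rows.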