Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1726 |
Symbol | |
ID | 3849646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 1933925 |
End bp | 1935382 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637841395 |
Product | serine protease, MucD |
Protein accession | YP_442261 |
Protein GI | 83720925 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACGCCGC TTGCCGCGCA ATCGGCGACG GCGGCTTCGA ATGTCACGAC CGCGCCTGCC GCCACGGGCG CCGCGCCCAC GACGCGCGCC GGTTTGCCCG ATTTCGCGGA CCTCGTCGAG AGGGTCGGCC CGGCCGTCGT CAACATCCGG ACGACGGCGA ACGTGCCGGC CGATACGCGC GGCGCGCTGC CGCCCGGCCT CGACAACGGC GACATGTCGG AGTTCTTCCG CCGCTTCTTC GGCATTCCGC TGCCGCAGCC GCCGGGCGGG CAGAAGAATG CGCCGAGCGC GCCCGACGCA CCGGACACCG AGCAGAACCG CGGCGTCGGC TCGGGCTTCA TCCTGTCGCC GGACGGGTAC GTGATGACGA ACGCGCATGT CGTCGACGAC GCGGACACGA TCTACGTGAC GCTCACCGAC AAGCGCGAAT TCAAGGCGAA GCTCATCGGC GTCGACGAGC GCACCGACGT CGCGATCGTG AAGATCAACG CGTCGAGCCT GCCGACCGTT GCGATCGGCG ATTCGAACCG CGTGCGCGTC GGCGAATGGG TCGTCGCGAT CGGTTCGCCG TTCGGTCTCG ACAACACGGT CACGGCCGGC ATCGTCAGCG CGAAGGGCCG CAACACCGGC GACTATCTGC CGTTCATCCA GACGGACGTC GCGGTCAACC CCGGCAACTC GGGCGGCCCG CTCATCAACA TGCAGGGCGA GGTGATCGGC ATCAACTCGC AGATCTACAG CCGCACGGGC GGTTTCATGG GGATCTCGTT CGCGATTCCG ATCGACGAGG CGATGCGCGT CGCCGAGCAG CTGAAAGCGT CGGGCAAGGT CACGCGCGGC CGGATCGCGG TCGCGATCGG CGAGGTGACG AAGGAAGTCG CCGATTCGAT CGGTTTGCCG AAGGCCGAAG GCGCGCTTGT CAGCAGCGTC GAGTCGGGCG GTCCGGCCGA CAAGGCGGGC CTCCAGCCGG GCGACATCAT CCTGAAGTTC AACGGCCGTT CGGTCGAAAC GGCGTCGGAT CTGCCGCGCA TGGTCGGCGA CACGAAGCCG GGCACGAAGG CGACGGTGAC GGTGTGGCGC AAGGGGCAGT CGCGCGATTT GCCGATCACG ATCGCGGAAT TCCCGGCCGA CAAGATCGCG AAGGCAAGCA GCCGCCAGGC GCCGCAGCAG AAGCCGCGCA GCAGCGCGCT CGGCCTGGCG GTCAGCGATC TGTCGCCCGA GCAGTTGAAG ACGCTCAAGC TGCGCAACGG CGTGCAGATC GACGCGGTCG ACGGCCCGGC CGCGCGCGCG GGGCTGCAGC GCGGCGATAT CGTGCTGCGC GTCGGCGACG TCGACATCTC GAGCGCGAAG CAGTTCGTCG ACGTGACGTC GAAGCTCGAT CCTCAGCGCG CGGTCGCGGT GCTCGTGCGG CGCGGCGACA ACACGCAGTT CATCCCGATC CGGCCGCGCC AGAAGTGA
|
Protein sequence | MTPLAAQSAT AASNVTTAPA ATGAAPTTRA GLPDFADLVE RVGPAVVNIR TTANVPADTR GALPPGLDNG DMSEFFRRFF GIPLPQPPGG QKNAPSAPDA PDTEQNRGVG SGFILSPDGY VMTNAHVVDD ADTIYVTLTD KREFKAKLIG VDERTDVAIV KINASSLPTV AIGDSNRVRV GEWVVAIGSP FGLDNTVTAG IVSAKGRNTG DYLPFIQTDV AVNPGNSGGP LINMQGEVIG INSQIYSRTG GFMGISFAIP IDEAMRVAEQ LKASGKVTRG RIAVAIGEVT KEVADSIGLP KAEGALVSSV ESGGPADKAG LQPGDIILKF NGRSVETASD LPRMVGDTKP GTKATVTVWR KGQSRDLPIT IAEFPADKIA KASSRQAPQQ KPRSSALGLA VSDLSPEQLK TLKLRNGVQI DAVDGPAARA GLQRGDIVLR VGDVDISSAK QFVDVTSKLD PQRAVAVLVR RGDNTQFIPI RPRQK
|
| |