Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II2193 |
Symbol | |
ID | 3845692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 2694759 |
End bp | 2696003 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637839494 |
Product | proline iminopeptidase |
Protein accession | YP_440381 |
Protein GI | 83717984 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGCGGCG CTACATCGCG CCGACGGGCC GCACGCTGGT GTGCATGCCG ATCGAGCGGA TCGCATACGC GCGGAAGGCG TAGCGGCGGC GACGCGGCCG GCACGGCCGA CGGCCGGCCG CGTCCGAGGC GAACGCGCGC GCCGGTTTTC CGTCGACTCG GCATAATGAT GAGGCAGTCG TTGCTTTCGA CGCCGTGCCG GATCGGCGCG GCCGGGCGCG TGCGGCGCAT CGCACGTGCC GCATGCGCGC GTTGTCCGCG CGCGATCGCC CCAGCCGTTT TCCTCCATTC AACCGGAGTG CCTCTCTTGT ACCCACCAAT CGAACCTTAC GCACACGGCT TGCTCGATAC CGGCGACGGC CATCGCGTGT ATTGGGAGCT GTGCGGCAAC CCCGACGGCA AGCCGGCCGT CTTCCTGCAC GGCGGCCCGG GCAGTGGCTG CAGCGCCGAG CACCGCCGCC TCTTCGACCC CGCGCGCTAC AACGTGCTGC TGTTCGACCA GCGCGGCTGC GGGCGCTCCG CGCCGCACGC GAGCCTCGAG AACAACACGA CATGGCATCT CGTCGACGAC ATCGAGCGAT TGCGCGAGAT GCTCGGCGTC GAGCGCTGGC TCGTGTTCGG CGGCTCGTGG GGCAGCGCGC TCGCGCTCGC GTATGGCGAG ACGCATCCGG CGCGCGTGAC CGAGCTCGTC GTGCGCGGCG TCTTCACGGT GCGCCGCTCA GAGCTCCTCT GGTATTACCA GGAAGGCGCG TCGTGGCTGT TTCCGGACCT GTGGGAAGAC TTCGTCGCGC CCATTGCGCC CGCCGAGCGC TCGGACCTGA TCGCCGCGTA TCGCCGCCGG CTGACGGGCG GCGACGAAGC GGCGAAGCGC GAAGCGGCGC GCGCGTGGAG CATCTGGGAA GGCCGGACGA TCACGCTGCT GCCGAATGCC GCGCACGAAG CGCATTTCGG CGACGCGCAT TACGCGCTCG CGTTCGCCCG CATCGAAAAC CACTACTTCG TTCATCAAGG TTTCATGGAA GACGGGCAGT TGCTGCGCGA CGCGCATCGT CTGGCGGACA TTCCAGGCGT GATCGTTCAG GGGCGCTACG ACGTCGCGAC GCCCGCGCGC ACCGCGTGGG AGCTCGCGAA GGCGTGGCCG CGCGCGTCGC TCGAGATCGT GCCGGACGCG GGCCACGCGT ACGACGAGCC GGGCATCCTG CGCGCGCTGA TCGCGGCGAC CGACCGCTTC GCGCGCGAAC GCTGA
|
Protein sequence | MRGATSRRRA ARWCACRSSG SHTRGRRSGG DAAGTADGRP RPRRTRAPVF RRLGIMMRQS LLSTPCRIGA AGRVRRIARA ACARCPRAIA PAVFLHSTGV PLLYPPIEPY AHGLLDTGDG HRVYWELCGN PDGKPAVFLH GGPGSGCSAE HRRLFDPARY NVLLFDQRGC GRSAPHASLE NNTTWHLVDD IERLREMLGV ERWLVFGGSW GSALALAYGE THPARVTELV VRGVFTVRRS ELLWYYQEGA SWLFPDLWED FVAPIAPAER SDLIAAYRRR LTGGDEAAKR EAARAWSIWE GRTITLLPNA AHEAHFGDAH YALAFARIEN HYFVHQGFME DGQLLRDAHR LADIPGVIVQ GRYDVATPAR TAWELAKAWP RASLEIVPDA GHAYDEPGIL RALIAATDRF ARER
|
| |