Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10247_A2151 |
Symbol | pip |
ID | 4890403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009079 |
Strand | - |
Start bp | 2083654 |
End bp | 2084592 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640148412 |
Product | proline iminopeptidase |
Protein accession | YP_001079323 |
Protein GI | 262192849 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTATCCAC CGATCGAACC TTATGCACAC GGCTTCCTCG ATACCGGCGA CGGCCATCGC GTGTACTGGG AGCTGTGCGG CAACCCCAAC GGCAAGCCGG CCGTCTTCCT GCACGGCGGC CCCGGCAGCG GCTGCAGCGC CGATCACCGT CGCCTCTTCG ATCCCGCGCG CTACAACGTG CTGCTGTTCG ACCAACGCGG CTGCGGCCGC TCGACGCCGC ACGCGAGCCT CGAGAACAAC ACGACATGGC ACCTCGTCGA CGACATCGAG CGGCTGCGCG CGATGCTCGG CGTCGAGCGC TGGCTCGTGT TCGGCGGCTC GTGGGGCAGC GCGCTCGCGC TCGCATATGC GCAAACGCAC CCGGCGCGCG TGGCCGAGCT CGTCGTGCGC GGCATCTTCA CGGTGCGCCG GTCCGAGCTG CTCTGGTACT ACCAGGAAGG CGCGTCGTGG CTGTTTCCGG ATCTGTGGGA AGACTTCATC GCGCCCATTC CGAGCGCCGA GCGCGCGGAT CTGATCGCCG CGTATCGCCG CCGGCTGACG GGCGACGACG AGGCGGCCAA GCGCGAGGCC GCGCGCGCGT GGAGCGTCTG GGAGGGCCGG ACGATCGCGC TGCTGCCGAA CGCCGCGCAC GAAACGTATT TCGGCGACGC GCATTTCGCG CTCGCGTTCG CCCGCATCGA AAACCACTAC TTCGTTCATC AAGGTTTCAT GGAAGACGGG CAGTTGCTGC GCGATGCGCA TCGTCTCGCG GACATCCCGG GCGTGATCGT TCAGGGGCGC TACGACGTCG CGACGCCGGC GCGCACCGCG TGGGAACTCG CGAAGGCGTG GCCGCGCGCG TCGCTCGAGA TCGTGCCCGA CGCGGGCCAC GCATACGACG AGCCGGGCAT TCTGCGCGCG CTGATCGCGG CGACCGACCG CTTCGCGCGC GAGCGCTGA
|
Protein sequence | MYPPIEPYAH GFLDTGDGHR VYWELCGNPN GKPAVFLHGG PGSGCSADHR RLFDPARYNV LLFDQRGCGR STPHASLENN TTWHLVDDIE RLRAMLGVER WLVFGGSWGS ALALAYAQTH PARVAELVVR GIFTVRRSEL LWYYQEGASW LFPDLWEDFI APIPSAERAD LIAAYRRRLT GDDEAAKREA ARAWSVWEGR TIALLPNAAH ETYFGDAHFA LAFARIENHY FVHQGFMEDG QLLRDAHRLA DIPGVIVQGR YDVATPARTA WELAKAWPRA SLEIVPDAGH AYDEPGILRA LIAATDRFAR ER
|
| |