Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1824 |
Symbol | |
ID | 4905436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1789797 |
End bp | 1792127 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640144930 |
Product | prolyl oligopeptidase family protein |
Protein accession | YP_001075858 |
Protein GI | 126455706 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATAGTCG GAATCGCGCA TGGATACGGG CGAAGCGGAA AACGGCACGG CGTCGGCACC GGCTTCGGAC AGCCCCGGCA CGACGAAGGC ACGCTTGCGA CGTGTACGCG GCGCGCCACG CGCGCGCCGC GAATGCCTGC GGCGGACGCG TCGCAATCGA CGAGCCGGCA GCCAAGCGCG AGCGCCGCTC GTCTTACGCT ACCATTACAC GCCATGCCCC TTGCAAAGAA CGCCATGCCT CATGCTTCCT GGCCCGAACA GGCCGATCCG CATCAATTCC TCGAAGAACT GGACAGCGCC GCGAGCGTCG GCTGGGTCGA CGCGCAAAAC GCCCGCACGC ACGATGCGCC CTGGCTCGAC GAAGCGCACT ATCGCGCGCT GGTCGAGCGC TTCACCCGGG CGCTGCTGCC GCGCGAGCGC CCGGTGATTC CGCAGCGCTG GCAGGACTGG GCGTACGACG TCTGGCAGGA CGAACAGCAT CCGAAGGGCC TGTGGCGGCG CACGCGATGG ACGAGCTGGC GCAGCGGCCA CGCGGACTGG CAGACGTTGA TCGACCTCGA TGCGCTTGGT GAAGCGCAAG GCGTGCAGTG GGTGTTCGAC GATCAGCTCA TCCTCGAGCC GGACGGCGAT CGTGCGCTGA TCGTGCTGTC CGACGGCGGC GCCGACGCGG TCGTCGTCCG CGAGTTCGAC ATCGCGCAAT GCCGGTTCGT CGACGACGGC TTCTCGATCG AAGCGGCCGG CAAGCATTCG GTCGAATGGA TCGATCGCGA CACGATCTAC GTCGGCTGGG ACGACGGCGG CGCCACCGTC ACGCGCTCCG GCTATCCGCG CGAAGTGCGG CGCTGGACGC GCGGCACGCC GCTGTCCAGC GCGCCCGTGG TGTTTCGCGG CGCGCGCGGC GACATCTCGG TCGATGCGCA ATATGATCCG CTCGACCGGC ATCACGCGAT CGAGCAGGCG ATCAATTTCT ACGACGCGAA CACGTATCGC CTCGCCGAGG ACGGCGCGTG GGCGCGCTAC GACGTGCCAC CGCACGTCGA AGTCGGTTAC TGGAGCGGGT GGCTGCTGCT CCAGCCGCGG CTCGACTGGA CTTGCGGCGG CGCGCGCTAC GCGGGCGGCA GCCTGCTCGC GATCCGCGAG GACGCGTTCG TCGCCGGTGA GCGCGCGTTC GCCGCGCTGT TCGAGCCGAA CGAGCGCACG TCCGCATGCG GCTGGACGCA CACGCGCCGC TACGTGCTGG TGTCGTGGCT CGACGACGTG CTCACGCGCA CGATGCTCTG GCTTCCCGAA CGTCAGGATG ACGGAGCATG GCGCTGGCAT GCTCGTCCGT TCCCCGCGCG AGGGCTCGCG CAAGTGGACG TGTCGCCCGT CGAGCCCACG TTCGACGACG AGGTGTACGT GAGCGTCGAC GATTACCTGA AGCCGCCCGA GTATTCGCTC GCGAATCTCG CCAGCGACGA CCTGTCCGCC TGGACGCTGC TCGACCGCTG GCCGACGCAG TTCGACGCGT CCGAACTGAC GGTGCGGCGC GAACACGCGC GCTCGCGCGA CGGCACGCTC GTGCCTTATA CGCTGGTCGG GCCGCGCGAC GTGCTGGACA ATGCGGCGCG CGCGCCGCGC CCCTGCCTGT TGAACGGCTA CGGCGGCTTC GCGATTGCGC TCACGCCCGA TTACGATCCG TTGCTCGGCA TCGGCTGGCT CGAGAAAGGC GGCATCGCGG TGTTCGCCCA TATTCGCGGC GGCGGCGAGT TCGGCACGCA GTGGCACGAA TCGGCGCGGC AAACGCAACG GCAGCGATCG TTCGACGATT TCATCGCGGT CGCCGAAAAA CTCGTCGCGG ACGGCGTGAC GAGCGCCGCG CAACTGGGTA TTCGCGGCGG CAGCAACGGC GGGCTGCTGG TCGCGGCATG CATGATTCAG CGCCCGGACC TGTTCGGCGC GGTGGTGAGC GACGTGCCGC TTCTCGACAT GCAGCGCTAT GCGCTGCTGC ACGCGGGCGC ATCGTGGCTG GACGAATTCG GCGATCCCGA CGATCCGGCG CATGCGTCGG CGCTCGCGGC CTACTCGCCG TATCACCGGG TCGCGCGCGA CATCGCGTAT CCGCCCGCGC TGTTCACGAC ATCGACGAGC GACGACCGCG TGCATCCCGC CCATGCGAGA AAAATGGTCG CGCGCATGCA GGCGCAAGGG CACCGAAACG TATGGCTGAT CGAGAAAACC GATGGCGGCC ACGGCAGCGC GGACGCGATC GATACCGCCG AGCACGAAGC GATCGGCTAT GTGTTTCTGT GGACTCACTT GTCCCGCGGC GCGCATGACG CGCGCGAGTG A
|
Protein sequence | MIVGIAHGYG RSGKRHGVGT GFGQPRHDEG TLATCTRRAT RAPRMPAADA SQSTSRQPSA SAARLTLPLH AMPLAKNAMP HASWPEQADP HQFLEELDSA ASVGWVDAQN ARTHDAPWLD EAHYRALVER FTRALLPRER PVIPQRWQDW AYDVWQDEQH PKGLWRRTRW TSWRSGHADW QTLIDLDALG EAQGVQWVFD DQLILEPDGD RALIVLSDGG ADAVVVREFD IAQCRFVDDG FSIEAAGKHS VEWIDRDTIY VGWDDGGATV TRSGYPREVR RWTRGTPLSS APVVFRGARG DISVDAQYDP LDRHHAIEQA INFYDANTYR LAEDGAWARY DVPPHVEVGY WSGWLLLQPR LDWTCGGARY AGGSLLAIRE DAFVAGERAF AALFEPNERT SACGWTHTRR YVLVSWLDDV LTRTMLWLPE RQDDGAWRWH ARPFPARGLA QVDVSPVEPT FDDEVYVSVD DYLKPPEYSL ANLASDDLSA WTLLDRWPTQ FDASELTVRR EHARSRDGTL VPYTLVGPRD VLDNAARAPR PCLLNGYGGF AIALTPDYDP LLGIGWLEKG GIAVFAHIRG GGEFGTQWHE SARQTQRQRS FDDFIAVAEK LVADGVTSAA QLGIRGGSNG GLLVAACMIQ RPDLFGAVVS DVPLLDMQRY ALLHAGASWL DEFGDPDDPA HASALAAYSP YHRVARDIAY PPALFTTSTS DDRVHPAHAR KMVARMQAQG HRNVWLIEKT DGGHGSADAI DTAEHEAIGY VFLWTHLSRG AHDARE
|
| |