Gene Information |
Locus tag | BURPS668_1015 |
Symbol | pepA |
ID | 4883681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 989975 |
End bp | 991486 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640126943 |
Product | leucyl aminopeptidase |
Protein accession | YP_001058065 |
Protein GI | 126441940 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
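The coordinate, length, and protein-length fields in the table above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes a single terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 989975
end_bp = 991486
gene_length_bp = 1512
protein_length_aa = 503


def span_length(start: int, end: int) -> int:
    """Length of an interval given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon, for an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 991486 - 989975 + 1 = 1512 and 1512 / 3 - 1 = 503.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```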
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.490931 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTTA GCATAAAAGG CTGTGATTGG AGCAAAGGCA CGGCGAACGG GTTCCTGACG GGGAAATCCG ACTGCATCGT GCTGGGCGTG TTCGAGGCGC AAACCTTGTC CGGCGCGGCG CTCGACATCG ACGAAGCCAC GAAGGGGCTC GTCTCGCGCG TGATCAAGGC GGGCGACATC GACGGCAAGC TCGGCAAGAC CTTGTTTTTG CACGAGGTTT CGGGCATCGG CGCATCGCGC GTGCTGCTCG TCGGCCTGGG CAGGCAGGAT GCTTTCAGCC AGAAAGCCTA CGGCGACGCG GCAAAGGCCG CATGGCGCGC GCTGCTCGGC ACGAAAGTGG TTCAGGTCAC GTTCACGCTC GCGCAGTTGC CCGTGCCCGA GCGCGCGTCC GACTGGGGTG TGCGCGCGGC GATTCTCGCG CTGCGCAATG AAACGTACAA GTTCACGCAG ATGAAGAGCA AGCCGGACGC GGGCGCGCCG GCGCTCAAGC GCGTCGTGTT CAGCGTCGAT CCGGCCGACG ACAAGGCGGC GAAGGTCGCC GCGAAGCAGG CGGTCGCGCT CGCGAACGGG ATGGACCTCA CGCGCGACCT CGGCAATCTG CCCGGCAACG TCTGCACGCC GACCTACCTC GCGAACACCG CGAAGAAGAT CGCGAAGGAC TGGGGCCTGA AAGTCGACGT GCTGGGCCTG AAGCAGATCC AGGCGCTCAA GATGGGCTCG TTCCTGTCGG TCGCGAAGGG CTCGGTCGAG CCGCCGCAGT TCATCGTGCT GCAGTACCGG GGCGCGGCCG CGAAGGCGGC GCCCGTCGTG CTCGTCGGCA AGGGCATCAC GTTCGACTCC GGCGGCATTT CGCTGAAGCC GGGCGAGGGA ATGGACGAGA TGAAGTACGA CATGTGCGGC GCGGGCTCGG TGCTCGGCAC GATGCGCGCG GTCGCCGAAA TGGGCCTGAA GGTCAACGTC GTCGCGATCG TGCCGACCTG CGAGAACATG CCGGCCGGCA ACGCGAACAA GCCGGGCGAC ATCGTCACGA GCATGAAGGG CCTGACGATC GAGGTGCTCA ACACCGACGC GGAGGGCCGC CTCATCCTGT GCGACGCGCT CACGTACGCG GAGCGCTTCA AGCCGGCCGC CGTGATCGAC GTCGCGACGC TGACGGGCGC GTGCATCATC GCGCTCGGCC ACCACAACAC CGGCCTCTTC TCGAAGGACG ACGCGCTCGC GGGCGAGCTG CTCGACGCGT CGCGCGAAGC GGGCGATCCG GCGTGGCGCC TGCCGCTCGA CGACGAGTAT CAGGATCAGC TGAAGTCGAA CTTCGCGGAT CTCGCGAACA TCGGCGGGCG CCCGGCCGGC AGCGTGACGG CCGCGTGCTT CCTGTCGCGC TTCGCGGAAA ACTATCCGTG GGCGCACCTC GACATCGCGG GCACCGCCTG GAAGAGCGGC GCGGCGAAGG GGGCGACGGG CCGCCCCGTG CCGCTCCTCG CGCAATTCCT GATCGACCGC GCCGGCGCGT GA
|
Protein sequence | MDFSIKGCDW SKGTANGFLT GKSDCIVLGV FEAQTLSGAA LDIDEATKGL VSRVIKAGDI DGKLGKTLFL HEVSGIGASR VLLVGLGRQD AFSQKAYGDA AKAAWRALLG TKVVQVTFTL AQLPVPERAS DWGVRAAILA LRNETYKFTQ MKSKPDAGAP ALKRVVFSVD PADDKAAKVA AKQAVALANG MDLTRDLGNL PGNVCTPTYL ANTAKKIAKD WGLKVDVLGL KQIQALKMGS FLSVAKGSVE PPQFIVLQYR GAAAKAAPVV LVGKGITFDS GGISLKPGEG MDEMKYDMCG AGSVLGTMRA VAEMGLKVNV VAIVPTCENM PAGNANKPGD IVTSMKGLTI EVLNTDAEGR LILCDALTYA ERFKPAAVID VATLTGACII ALGHHNTGLF SKDDALAGEL LDASREAGDP AWRLPLDDEY QDQLKSNFAD LANIGGRPAG SVTAACFLSR FAENYPWAHL DIAGTAWKSG AAKGATGRPV PLLAQFLIDR AGA
|
| |
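The Gene sequence and Protein sequence rows above are printed in blocks of ten characters. A minimal sketch of how the two relate is given here: it strips the spacing, translates the first thirty bases of the gene sequence, and provides a GC-fraction helper that could be run on the full sequence to compare against the listed GC content. The function names are hypothetical. The codon assignments are the standard ones, which translation table 11 shares; the alternative start codons that table 11 permits are not handled, which is harmless here because this gene starts with ATG.

```python
# Standard codon table built in NCBI "TCAG" order (assignments shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three ten-base blocks of the gene sequence from the record.
gene_prefix = "ATGGACTTTA GCATAAAAGG CTGTGATTGG"
print(translate(gene_prefix))  # MDFSIKGCDW, the first block of the protein sequence
```

Passing the complete Gene sequence value to `translate` should reproduce the Protein sequence row, since the sequence ends in the stop codon TGA.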