Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1426 |
Symbol | |
ID | 4887218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1336962 |
End bp | 1338653 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640131365 |
Product | putative serine metalloprotease |
Protein accession | YP_001062423 |
Protein GI | 126445517 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.23258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA AACGTAGTAA TCCTGAACAG ACGCGGCTTG GCCAGGTCCG TGCCGTCGCC GGCATTCTCT CCATGTCCGT CCTCGTTCCG CTCGCCGGTT GCGGCGGCGG TGGCGACGGA GGCGGAAGCG GCACGCCGTC GGCCGCCGCG CAGCCGACCC CCGCGCCGGC GCCGGCGCCG GCCCCGGCAC CCGCGCCGAG CTCGGGTTCG TCGCAATCCA CCAATTCGTC GACCTCGACG GCGGCCTGCC CCGTCACGCA GGCCGCCTCG ACCGCCGCCA GCGAAACGCT CGTCACCCGC ACCGTTTCGC ACGAAGCGCC CGTCGATCAT CTGATCGTCA AGCTGCAACG CACGGCGGCG GCGAGCGCAT CCGGCGCGCG CATCATGGCC GCGGCGAACG ACGCGGCCCG ACTCGATTCG GTGATCCAGC GCGTGATGTC GCAATGGAGC GCGAAGAGCG GCGCCGTTCG CTCGTATGCG CAGAACATCG CGCCGACGAA CGCGGTGCAG GTGGAACGGA CGATGTCGGA CGGTGCCGCG CTGCTCGCGC TCGGACAAAA GATGAGCGCG GATAATGCCG GCGCTCTCGC GCAAACGTTC GCGGCCGATC CGGACGTCGC CTATGCGGAG CCCGACCGGC GCGTGTTCGC CCGCACGGTG GCGACCGACC CGGACTACGC GCAGCAGTGG AACTACTTCG ATCCGGCGGC CGGCATCAAT CTGCCGGACG CATGGAACGT GACGAACGGC CTGCCGAGCG TCGTCACCGC GGTGCTCGAC ACCGGCTATC GCCCGCATCC GGACATCATC GCGAACCTGC TGCCGGGCTA CGATTTCATC TCCGACATCA ACACCGGCAA CAACGGCCAC GGCCGCGGCC CGGATGCGAC CGACCCGGGC GACTGGGTCA CGCAGCAGGA ACTGACCGAT CCGTCGAGCC CGTTCTACCA ATGCGCGAGC GCGCCGTCGA ACAGCAGCTG GCACGGCACG CAGGTCGCCG GCATCATCGG CGCCGCCGCG AACAACGGCA TCGGCATCGC GGGCGTCAGC TGGTACGGCA AGATCCTGCC CGTGCGCGTG CTCGGCAAGT GCGGCGGCAC GACGAGCGAC ATCGCCGACG CGATGCGCTG GGCGGCGGGC ATTCCCGTCG CGGGCGCGCC GACGAACCTC ACGCCGGCGA AGGTGATCAA CCTGAGCCTC GGCGGCACCG GCCCGTGCGG CGACACGTTC CAGCAGGCGA TCAACGACGT GATCGCGCGC GGCACGACCG TCGTCGTCTC GGCCGGCAAC GACGGCCAGG CGACGACGCT GGACCGCCCG GCCAACTGCA AGGGCGTGAT CTCGGTCGGC GCGACCGACA GCACCGGCCA GCGCGCGTGG TACAGCAACT TCGGCTCGGA CATCACGCTG AGCGCGCCGG GCTCGAACAT CCTGTCGACG AGCAATGCGG GCACCACGGT GCCGACCACC GACGCGTACG GCACGCACAG CGGCACGAGC CTTGCCGCGC CGCAGGTGGC GGGCGTCGCC TCGCTGATGC TCGCGGTCAA CCCGAACCTC ACGCCCGCGC AGATCGCGCA GAAGCTCGCG AGCACCGCGC GGCCGTCGCC GGCCACCGCA TCCTGCCTCG CGCGCGCGCC GGGCGCGGGC ATCGTCGACG CCGGCACGGT GGTTGCGTCC GCAACGAAAT AG
|
Protein sequence | MNKKRSNPEQ TRLGQVRAVA GILSMSVLVP LAGCGGGGDG GGSGTPSAAA QPTPAPAPAP APAPAPSSGS SQSTNSSTST AACPVTQAAS TAASETLVTR TVSHEAPVDH LIVKLQRTAA ASASGARIMA AANDAARLDS VIQRVMSQWS AKSGAVRSYA QNIAPTNAVQ VERTMSDGAA LLALGQKMSA DNAGALAQTF AADPDVAYAE PDRRVFARTV ATDPDYAQQW NYFDPAAGIN LPDAWNVTNG LPSVVTAVLD TGYRPHPDII ANLLPGYDFI SDINTGNNGH GRGPDATDPG DWVTQQELTD PSSPFYQCAS APSNSSWHGT QVAGIIGAAA NNGIGIAGVS WYGKILPVRV LGKCGGTTSD IADAMRWAAG IPVAGAPTNL TPAKVINLSL GGTGPCGDTF QQAINDVIAR GTTVVVSAGN DGQATTLDRP ANCKGVISVG ATDSTGQRAW YSNFGSDITL SAPGSNILST SNAGTTVPTT DAYGTHSGTS LAAPQVAGVA SLMLAVNPNL TPAQIAQKLA STARPSPATA SCLARAPGAG IVDAGTVVAS ATK
|
| |