Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10247_1793 |
Symbol | |
ID | 4891425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009080 |
Strand | - |
Start bp | 1779724 |
End bp | 1781232 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640150448 |
Product | serine protease, MucD |
Protein accession | YP_001081332 |
Protein GI | 126451419 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.455589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAATC CCTCGCTGCG CACCTGGCTC GTCGCCGCGG CGGTGACGGC CTTGACGCCG CTTGCCGCGC AATCGGCGAC GGCGGCTCCG AATGTCACGA CCACGCCCGC CGCAACGGGC GCCGTGCCCG CGGCGCGCGC GGGCCTGCCC GATTTCGCGG ACCTCGTCGA GCGGGTCGGC CCGGCGGTCG TCAACATCCG GACGACGGCG AACGTGCCGG CCGATACGCG CGGCGCGCTG CCGCCCGGCC TCGACAACGG CGACATGTCG GAATTCTTCC GCCGCTTCTT CGGCATTCCG TTGCCGCAGG GGCCGGGCGG GCAGAAGAAC GCGCCGAGCA CGCCCGATGC GCCCGACACC GAACAGAACC GCGGCGTGGG CTCGGGCTTC ATCCTGTCGC CGGACGGCTA TGTGATGACG AACGCGCACG TCGTCGACGA CGCGGACACG ATCTACGTGA CGCTCACCGA CAAGCGCGAA TTCAAGGCAA AGCTCATCGG CGTCGACGAG CGCACGGACG TCGCGATCGT GAAGATCAAC GCGTCGAGCC TGCCGACCGT CGCGATCGGC GATTCGAACC GCGTGCGCGT CGGCGAATGG GTCGTCGCGA TCGGTTCGCC GTTCGGCCTC GACAACACCG TCACGGCCGG CATCGTCAGC GCAAAGGGCC GCAACACCGG CGACTATCTG CCGTTCATCC AGACGGACGT CGCGGTCAAC CCCGGCAACT CGGGCGGCCC GCTCATCAAC ATGCAGGGCG AGGTGATCGG CATCAACTCG CAGATCTACA GCCGCACGGG CGGCTTCATG GGCATTTCGT TCGCGATTCC GATCGACGAG GCGATGCGCG TCGCCGAGCA GCTGAAGGCA TCGGGCAAGG TCACGCGCGG CAGGATCGCG GTCGCGATCG GCGAGGTGAC GAAGGAAGTG GCGGATTCGA TCGGCCTGCC GAAGGCCGAA GGCGCGCTCG TCAGCAGCGT CGAGCCAGGC GGCCCGGCCG ACAAGGCGGG CCTGCAGCCG GGTGACATCA TCCTGAAGTT CAACGGCCGT CCGGTGGAGG CGGCGTCGGA TCTGCCGCGC ATGGTCGGCG ACACGAAGCC GGGCGCGAAG GCGACGGTGA CGGTGTGGCG CAAGGGGCAA TCGCGCGATC TGCCGATCAC GATCGCGGAA TTCCCGGCCG ACAAGGCCGC GAAGGCCGAC AGCCGTCAGG CGCCGCAGCA GAAGCCGCGC AGCAGCGCGC TCGGCCTGAC GGTCAGCGAC CTGTCGCCCG AGCAGTTGAA GACGCTCAAG CTGCGCAACG GCGTGCAGAT CGACGCGGTC GACGGCCCGG CCTCGCGCGC GGGGCTGCAG CGCGGCGACA TCGTGCTGCG CGTCGGCGAC GTCGACATCA CGAGCGCGAA GCAGTTCGTC GACGTGACGT CGAAGCTCGA TCCGCAGCGC GCGGTCGCGG TGCTCGTGCG GCGCGGCGAG AACACGCAGT TCATCCCGAT CCGGCCGCGT CAGAAGTGA
|
Protein sequence | MMNPSLRTWL VAAAVTALTP LAAQSATAAP NVTTTPAATG AVPAARAGLP DFADLVERVG PAVVNIRTTA NVPADTRGAL PPGLDNGDMS EFFRRFFGIP LPQGPGGQKN APSTPDAPDT EQNRGVGSGF ILSPDGYVMT NAHVVDDADT IYVTLTDKRE FKAKLIGVDE RTDVAIVKIN ASSLPTVAIG DSNRVRVGEW VVAIGSPFGL DNTVTAGIVS AKGRNTGDYL PFIQTDVAVN PGNSGGPLIN MQGEVIGINS QIYSRTGGFM GISFAIPIDE AMRVAEQLKA SGKVTRGRIA VAIGEVTKEV ADSIGLPKAE GALVSSVEPG GPADKAGLQP GDIILKFNGR PVEAASDLPR MVGDTKPGAK ATVTVWRKGQ SRDLPITIAE FPADKAAKAD SRQAPQQKPR SSALGLTVSD LSPEQLKTLK LRNGVQIDAV DGPASRAGLQ RGDIVLRVGD VDITSAKQFV DVTSKLDPQR AVAVLVRRGE NTQFIPIRPR QK
|
| |