Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2116 |
Symbol | |
ID | 4904352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2072783 |
End bp | 2074372 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640145221 |
Product | kumamolisin |
Protein accession | YP_001076149 |
Protein GI | 126457317 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.229792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGGC ATCTTCACGC CGGCAACGAA TCGCATCTCG TCGCCGAATC CACGTGCATC GGGCCGTGCG ATCCGGCCGA GACGATTCAT GTAGTGGTGA TGTTGCGGCG ACAGCAAGAG CAGCACCTCG ATTCATTGTT GCAGGGCCTC GCGAGCGGCG ATCCGGGCGT GAAGCCTGTC TCGCGCGAGG CGTTCGCCCA GCGTTTCGGC GCGCATCCCG ACGACGTCAT GAAAGTCGAG GCATTCGCGC AGCAGCGCGG CCTCGCGGTC GCGCGCGTCG ATCCGGTCGA GAGCCTCGTC GTGCTGTCGG GCACGATCGC GCAGTTCGAG GCGGCCTTCG GCGTGAAGCT CGAGCGCTTC GAGCATCGGT CGATCGGCCA GTATCGCGGC CGCACGGGCG ATATCACGCT GCCCGACGAG TTGCACGGCA TCGTCACCGC GGTGCTCGGG CTCGACGACC GCCCGCAGGC CCGGCCGCAT TTCCGGCTGC GGCCGACTTT CCTGCCCGCG CGCGCGCCGG CCGTCACCTA CACGCCGCCG CAGCTCGCGG CCCTCTACGA TTTCCCGCCC GGCGACGGCG CGGGCCAGTG CATCGCGATC GTCGAGCTCG GCGGCGGCTA TCGGCCGGCC GAGATCCAGC AGTATTTCGG CGGCCTCGGG CTCGCGCGGC AGCCGAAGCT CGTCGACGTG AGCGTCGGCG CGGGGCGCAA CGCGCCGACG GGCGATCCGA GCGGGCCGGA CGGCGAAGTC GCGCTCGATA TCGAGATCGC GGGCGCGATC GCGCCCGGCG CGACGATTGC CGTCTATTTC GCGCAGAACA GCGACGCCGG CTTCATCCAG GCGGTCAATC AGGCGGTGCA CGACACGACG AACCGGCCCT CCGTCGTGTC GATCAGTTGG GGCGCGGCGG AGGCGAACTG GACGTCGCAA TCGATCCAGG CCTTCGATAG CGTGCTGCAG TCGGCCGCGG CGCTCGGCGT GACCGTGTGC GCGGCGTCCG GCGATGACGG CTCGAACGAC GGCCTGCAGG ACGGCACGAA TCACGTCGAT TTCCCGGCAT CGAGCCCGTA CGTGCTCGCG TGCGGCGGCA CGCGGCTCGA CGCACTGCCG GGGCAGGGCA TCCGCAGCGA AGTCGTGTGG AACGACGAGG CGGCGGGCGG CGGCGCGACG GGCGGCGGCG TCAGCGCCGT GTTCGACGTG CCGCAGTGGC AGAGCGGCCT GAGCGCGACG CTCGCGCAGG GCGGCGGCGC GTCGCCGCTC GCGAAGCGCG GCGTGCCGGA CGTCGCGGGC GATGCGTCGC CCGCGACGGG CTACGAGGTG TTCGTCGCGG GCACGTCGAC GGTGATGGGC GGCACGAGCG CCGTCGCACC GCTGTGGGCC GCGCTCGTCG CGCGGATCAA TGCGGCGGCG GGCAGCCCCG CGGGCTGGAT CAACCCGAAG CTGTACCGGA ACGCGGGCGC GCTGCACGAC ATCTCGGTGG GCGATAACGG CGCATATGCG GCGACGCCGG GCTGGGACGC GTGCACGGGG CTCGGCAGCC CGAACGGCGC GAAGGTCGCG GCGGCGCTGA AGGGCGGCGC GGCGGGCTGA
|
Protein sequence | MARHLHAGNE SHLVAESTCI GPCDPAETIH VVVMLRRQQE QHLDSLLQGL ASGDPGVKPV SREAFAQRFG AHPDDVMKVE AFAQQRGLAV ARVDPVESLV VLSGTIAQFE AAFGVKLERF EHRSIGQYRG RTGDITLPDE LHGIVTAVLG LDDRPQARPH FRLRPTFLPA RAPAVTYTPP QLAALYDFPP GDGAGQCIAI VELGGGYRPA EIQQYFGGLG LARQPKLVDV SVGAGRNAPT GDPSGPDGEV ALDIEIAGAI APGATIAVYF AQNSDAGFIQ AVNQAVHDTT NRPSVVSISW GAAEANWTSQ SIQAFDSVLQ SAAALGVTVC AASGDDGSND GLQDGTNHVD FPASSPYVLA CGGTRLDALP GQGIRSEVVW NDEAAGGGAT GGGVSAVFDV PQWQSGLSAT LAQGGGASPL AKRGVPDVAG DASPATGYEV FVAGTSTVMG GTSAVAPLWA ALVARINAAA GSPAGWINPK LYRNAGALHD ISVGDNGAYA ATPGWDACTG LGSPNGAKVA AALKGGAAG
|
| |