Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2113 |
Symbol | |
ID | 4905766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2070371 |
End bp | 2072035 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640145218 |
Product | serine carboxypeptidase family protein |
Protein accession | YP_001076146 |
Protein GI | 126457803 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCG ATTCGACTTC CTCCGGCGGC GCGCAGCCGC TCCATCACGG CGCGAACGGC TCGGTTCACG CGCCGCCGCC GGTCATCGTC GCGCCGAAGG ACGACGGCGA CCAGCCGTTC TTCGATCCGG TCGCCTATGG CAACGGCCCC GACGATTCGG TGACGGACAC CACCGAGGCC GCCGCGATCA CGCACCACAC GGTCCGGATC GACGGCCGCA CGATCGCGTA CACGGCCGCG GCGGGCCATC TCGTGACCGT CGATCCGAGC AGCTCGCAGC CGGATGCGAA GATCTTCTAC GTCGCGTTCA CGCAGGACGG CCAGCAGGAG CAAACGCGCC CCGTCACGTT CTTCTACAAC GGCGGGCCGG GCTCGTCGGC CGTGTTCGTG CTGCTCGGCT CGTTCGCGCC GCGGCGCATC CGCACGTCGA TGCCGAGCTT CACGCCGCCC GCGCCGTACC GGATGGAAGA CAACCCGGAC AGCCTGCTCG ACAAGAGCGA TCTCGTGTTC ATCAACCCGG TCGGCACCGG CTATTCGGCG GCGATCGCGC CGCGCAAGAA CCGCGATTTC TGGGGCGTCG ATCAGGACGC GAACTCGATC AAGCAGTTCA TCAAGCGCTA TCTGACGAAG CACAACCGGT GGAATTCGCC GAAGTACCTG TTCGGCGAAT CGTACGGCAC CGCGCGCAGC TGCGTGCTCG CGTACAAGCT GCACGAGGAC GGCGTCGACC TGAACGGGAT CACGCTGCAG TCGTCGATTC TCGATTACCG GCAGGCGGGC AATCCGGTGG GCGCGCTGCC CACCGCGGCG GCCGACGCGT GGTATCACAA GCGGCTCGGC GTCGCGCCGA CGCCGACCGA TCTCGGCGCA TTCGTGGAGG AGGTCGCGCA GTTCGCGCGC ACCGACTATC TCGGCGCGCT GCGCAAGTTC CCGCAGACCG ACGCGGCCGT CGTCAAGAAG CTGTCCGACT ACACCGGCAT CGACACGACG ACGTTGCTGT CGTGGAGCCT CGACATCGCG GGCTACGACG CGCGCGGCAA CGCGCTGTTC CTCACGACGC TGCTGAAGGC ACAAGGCCTC GCGCTCGGCG CGTACGACGG CCGCGTGACG GGAATCGAAT CGGGGATCGC GGGCCGGATC GATCCGAACT CGGGCGGCAA CGATCCGACG ATGACGGCGG TGTCGGGCGT CTACACGGCG ATGTGGAATA CGTACCTGAA CGAGCAGTTG AAGTACACGT CGAACTCGTC GTTCACCGAC CTGAACGACC AGGCATTCAA GTACTGGGAC TTCGGCCACA TCGATCCGAC GGGCGAACAG CAGGGCGTCG ACGCGAAGGG CAACGTGATC CTGTACACGG CGGGCGATCT CGCCGCGACG ATGGCGCTCA ACGTCGATCT GAAGGTGCTC TCGGCGAACG GGCTCTACGA TTTCGTCACG CCGTTCTACC AGACGGTGCT CGATCTGCAG CAGATGCCGC TCGAGGACCC GAAGGTGCGG CAGAACCTGT CCGCGCGCTT CTATCCGTCC GGGCACATGG TGTACCTCGA CGGCGGCTCG CGCACCACGC TCAAGCACGA CCTCGCGCAG ATGTACGAAT CGACGGTGCG CGACACCGCG GCGGTGATGC GCATTCGCGC GTTGCAGGAG AAAAAGCGCG CGTAG
|
Protein sequence | MSIDSTSSGG AQPLHHGANG SVHAPPPVIV APKDDGDQPF FDPVAYGNGP DDSVTDTTEA AAITHHTVRI DGRTIAYTAA AGHLVTVDPS SSQPDAKIFY VAFTQDGQQE QTRPVTFFYN GGPGSSAVFV LLGSFAPRRI RTSMPSFTPP APYRMEDNPD SLLDKSDLVF INPVGTGYSA AIAPRKNRDF WGVDQDANSI KQFIKRYLTK HNRWNSPKYL FGESYGTARS CVLAYKLHED GVDLNGITLQ SSILDYRQAG NPVGALPTAA ADAWYHKRLG VAPTPTDLGA FVEEVAQFAR TDYLGALRKF PQTDAAVVKK LSDYTGIDTT TLLSWSLDIA GYDARGNALF LTTLLKAQGL ALGAYDGRVT GIESGIAGRI DPNSGGNDPT MTAVSGVYTA MWNTYLNEQL KYTSNSSFTD LNDQAFKYWD FGHIDPTGEQ QGVDAKGNVI LYTAGDLAAT MALNVDLKVL SANGLYDFVT PFYQTVLDLQ QMPLEDPKVR QNLSARFYPS GHMVYLDGGS RTTLKHDLAQ MYESTVRDTA AVMRIRALQE KKRA
|
| |