Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2200 |
Symbol | |
ID | 4886438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2131721 |
End bp | 2133385 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640132137 |
Product | serine carboxypeptidase family protein |
Protein accession | YP_001063194 |
Protein GI | 126443582 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.555603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCG ATTCGACTTC CTCCGGCGGC GCGCAGCCGC TCCATCACGG CGCGAACGGC TCGGTTCACG CGCCGCCGCC GATCATCGTC GCGCCGAAGG ACGACGGCGA CCAGCCGTTC TTCGATCCGG TCGCCTACGG CAACGGCCCC GACGATTCGG TGACGGACAC CACCGAGGCC GCCGCGATCA CGCACCACAC GGTCCGGATC GACGGCCGCA CGATCGCGTA CACGGCCGCG GCGGGCCATC TCGTGACTGT CGATCCGAGC AGCTCGCAGC CGGATGCGAA GATCTTCTAC GTCGCGTTCA CGCAGGACGG CCAGCAGGAG CAAACGCGCC CCGTCACGTT CTTCTACAAC GGCGGGCCGG GCTCGTCGGC CGTGTTCGTG CTGCTCGGCT CGTTCGCGCC GCGGCGCATC CGCACGTCGA TGCCGAGCTT CACGCCGCCC GCGCCGTACC GGATGGAAGA CAACCCGGAC AGCCTGCTCG ACAAGAGCGA TCTCGTGTTC ATCAACCCGG TCGGCACCGG CTATTCGGCG GCGATCGCGC CGCGCAAGAA CCGCGATTTC TGGGGCGTCG ATCAGGACGC GAACTCGATC AAGCAGTTCA TCAAGCGCTA TCTGACGAAG CACAACCGGT GGAATTCGCC GAAGTACCTG TTCGGCGAAT CGTACGGCAC CGCGCGCAGC TGCGTGCTCG CGTACAAGCT GCACGAGGAC GGCGTCGACC TGAACGGGAT CACGCTGCAG TCGTCGATTC TCGATTACCG GCAGGCGGGC AATCCGGTGG GCGCGCTGCC CACCGCGGCG GCCGACGCGT GGTATCACAA GCGGCTCGGC GTCGCGCCGA CGCCGACCGA TCTCGGCGCG TTCGTGGAGG AGGTCGCGCA GTTCGCGCGC ACCGACTATC TCGGCGCGCT GCGCAAGTTC CCGCAGGCCG ATGCGGCCGT CGTCAAGAAG CTGTCCGACT ACACCGGCAT CGACACGACG ACGTTGCTGT CGTGGAGCCT CGACATCGCG GGCTACGACG CGCGCGGCAA CGCGCTGTTC CTCACGACGC TGCTGAAGGC ACAAGGCCTC GCGCTCGGCG CGTACGACGG CCGCGTGACG GGAATCGAAT CGGGGATCGC GGGCCGGATC GATCCGAACT CGGGCGGCAA CGATCCGACG ATGACGGCGG TGTCGGGCGT CTACACGGCG ATGTGGAATA CGTACCTGAA CGAGCAGTTG AAATACACGT CGAACTCGTC GTTCACCGAC CTGAACGACC AGGCATTCAA GTACTGGGAC TTCGGCCACA TCGATCCGAC GGGCGAACAG CAGGGCGTCG ACGCGAAGGG CAACGTGATC CTGTACACGG CGGGCGATCT CGCCGCGACG ATGGCGCTCA ACGTCGATCT GAAGGTGCTG TCGGCGAACG GGCTCTACGA TTTCGTCACG CCGTTCTACC AGACGGTGCT CGATCTGCAG CAGATGCCGC TCGAGGACCC GAAGGTGCGG CAGAACCTGT CCGCGCGCTT CTATCCGTCC GGGCACATGG TGTACCTCGA CGGCGGCTCG CGCACCACGC TCAAGCACGA CCTCGCGCAG ATGTACGAAT CGACGGTGCG CGACACCGCG GCGGTGATGC GCATTCGCGC GTTGCAGGAG AAAAAGCGCG CGTAG
|
Protein sequence | MSIDSTSSGG AQPLHHGANG SVHAPPPIIV APKDDGDQPF FDPVAYGNGP DDSVTDTTEA AAITHHTVRI DGRTIAYTAA AGHLVTVDPS SSQPDAKIFY VAFTQDGQQE QTRPVTFFYN GGPGSSAVFV LLGSFAPRRI RTSMPSFTPP APYRMEDNPD SLLDKSDLVF INPVGTGYSA AIAPRKNRDF WGVDQDANSI KQFIKRYLTK HNRWNSPKYL FGESYGTARS CVLAYKLHED GVDLNGITLQ SSILDYRQAG NPVGALPTAA ADAWYHKRLG VAPTPTDLGA FVEEVAQFAR TDYLGALRKF PQADAAVVKK LSDYTGIDTT TLLSWSLDIA GYDARGNALF LTTLLKAQGL ALGAYDGRVT GIESGIAGRI DPNSGGNDPT MTAVSGVYTA MWNTYLNEQL KYTSNSSFTD LNDQAFKYWD FGHIDPTGEQ QGVDAKGNVI LYTAGDLAAT MALNVDLKVL SANGLYDFVT PFYQTVLDLQ QMPLEDPKVR QNLSARFYPS GHMVYLDGGS RTTLKHDLAQ MYESTVRDTA AVMRIRALQE KKRA
|
| |