Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A2223 |
Symbol | |
ID | 3692057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 2708460 |
End bp | 2710217 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637732477 |
Product | serine-type carboxypeptidase family protein |
Protein accession | YP_337374 |
Protein GI | 76819158 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00580188 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAA GAATACCGAT TGGGACTACT GTGGCAGCGC TTGCGACCGT CACGCTGGTT GCGTGCGGCG GCGACGATCC ATCGTCTGGT GCCGTGAGCG CCGGCAACTC GCCGGCCATA GTGCCGTCAG TACCCGCCGC AACGCCGGCG ACGCCAACCG CGACGGCCGC CGAGTCTGAA CAGGACGTGT TCGCAAGAAC TGCGCCGGCT GACAAAGTAT TGGATGACTC CAACGTCTAC GACACGACAA AAGACGGTTC CATCGCTTTG TCCAAGGTCA ACGAGGATTC GTCGGTCAAG CGACATACGA TCACCATCAA CGGGAAAGTA CTGCCGTATG TCGCGCGTGC CGGGCACCTG GTTGCTTACC GGCAAAACGG GGCGGGCAAG AAAGCCGAGG CGGCGATCTT CTATACGGCG TACACGCGTG ACGCACTGCC CAAGGAGCAT CGCCCGGTCA CGTTCTTATG GAACGGCGGC CCCGGATCGG CGTCGATCTG GCTGCACATG GGATCGTGGG GCCCGAAACG GCTCAAGTCC GACGCCCCGA ACATGGCCGA TCCCACGAAG CAGCCCGACA GCTTCCCGTT CGAGGACAAT GCGATTTCGC TACTCGACCA GTCGGACATC GTCTTCGTCG ATCCGCCGGG GACAGGTCTT TCCACAGCCA TTGCGCCGCT CAAGAACGGC GATCTATGGG GAACCGACGA TGATGCCCAA GTGGTTGCCG ACTTCATTAC GAGCTATACG AACAAGTACA ACCGGCAGAG CTCTCCGAAG TACCTGTACG GCGAATCCTA CGGCGGTATC CGCACGCCGA TCGTCGCCAA CCTGCTGGAG CAGGCAGGTA CCAGCGGCTA CGTTCCCGAT CCGTCCGGCA AGCCGGCCAG GGTGCTGGAC GGATTCATCC TCAATTCGCC GCTCGTCGAT TACAACTCCA ATTGCGACAT GATGGGCGGA AGAGTGACGT GCGAGGGTTA CATCCCGTCC TACGCCATGA CGGCAGACTA CTTCAAGAAA TCCGTCAAGC GCGGCACGCG AACGCAAGAG CAGTATCTCG GCGAGTTGCG CACGCTGGCG AGAACGACAT TCCAGACGAC CTACGGAACC TACTTCACCA ACGGCAAACC GAACAGCCAA TGGAATGCGT ATGCGGCGAG CACGGCGGGA CAGGCTCTCC TGAACCGGAT CGCCGACTAT ACCGGTATTC CCGCGTCGAC CTGGAACGGC TCGTTCAACT ACACGCCGAG GCCGTTTCGA AACGCGCTGG TTCCAGGCTA CGAACTCGGG CGCTACGATG CGCGCATGAA GGTACCGAAC GGGAACAGCT TCGCGGCGGA CAGCTACATC GACGTGGCCT TCCTTAACCA GCTCAAGACG TATTTTCCCG ATTTCGTCAA CTACAAGACC CAGTCGATCT ACGAGCCGCT CAATAATGCG ACGATCAGAA ACTGGAAATG GAAGAGGGCC GGGAGCAAAT ATGATTACCC ACAGAGCATT ACCGACATTC AAGCCGTTCT GACTGCCAAC CCCGATGCCA AGCTGCTGAT TCTTCACGGT TATGAAGATA TCGCGACGCC GGGTTTCCAG ACCGAACTGG ATCTCGAGGG GGTGAACCTG AGCGATCGCA TCCCGGTCAA ATGGTTCGAA GGCGGGCATA TGATCTACAA CACCGAGGCA TCGCGCGTGC CGCTGAAGCA GGCGATCGAC AGCTATTACA AGTCGCCGAC GCGTATCGCG GACGGTTCCT TACTATAA
|
Protein sequence | MAKRIPIGTT VAALATVTLV ACGGDDPSSG AVSAGNSPAI VPSVPAATPA TPTATAAESE QDVFARTAPA DKVLDDSNVY DTTKDGSIAL SKVNEDSSVK RHTITINGKV LPYVARAGHL VAYRQNGAGK KAEAAIFYTA YTRDALPKEH RPVTFLWNGG PGSASIWLHM GSWGPKRLKS DAPNMADPTK QPDSFPFEDN AISLLDQSDI VFVDPPGTGL STAIAPLKNG DLWGTDDDAQ VVADFITSYT NKYNRQSSPK YLYGESYGGI RTPIVANLLE QAGTSGYVPD PSGKPARVLD GFILNSPLVD YNSNCDMMGG RVTCEGYIPS YAMTADYFKK SVKRGTRTQE QYLGELRTLA RTTFQTTYGT YFTNGKPNSQ WNAYAASTAG QALLNRIADY TGIPASTWNG SFNYTPRPFR NALVPGYELG RYDARMKVPN GNSFAADSYI DVAFLNQLKT YFPDFVNYKT QSIYEPLNNA TIRNWKWKRA GSKYDYPQSI TDIQAVLTAN PDAKLLILHG YEDIATPGFQ TELDLEGVNL SDRIPVKWFE GGHMIYNTEA SRVPLKQAID SYYKSPTRIA DGSLL
|
| |