Gene Information |
Locus tag | BURPS668_A0975 |
Symbol | |
ID | 4888300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 945010 |
End bp | 946737 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640130915 |
Product | serine carboxypeptidase family protein |
Protein accession | YP_001061974 |
Protein GI | 126443991 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
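The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(945010, 946737)
print(length, protein_length_aa(length))
```

With the values listed in this record, the sketch gives 1728 bp and 575 aa, which agrees with the Gene Length and Protein Length fields.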
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAGCGC TTGCGACTGT CACGCTGGTT GCGTGCGGCG GCGACGATCC ATCGTCTGGT GCCGTGAGCG CCGGCAACTC GCCGGCCATA GTGCCGTCAG TACCCGCCAC AACGCCGGCG ACGCCAACCG CGACGGCCGC CGAGTCTGAA CAGGACGTGT TCGCAAGAAC TGCGCCGGCT GACAAAGTAT TGGATGACTC CAACGTCTAC GACACGACAA AAGACGGTTC CATCGCTTTG TCCAAGGTCA ACGAGGATTC GTCGGTCAAG CGACATACAA TCACCATCAA CGGGAAAGTA CTGCCGTATG TCGCGCGTGC CGGGCACCTG GTTGCTTACC GGCAAAACGG GGCGGGCAAG AAAGCCGAGG CGGCGATCTT CTATACGGCG TACACGCGTG ACGCACTGCC CAAGGAGCAT CGCCCGGTCA CGTTCTTATG GAACGGCGGC CCCGGATCGG CGTCGATCTG GCTGCACATG GGATCGTGGG GCCCGAAACG GCTCAAGTCC GACGCCCCGA ACATGGCCGA TCCTACGAAG CAGCCCGACA GCTTCCCGTT CGAGGACAAT GCGATTTCGC TACTCGACCA GTCGGACATC GTCTTCGTCG ATCCGCCGGG GACAGGTCTT TCCACGGCCA TTGCGCCGCT CAAGAACGGC GATCTATGGG GAACCGACGA TGATGCCCAA GTGGTTGCCG ACTTCATTAC GAGCTATACG AACAAGTACA ACCGGCAGAG CTCTCCGAAG TACCTGTACG GCGAATCCTA CGGCGGTATC CGCACGCCGA TCGTCGCCAA CCTGCTGGAG CAGGCAGGTA CCAGCGGCTA CGTTCCCGAT CCGTCCGGCA AGCCGGCCAG GGTGCTGGAC GGATTCATCC TCAATTCGCC GCTCGTCGAT TACAACTCCA ATTGCGACAT GATGGGCGGG AGAGTGACGT GCGAGGGTTA CATCCCGTCC TACGCCATGA CGGCAGACTA CTTCAAGAAA TCCGTCAAGC GCGGCACGCG AACGCAAGAG CAGTATCTCG GCGAGTTGCG CACGCTGGCG AGAACGACAT TCCAGACGAC CTACGGAACC TATTTCACCA ACGGCAAACC GAACAGCCAA TGGAATGCGT ATGCGGCGAG CACGGCGGGA CAGGCTCTCC TGAACCGGAT CGCCGACTAT ACCGGTATTC CCGCGTCGAC CTGGAACGGC TCGTTCAACT ACACGCCGAG GCCGTTTCGA AACGCGCTGG TTCCAGGCTA CGAACTCGGG CGCTACGATG CGCGCATGAA GGTACCGAAC GGGAACAGCT TCGCGGCGGA CAGCTACATC GACGTGGCCT TCCTTAACCA GCTCAAGACG TATTTTCCCG ATTTCGTCAA CTACAAGACC CAGTCGATCT ACGAGCCGCT CAATAATGCG ACGATCAGAA ACTGGAAATG GAAGAGGGCC GGGAGCAAAT ATGATTACCC ACAGAGCATT ACCGACATTC AAGCCGTTCT GACTGCCAAC CCCGATGCCA AGCTGCTGAT TCTTCACGGT TATGAAGATA TCGCGACGCC GGGTTTCCAG ACCGAACTGG ATCTCGAGGG GGTGAACCTG AGCGATCGCA TCCCGGTCAA ATGGTTCGAA GGCGGGCATA TGATCTACAA CACCGAGGCA TCGCGCGTGC CGCTGAAGCA GGCGATCGAC AGCTATTACA AGTCGCCGAC GCGTATCGCG GACGGTTCCT TACTATAA
|
Protein sequence | MAALATVTLV ACGGDDPSSG AVSAGNSPAI VPSVPATTPA TPTATAAESE QDVFARTAPA DKVLDDSNVY DTTKDGSIAL SKVNEDSSVK RHTITINGKV LPYVARAGHL VAYRQNGAGK KAEAAIFYTA YTRDALPKEH RPVTFLWNGG PGSASIWLHM GSWGPKRLKS DAPNMADPTK QPDSFPFEDN AISLLDQSDI VFVDPPGTGL STAIAPLKNG DLWGTDDDAQ VVADFITSYT NKYNRQSSPK YLYGESYGGI RTPIVANLLE QAGTSGYVPD PSGKPARVLD GFILNSPLVD YNSNCDMMGG RVTCEGYIPS YAMTADYFKK SVKRGTRTQE QYLGELRTLA RTTFQTTYGT YFTNGKPNSQ WNAYAASTAG QALLNRIADY TGIPASTWNG SFNYTPRPFR NALVPGYELG RYDARMKVPN GNSFAADSYI DVAFLNQLKT YFPDFVNYKT QSIYEPLNNA TIRNWKWKRA GSKYDYPQSI TDIQAVLTAN PDAKLLILHG YEDIATPGFQ TELDLEGVNL SDRIPVKWFE GGHMIYNTEA SRVPLKQAID SYYKSPTRIA DGSLL
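The sequences above are shown in space-separated blocks of ten characters, and the gene begins with GTG while the protein begins with M. A minimal sketch of how the two relate is given below, assuming the amino acid assignments of the standard genetic code (which translation table 11 shares) and assuming the first codon is a valid initiator that is read as methionine. All function names are hypothetical helpers written for illustration.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a cleaned CDS, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        # Assumption: the first codon is an initiator (e.g. ATG or GTG),
        # which is read as methionine at the start of a protein.
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single terminal stop symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence listed above.
prefix = clean_sequence("GTGGCAGCGC TTGCGACTGT CACGCTGGTT")
print(translate_cds(prefix))  # MAALATVTLV
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field.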