Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3454 |
Symbol | |
ID | 4882131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3379084 |
End bp | 3380718 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640129382 |
Product | serine carboxypeptidase family protein |
Protein accession | YP_001060465 |
Protein GI | 126438428 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.489337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGCAC GTAAAGCGCA GCTGTTATTA GCCGCGGTAT TCTCTTCGCT GCTGATCGCG GCGTGTAACG GAGACGATGC CGGTTCGCCC GCGAGCGCCG CGGCGAGCGA GAACAGTTCG GGCAACATCG ATGCCTCGTC CGGCGCGGAC AAGCCCTATC GGGATCCCAA CCGTTATTCG TCGAGCGGCA GCGCTTCGCT GTCGCCTTCC GCGGTGACCG AGAACGCCGC GATCACGCAT CACCAGATCA CGGTGAACGG CAAGACGATC AACTACACCG CGACCGCCGG CCACCTGAGC GCGCGCAACC CGCAAAGCGG CGCCGCCGAA GCGTCGTTCT TCTATGTCGC GTACACGGCC GACAATCAGC CGCTCGGCAA GCGGCCCGTC ACGTTTTTCT ACAACGGCGG GCCGGGCTCG TCGACGGTCT GGCTGCATCT CGGCTCGTTC GGGCCCAAGC GCCTCGTGAC GGGCGATCCG AACGCGAACA GCCCGACGCC GTTCCCGTTC GTCGACAACC AGGAAACGCT CCTCGACACG ACCGATCTCG TGTTCGTCGA CGCGATCGGC ACCGGTTTCT CGGAGGCGAT CGCGCCGAAC ACGAACCAGA CGTTCTGGGG CGTCGATCAG GATGCCGGAG CATTTCGCGA TTTCGTGATC CGCTATCTGC AGGTGAATCA GCGCAACGAT TCGCCGAAGT ACCTGTTCGG GGAATCGTAC GGCACGCCGC GCACCGACGT GCTCGCGAAC CTGCTCGAGA CGGCGGGCGT GAAGCTCGAC GGCATCGTGC TGCAATCGTC GATCCTCAAC TACAACAGCA ACTGCGACAT GGCGAGCAAC TCCGTCGGCG GCTCGAACAG CGGCTCGAGC GCGGTGAGCT GCGCAGGCTT CGTGCCGAGC TACGGCGCGA TCGGCGCGTA TTACCAGCTC GACAACCCGA ATCCGACGAA CCTGCCGCAG TATGCGGACC AGATGCGTTT GCTCACGGCG GGCACCTATT CGCCGGCCGT GGACGCGTAC CTCGCGAACC ATACGCCGCC TTCGTCGAGC CTCGTCACGA CGATGTCGAA CGCAACGGGC GCGACCGTGC CGCTCTGGCG CGCGAACTTC AACCTGCTGC CGACTTCGTA TGACAACAGC TACCAGCTCG CGCTGATTCC GGGTACGCTG ATCGGCCGCT ACGACGCGCG CGTGAACGTG CCGACGAACA GCCCGCTCGC GTCGGACGGC GATCCGTCGA GCAGCTTCAT CACGAAGCCG TTCACCGATA CGATCGGCAC CTATTTGCCG AACGTGCTCA AGTACACGGC GAAATCGGCC TATGCTGTGC AGAGCAACGC GATTGCCTCG TGGGACTGGA GCCACGACGG CCTCGATCTG CCCGACACGA TCCCCGATCT GGCCGCGGCG CTCACGCTCA ATCCCGCGCT GAAGGTGCTG TCGCTGAACG GCTATCACGA CATCGCGACG CCGTTCTATC AGACGGAACT CGATCTTGCC CGTCTCGGTG CGCAGCCGAA CCTGACGATC AAGGATTACC AGGGCGGGCA CATGGTCTAC CTGGACGATA CGTCCCGGCC GCAGGAAAAA GCCGATCTGG CGACGTTCTA CGCGGCGGCG CCTGCCGCGC GATGA
|
Protein sequence | MPARKAQLLL AAVFSSLLIA ACNGDDAGSP ASAAASENSS GNIDASSGAD KPYRDPNRYS SSGSASLSPS AVTENAAITH HQITVNGKTI NYTATAGHLS ARNPQSGAAE ASFFYVAYTA DNQPLGKRPV TFFYNGGPGS STVWLHLGSF GPKRLVTGDP NANSPTPFPF VDNQETLLDT TDLVFVDAIG TGFSEAIAPN TNQTFWGVDQ DAGAFRDFVI RYLQVNQRND SPKYLFGESY GTPRTDVLAN LLETAGVKLD GIVLQSSILN YNSNCDMASN SVGGSNSGSS AVSCAGFVPS YGAIGAYYQL DNPNPTNLPQ YADQMRLLTA GTYSPAVDAY LANHTPPSSS LVTTMSNATG ATVPLWRANF NLLPTSYDNS YQLALIPGTL IGRYDARVNV PTNSPLASDG DPSSSFITKP FTDTIGTYLP NVLKYTAKSA YAVQSNAIAS WDWSHDGLDL PDTIPDLAAA LTLNPALKVL SLNGYHDIAT PFYQTELDLA RLGAQPNLTI KDYQGGHMVY LDDTSRPQEK ADLATFYAAA PAAR
|
| |