Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_A0888 |
Symbol | |
ID | 4679062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008785 |
Strand | + |
Start bp | 884308 |
End bp | 886155 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639845162 |
Product | serine-type carboxypeptidase family protein |
Protein accession | YP_992228 |
Protein GI | 121599966 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAC AGAAGTCCTT GAAAGACGGT TTCACGCTCG GATGGTGCAG GGCGGCACGG CCGGTTGCCG CTGCCGCGCT GGCCGCGCTG CTCGTCGCCG CGTGCGGCGG CGACGACGGC GGCGGCGGGA GCCCGTCGCT CGCGGCCGCG AACGTCGCGA ACACGAGCAC GTCGACGAAC GCGACGACGG CCGCCGATGC GACGACCAAT GCCGCGCTGC CGCCGGATCA GCCGTATATT GACAACGACG TCTATGGCAC CGGGCCGAAC GATTCGGTCA GCGACGCGAC GGAGGGCACC GCGGTCGTGC ACCGGCAGGT GAAGATCGGC GATCAGATCC TCACCTACAC GGCGACGGCC GGCCACCTCG TGACGATCGA TCCGATCACG TCGAAGCCGA ACGCGAAGAT GTTCTACGTC GCGTACACGC TCGACAATCC GAACCCGGGC AAGCCGCGCC CCGTCACGTT CTTCTACAAC GGCGGCCCGG GCTCGTCGTC GGTGTACCTG CTGCTGGGCT CGTTCGGGCC GAAGCGCCTG CAGTCGTCGT TCCCGAACTT CACGCCGCCC GCGCCGTACC GGCTGCGCGA CAACCCCGAG AGCCTGCTCG ACCGCTCCGA TCTCGTGTTC ATCAATCCGG TCGGCACCGG CTACTCGGCC GCGATCGCGC CGGCGAAGAA CAAGGATTTC TGGGGCGTCG ACCAGGACGC GCACTCGATC GACCGCTTCA TCCAGCGCTA CCTGACGAAG TACGCGCGCT GGAACTCGCC GAAGTTCCTG TTCGGCGAAT CGTACGGCAC GGCGCGCAGC GCGGTGACCG CGTGGGTGCT GCATGAGGAC GGCATCGAGC TGAACGGGAT CACGCTGCAG TCGTCGATTC TCGACTATGC GAACGCGGTG AGCGCGATCG GCATCTTCCC GACGCTCGCG GCCGATGCGT TCTACTGGAA CAAGACGACC ATCAGCCCGA AACCGGCCGA TCTGGATGCG TACATGGCGC AGGCGCGCAG CTATGCGGAC AACGTGCTCG CGCCGCTCGC GCAGGCGCCG AATCCGCAGG ACGGCGGCTT CGTCAACGTG CGGCTGAACC TGAACGTCGC GACCGCGCAG CAGATGGGCG CGTACATCGG CACCGATCCG ATCTCGCTGG TCCAGACGTT CGGCAATCCG GCCGCGCTCG GCAACGTGCC GTCGTCCAAC GACAACCCGC CGTACACGTT CTTCCTGACG CTCGTGCCGG GCATCCAGAT CGGCCAGTAC GACGGACGCG CGAACTACAC GGGCAAGGGC ATCGCGCCGT ACATCCTGCC GAACTCGGGC AGCAACGATC CGTCGATCAG CAACGTCGGC GGCGCGTACA CGGTGCTGTG GAACGACTAC ATCAACAACG ACCTGAAGTA TGTGTCGACG TCGTCGTTCG TCGATCTGAA CGACCAGGTG TTCAACAACT GGGACTTCAG CCACACGGAC CCGACGGGCG CGAACCGCGG CGGCGGCAAC ACGCTGTACA CGGCGGGCGA TCTCGCCGCG ACGATGAGCC TGAACCCGGA CCTGAAGGTG CTGTCGGCGA ACGGCTATTT CGACGCGGTG ACGCCGTTCC ACCAGACCGA GCTCACGCTC GCGCAGATGC CGCTCGATCC GTCGCTGAAG TCGGCGAACC TGACGATGAA ATACTATCCG TCGGGCCACA TGATCTATCT GAACGATCAC TCGCGGATCG CGATGAAGGC GGATCTGGCG ACGTTCTACG ACGGCATCCT CGCGGACCGC ACGGCGATGC GGCGCGTGCT GCTGCGCCAG CAGAAGGCGC TGCAGTTGAA GCAGCAGAAG CAACAGCAAG GGCAGTGA
|
Protein sequence | MKIQKSLKDG FTLGWCRAAR PVAAAALAAL LVAACGGDDG GGGSPSLAAA NVANTSTSTN ATTAADATTN AALPPDQPYI DNDVYGTGPN DSVSDATEGT AVVHRQVKIG DQILTYTATA GHLVTIDPIT SKPNAKMFYV AYTLDNPNPG KPRPVTFFYN GGPGSSSVYL LLGSFGPKRL QSSFPNFTPP APYRLRDNPE SLLDRSDLVF INPVGTGYSA AIAPAKNKDF WGVDQDAHSI DRFIQRYLTK YARWNSPKFL FGESYGTARS AVTAWVLHED GIELNGITLQ SSILDYANAV SAIGIFPTLA ADAFYWNKTT ISPKPADLDA YMAQARSYAD NVLAPLAQAP NPQDGGFVNV RLNLNVATAQ QMGAYIGTDP ISLVQTFGNP AALGNVPSSN DNPPYTFFLT LVPGIQIGQY DGRANYTGKG IAPYILPNSG SNDPSISNVG GAYTVLWNDY INNDLKYVST SSFVDLNDQV FNNWDFSHTD PTGANRGGGN TLYTAGDLAA TMSLNPDLKV LSANGYFDAV TPFHQTELTL AQMPLDPSLK SANLTMKYYP SGHMIYLNDH SRIAMKADLA TFYDGILADR TAMRRVLLRQ QKALQLKQQK QQQGQ
|
| |