Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10247_A0716 |
Symbol | |
ID | 4891130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009079 |
Strand | + |
Start bp | 664027 |
End bp | 665691 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640146992 |
Product | serine carboxypeptidase family protein |
Protein accession | YP_001077917 |
Protein GI | 126446177 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCG ATTCGACTTC CTCCGGCGGC GCGCAGCCGC TCCATCACGG CGCGAACGGC TCGGTTCACG CGCCGCCGCC GGTCATCGTC GCGCCGAAGG ACGACGGCGA CCAGCCGTTC TTCGATCCGG TCGCCTACGG CAACGGCCCC GACGATTCGG TGACGGACAC CACCGAGGCC GCCGCGATCA CGCACCACAC GGTCCGGATC GACGGCCGCA CGATCGCGTA CACGGCCGCG GCGGGCCATC TCGTGACCGT CGATCCGAGC AGCTCGCAGC CGGATGCGAA GATCTTCTAC GTCGCGTTCA CGCAGGACGG CCAGCAGGAG CAAACGCGCC CCGTCACGTT CTTCTACAAC GGCGGGCCGG GCTCGTCGGC CGTGTTCGTG CTGCTCGGCT CGTTCGCGCC GCGGCGCATC CGCACGTCGA TGCCGAGCTT CACGCCGCCC GCGCCGTACC GGATGGAAGA CAACCCGGAC AGCCTGCTCG ACAAGAGCGA TCTCGTGTTC ATCAACCCGG TCGGCACCGG CTATTCGGCG GCGATCGCGC CGCGCAAGAA CCGCGATTTC TGGGGCGTCG ATCAGGACGC GAACTCGATC AAGCAGTTCA TCAAGCGCTA TCTGACGAAG CACAACCGGT GGAATTCGCC GAAGTACCTG TTCGGCGAAT CGTACGGCAC CGCGCGCAGC TGCGTGCTCG CGTACAAGCT GCACGAGGAC GGCGTCGACC TGAACGGGAT CACGCTGCAG TCGTCGATTC TCGATTACCG GCAGGCGGGC AATCCGGTAG GCGCGCTGCC CACCGCGGCG GCCGACGCGT GGTATCACAA GCGGCTCGGC GTCGCGCCGA CGCCGACCGA TCTCGGCGCG TTCGTGGAGG AGGTCGCGCA GTTCGCGCGC ACCGACTATC TCGGCGCGCT GCGCAAGTTC CCGCAGGCCG ATGCGGCCGT CGTCAAGAAG CTGTCCGACT ACACCGGCAT CGACACGACG ACGTTGCTGT CGTGGAGCCT CGACATCGCG GGCTACGACG CGCGCGGCAA CGCGCTGTTC CTCACGACGC TGCTGAAGGC ACAAGGCCTC GCGCTCGGCG CGTACGACGG CCGCGTGACG GGAATCGAAT CGGGGATCGC GGGCCGGATC GATCCGAACT CGGGCGGCAA CGATCCGACG ATGACGGCTG TGTCGGGCGT CTACACGGCG ATGTGGAATA CGTACCTGAA CGAGCAGTTG AAGTACACGT CGAACTCGTC GTTCACCGAC CTGAACGACC AGGCATTCAA GTACTGGGAC TTCGGCCACA TCGATCCGAC GGGCGAACAG CAGGGCGTCG ACGCGAAGGG CAACGTGATC CTGTACACGG CGGGCGATCT CGCCGCGACG ATGGCGCTCA ACGTCGATCT GAAGGTGCTG TCGGCGAACG GGCTCTACGA TTTCGTCACG CCGTTCTACC AGACGGTGCT CGATCTGCAG CAGATGCCGC TCGAGGACCC GAAGGTGCGG CAGAACCTGT CCGCGCGCTT CTATCCGTCC GGACACATGG TGTACCTCGA CGGCGGCTCG CGCACCACGC TCAAGCACGA CCTCGCGCAG ATGTACGAAT CGACGGTGCG CGACACCGCG GCGGTGATGC GCATTCGCGC GTTGCAGGAG AAAAAGCGCG CGTAG
|
Protein sequence | MSIDSTSSGG AQPLHHGANG SVHAPPPVIV APKDDGDQPF FDPVAYGNGP DDSVTDTTEA AAITHHTVRI DGRTIAYTAA AGHLVTVDPS SSQPDAKIFY VAFTQDGQQE QTRPVTFFYN GGPGSSAVFV LLGSFAPRRI RTSMPSFTPP APYRMEDNPD SLLDKSDLVF INPVGTGYSA AIAPRKNRDF WGVDQDANSI KQFIKRYLTK HNRWNSPKYL FGESYGTARS CVLAYKLHED GVDLNGITLQ SSILDYRQAG NPVGALPTAA ADAWYHKRLG VAPTPTDLGA FVEEVAQFAR TDYLGALRKF PQADAAVVKK LSDYTGIDTT TLLSWSLDIA GYDARGNALF LTTLLKAQGL ALGAYDGRVT GIESGIAGRI DPNSGGNDPT MTAVSGVYTA MWNTYLNEQL KYTSNSSFTD LNDQAFKYWD FGHIDPTGEQ QGVDAKGNVI LYTAGDLAAT MALNVDLKVL SANGLYDFVT PFYQTVLDLQ QMPLEDPKVR QNLSARFYPS GHMVYLDGGS RTTLKHDLAQ MYESTVRDTA AVMRIRALQE KKRA
|
| |