Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0393 |
Symbol | |
ID | 4901575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 356014 |
End bp | 357705 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640133623 |
Product | serine carboxypeptidase family protein |
Protein accession | YP_001064676 |
Protein GI | 126451557 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGGAG GGTATGGCGG CCTCGCGCGC CCGCGCGGCC GGATGCCGGC GACGGCCGCC GTTGCCGCCG CGTTGCTGCT CGCGCTCGGC GGGTGCGGCG ACGATCTGCA GAGCACCACG ACGCCCGCGC AGCTCAATCA ACCGTACACG GACACGACCG CGTATTCGCC GAAGGCGGGC GATGGGCTGC CGGCGTCGCA GGTATCCGAA CGCGCCGCGG TGATGAGCCA CCAGTGGACG GCGAACGGCG CGAGCGTCGA TTACCTGACG ACGACCGGCC ACCTGACGGC CACCGATCCG AACGGCAACG CGGAGGCGAC GATGTCGTAC GTCGCGTATA CGGCGCCGAG CCGCGACGGC TCGCCGCGGC CCGTCACGTT CTTCTACAAC GGGGGGCCGG GCTCGTCGTC GGTGTGGCTG CGGCTCGGCT CGTTCGCGCC GACGCGGGTC GCGACGCCCG ATCCGCTGAT GACGAACTGG CCGAATTTCC CGCTCGTCGA CAACCCGGAG AGCCTGATCG CGACCACCGA CATGGTGTTC ATCGATCCGC CGGGCACGGG CCTGTCGGAG GCGATCCAGC CGAACACGAA CCAGACGTTC TGGGGAGCGG ACGCCGACGT GAAGGTGATG CGCGATTTCA TTCGCCGCTA CCTGTCGGTC AACGGCCGCG GCGGCTCGCC GATCTATCTG TACGGCGAAT CGTACGGCAC GCCGCGCACG GATATGCTCG CGCTCGCGCT CGAATCGGCG GGCGTGCCGC TCACGGGCAT CGTGCTGCAG TCGTCGATCC TGAACTACAT GGCGGCCGCG GGCGACCAGG CGGTGGGCAC CTTTCCGTCG TACGCGCAGG TGGCCGCATA CTTCAACCAG GTGTCGCCGT CGCCGACGAA TCTGGGCGCG TATGCGCAGC GCATCGAGAA TTTCGTGACC GCGCAGTACG CGCCGATCGT GCATTACGCG ACGGCTTCGT CGCCGATCTC GCCGGACGCC GGCACGCTCG CCGCGTGGTC GTCGCAAACG GGCATGGCCA CCGCGTCGAT CGGCGCGTAC TTCCAGTATT TCTACGATAC GGAGCCGTCG CCCGGCCAGA CGACGCTCGT GCCCGGCTAC ACGATCGGCC GCTACGACGG CCGCGTGTCG CTGCCGAACG GCGACGCGCG CCTCGCGAGC GACGACGATC CGTCCGACAT CCTGATCTCG AAGCCGTTCA CGAGCGCGCT CGCGTCGCAG ATGCCGAACT ACCTCGGCTA CACCGCGCCG AACGCGACGT ATCAGACGCT CAATCCCGAC ATCATCGGCG TGTGGAACTT CAGCCACGCG GGCCAGCCGT ATCCGGACAC GATCCCGGAT CTGCTTGCCG CGCTGCAACT GAATCCGAAG CTGAAGGTGC TCGCGTCGAA CGGCTATCAT GATCTCGCGA CGCCGTACTT CGAAACGGAG AAGGAGCTCG CGCGGCTGCA GACGGTGTCC GGCCTCGCGC CGAATCTGCA GGTGACGTTC TACCAGGGCG GCCACATGAT CTATCTCGAC GACGTCGCGA GGCCGCAGAT GCAGGCGGAT CTCGTCGCGT TCTACCAGAA CCGGCCGGTG GCAAACGCGT TGACGCTCGC GGCGCTGCCG TCGCCGTGGC CCGACGAAAG CCCGGCGAAC ACGCCGACGG CGAAGATCGC GCGGGCCGCC GCGGCCCGCT GA
|
Protein sequence | MHGGYGGLAR PRGRMPATAA VAAALLLALG GCGDDLQSTT TPAQLNQPYT DTTAYSPKAG DGLPASQVSE RAAVMSHQWT ANGASVDYLT TTGHLTATDP NGNAEATMSY VAYTAPSRDG SPRPVTFFYN GGPGSSSVWL RLGSFAPTRV ATPDPLMTNW PNFPLVDNPE SLIATTDMVF IDPPGTGLSE AIQPNTNQTF WGADADVKVM RDFIRRYLSV NGRGGSPIYL YGESYGTPRT DMLALALESA GVPLTGIVLQ SSILNYMAAA GDQAVGTFPS YAQVAAYFNQ VSPSPTNLGA YAQRIENFVT AQYAPIVHYA TASSPISPDA GTLAAWSSQT GMATASIGAY FQYFYDTEPS PGQTTLVPGY TIGRYDGRVS LPNGDARLAS DDDPSDILIS KPFTSALASQ MPNYLGYTAP NATYQTLNPD IIGVWNFSHA GQPYPDTIPD LLAALQLNPK LKVLASNGYH DLATPYFETE KELARLQTVS GLAPNLQVTF YQGGHMIYLD DVARPQMQAD LVAFYQNRPV ANALTLAALP SPWPDESPAN TPTAKIARAA AAR
|
| |