Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_4779 |
Symbol | |
ID | 6280182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010676 |
Strand | + |
Start bp | 885006 |
End bp | 886667 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642615861 |
Product | peptidase S10 serine carboxypeptidase |
Protein accession | YP_001888514 |
Protein GI | 187919483 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.138285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.00136432 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGAATA CCGAGTCAGC TTCTGTTAGT CCGCAGCCGT CGCACAATTC GCATGCTTCA GGCGCGCCCG CGCCCGCGCC GGCCAGCGCG CCGCACAAGG CCAAAGATCA GCCGTTCTTC GACCCGGTTG CTTACGGCAA CGGGCCGGAT GATTCCGTGA CCGAAACCGA TGAAAGCGCG GCGATCACGC ACCACTCGGT CACCATCGGC GGGCACAAGA TCGACTACAC GGCGACGGCG GGCCACCTCG TCATCGTCGA CCCGAGTAGT TCAAAGGCGG AAGCCCGCAT GTTCTACGTG GCGTTCACGC AGGACAATCA GAAGGAAGAA GCCCGGCCGG TCACGTTCTT CTATAACGGC GGGCCTGGGT CGTCGTCCGT TTTCGTGCTG CTGGGCTCGT TCGCGCCGCG CCGCATCAAG ACGTCGATGC CGAGCTTCAC GCCACCCGCG CCGTATTCGA TGGAAGACAA CCCGGACAGC CTGCTCGACA AGAGCGACCT CGTCTTCATC AACCCGGTCG GCACCGGCTA CTCGGCGGCG ATCGCGCCGA AGAAGAACCG CGACTTCTGG GGCGTCGACC AGGACGCGGA CTCGATCAAG CAGTTCATCA AGCGTTTTCT GACCAAGAAC AACCGCTGGA ATTCGCCGAA GTACCTGTTC GGCGAGTCGT ATGGCACGGC GCGCAGTTGC GTGCTGGCGT ACCGTTTGCA CGAAGATGGC GTGGATCTGA ACGGCATCAC GCTGCAATCG TCGATTCTCG ATTACACGCA GGCCGGCAAC CCGGTGGGCG CGCTGCCCAC CGCGGCCGCG GACGCGTGGT ATCACAAGAA GCTCGGCATC GCGCCGCGGC CGACCGACCT CGGCACGTTC GCCGAAGAAG TCGCGCAGTT CGCGCGCACG GACTATCTGG CCGCGCTGCG TAAGTTTCCG ACCACCGACG CGGCAACGGT CGAAAAGCTC AGCGAATACA CGGGCATCGA CAAAACGACC TTGCTCGCGT GGAGTCTGGA TGTCGCCTCG TACGACAGCC GGGGCAATTC GTTGTTCCTC ACCACCCTGC TGAAATCCAA GGGTCTTGCG CTCGGCGAGT ACGACGGCCG TGTGACGGCG ATAGGCACGG GCATTGCCGG CAAGATCGAC CCGAATTCCG GCGGCAACGA CCCGACCATG ACGGCGGTGA CCGGCGTCTA CACGACGATG TGGAACGTCT ATCTGAACGA GCAACTGAAG TACACGTCGA ATTCGTCGTT CACGGATCTG AACGACCAGG CCTTCAAGTA CTGGGACTTC AGTCACATCG ATCCGACCGG CGCGCAGAAG GGCGTCGACT CGAAGGGCAA CATCATTCTG TACACCGCCG GGGACCTGGC TGCCGTGATG GCGCTCAATC CCGACCTGAA GGTGCTGTCG GCCAACGGCT TCTTCGATTT CGTCACGCCG TTTTATCAGA CCGTGCTCGA TTTACAGCAA ATGCCGCTGC TCAGCCAGCA GGTCCGGCAG AATCTGTCGG CGCGCTTTTA TCCGTCGGGG CATATGGTTT ATCTCGACGG CGGATCGCGC ACCGCGCTGA AGGCCGATCT CGCGAAGATG TACGACACGA CGGTGTCCAA TACGCAGGCG CTGCTTCGTA TCCGCGCATT GCAGGCGCGT GTGGCGCAGT AG
|
Protein sequence | MTNTESASVS PQPSHNSHAS GAPAPAPASA PHKAKDQPFF DPVAYGNGPD DSVTETDESA AITHHSVTIG GHKIDYTATA GHLVIVDPSS SKAEARMFYV AFTQDNQKEE ARPVTFFYNG GPGSSSVFVL LGSFAPRRIK TSMPSFTPPA PYSMEDNPDS LLDKSDLVFI NPVGTGYSAA IAPKKNRDFW GVDQDADSIK QFIKRFLTKN NRWNSPKYLF GESYGTARSC VLAYRLHEDG VDLNGITLQS SILDYTQAGN PVGALPTAAA DAWYHKKLGI APRPTDLGTF AEEVAQFART DYLAALRKFP TTDAATVEKL SEYTGIDKTT LLAWSLDVAS YDSRGNSLFL TTLLKSKGLA LGEYDGRVTA IGTGIAGKID PNSGGNDPTM TAVTGVYTTM WNVYLNEQLK YTSNSSFTDL NDQAFKYWDF SHIDPTGAQK GVDSKGNIIL YTAGDLAAVM ALNPDLKVLS ANGFFDFVTP FYQTVLDLQQ MPLLSQQVRQ NLSARFYPSG HMVYLDGGSR TALKADLAKM YDTTVSNTQA LLRIRALQAR VAQ
|
| |