Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | AnaeK_2069 |
Symbol | |
ID | 6786451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. K |
Kingdom | Bacteria |
Replicon accession | NC_011145 |
Strand | + |
Start bp | 2325720 |
End bp | 2327291 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 642763529 |
Product | protease Do |
Protein accession | YP_002134426 |
Protein GI | 197122475 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.719287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCTCA TCACCCGAAT CGTCACCGTC TCTCTCGCAG CCGCAGCGAT CTTCGCGTGC ACGCGGGACG GCAGCGCGGC GACCGCATCC GCCGCGCCCG CGGCGGCGCC GGCCCAGCTG TTCCGCGACG CCGCGGCGGC CGCGCCCGGC CCCGAGGCCG CCATCCCGGT GCAGACCTCG CTCGCGCCGC TCATCGACAA GCTCCGCCCG GCGGTGGTGA ACATCTCCAC CACCACCGTC ACCAAGCACC CGCGCGTCCA GCGCGGCCCA CGCGGCCAGA ACCCGCACGG CGGCGGCACG CCGGACGAGG GCTTCGAGGA CTTCTTCGAG CGCTACTTCG GCCGCCCCGC GCCGGAGATG CCCGAGGAGT TCAAGGGCTC GTCGCTCGGC TCCGGGTTCC TGCTCAACAC CGAGGGCTAC ATCCTCACCA ACAACCACGT GGTGAAGGAC GCCACCGACA TCCGCGTGCG CCTCTCGGAC GACCGCGAGT TCGGCGCCAG GATCGTCGGC CGCGATCCGC TCACCGACGT GGCGCTCATC CAGCTCGTGA ACCCTCCGAA GAACCTGCCG ACGGTGGTGC TCGGCGACTC CGACGCGCTC CGCCAGGGCG ACTTCGTGCT CGCGCTGGGC AGCCCGTTCG GCCTGCGCGA CACGGCCACG CTCGGCATCG TGTCGGCGAA GCACCGCCCC GGCATCAACC CCGGCGGCAC CTACGACGAC TTCATCCAGA CCGACGCCGC CATCAACCCC GGCAACTCGG GCGGCCCGCT GTTCAACCTC CGCGGCGAGG TGGTCGGCAT CAACACCGCC ATCGTGTCGC CGCAGATCGG CCAGGGCATC GGCTTCGCGG TGCCCATCAA CATGGCGAAG GCGCTGCTGC CGCAGCTCAA GGAGAAGGGC AAGGTCACGC GCGGCTTCCT GGGCGTGTCG GTGTCCGACC TCTCGCCGGA TCTCATCCAG GGCTTCGGCC TGCAGTCCGG CACCAAGGGC GCGCTGGTCC AGAACGTGGT CCCGCGCTCG CCGGCCGACA AGGCGGGGCT GCAGCCCGGC GACGTGGTCG TCGCGCTGAA CGACAAGACG GTCGAGACCG CCGGCGCGCT CACCCGCGGC GTCGCGCTGG TCGCGCCGGG CCAGACCGCG AACCTGACCG TGCTGCGCGG CGGCCAGAAG AAGCAGTTCG CGGTGAAGGT CGTGCAGCGG CCCGAGGACG GGGAGGCCGT CGGCCGCAAC GAGCAGGGCG GCGGCGACGA AGGCGGCGGG CAGGGCGCCC GCGATCAGTC GCCGAAGCTC GGCGTCTCGA TCGCGCCCAT CACCCCGGAC GTCGCGCGCC AGTTCGGCGT CGAGCCGGGC GAGGGCGTGG TGGTGGTGGA CGTCACCGAA GGTGGCCCGG CCGATCGCGC CGGCATCCGC CGCGGCGACG TCATCCTCGA GGCGAACCGC CAGAAGGTGG CGCGGCCGGA GGACATGCGG TCGGCGGTGG CGAAGCTGAA GGAGGGCGAC ATGGCGCTTC TGCGCGTTCG CCGCGGCGAC GCCGCCGTGT TCATCGCGGT GCCGGTGGGC GGCGGCAAGT AG
|
Protein sequence | MRLITRIVTV SLAAAAIFAC TRDGSAATAS AAPAAAPAQL FRDAAAAAPG PEAAIPVQTS LAPLIDKLRP AVVNISTTTV TKHPRVQRGP RGQNPHGGGT PDEGFEDFFE RYFGRPAPEM PEEFKGSSLG SGFLLNTEGY ILTNNHVVKD ATDIRVRLSD DREFGARIVG RDPLTDVALI QLVNPPKNLP TVVLGDSDAL RQGDFVLALG SPFGLRDTAT LGIVSAKHRP GINPGGTYDD FIQTDAAINP GNSGGPLFNL RGEVVGINTA IVSPQIGQGI GFAVPINMAK ALLPQLKEKG KVTRGFLGVS VSDLSPDLIQ GFGLQSGTKG ALVQNVVPRS PADKAGLQPG DVVVALNDKT VETAGALTRG VALVAPGQTA NLTVLRGGQK KQFAVKVVQR PEDGEAVGRN EQGGGDEGGG QGARDQSPKL GVSIAPITPD VARQFGVEPG EGVVVVDVTE GGPADRAGIR RGDVILEANR QKVARPEDMR SAVAKLKEGD MALLRVRRGD AAVFIAVPVG GGK
|
| |