Gene Information |
Locus tag | Anae109_2046 |
Symbol | |
ID | 5378130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. Fw109-5 |
Kingdom | Bacteria |
Replicon accession | NC_009675 |
Strand | + |
Start bp | 2316092 |
End bp | 2317669 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640843558 |
Product | protease Do |
Protein accession | YP_001379233 |
Protein GI | 153004908 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
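The length fields in the record above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 2316092
end_bp = 2317669

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in a stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1578 525
```

Both values agree with the Gene Length (1578 bp) and Protein Length (525 aa) fields.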
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.858833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0343309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCGCA TCATCCACCG AAGCATCGCC ACCACGTTCG TCGCGGCCCT GATCGCGTGC TCGCAGGACG GCCGCGCGGC CACCGAGCCC CAGCGTCCCG GCGTGAACGG CGCCCCGCTG TTCCAGGACG CCTCCCAGCA GGCGAGCCCG CAGCAGCCTG CGGAGGCCGC GGTTCCCCCT CAGGGCTCGC TCGCGCCGCT CATCGAGAAG GTCAAAGGCG CCGTGGTGAA CATCTCCACG ACGACGGTCA TCAAGCACCC GCAGACCCGC GGGATGCCCA ACCCCCACGG CCAGGCGCCG GGCGGTCGCG GCGGCCCCGG CGACGAGGAG TTCCAGGACT TCTTCGAGCG GTTCTTCGGC GGCCGCCCCG CGCCGCGGAT GCCGGAGGAG TTCCGCGGCT CCTCGCTCGG CTCGGGCTTC GTCATCAGCC CGGACGGCTT CATCCTCACG AACAACCACG TGGTGCAGGA CGCGACGGAC ATCCTCGTCC GGCTCACCGA CGGCCGCGAG CTCAAGGCCG AGACGGTCGG CCGCGACCCG GCGACGGACG TCGCCCTCAT CAGGCTCGTG AACCCGCCGA AGGACCTCCC GAACGTCGTG CTGGGCGACT CCGATGCGCT CCGGCAGGGC GACTTCGTGC TCGCGCTGGG CAGCCCGTTC GGCCTGCGCG ACACGGCCAC GCTCGGCATC GTCTCCGCGA AGCACCGCCG CGAGGTGAAC CCCACCGGCA CCTACGACGA CTTCATCCAG ACGGACGCGG CCATCAACTC CGGCAACTCG GGCGGCCCGC TGTTCAACCT GCGCGGCGAG GTCATCGGCA TCAACACCGC CATCGTCTCG CCACAGCTCG GCTCCGGCGT CGGCTTCGCC GTGCCCATCA ACCTCGCGAA GTCGATCCTC CCGCAGCTGC GCGAGAAGGG GAAGGTGACG CGCGGCTACG TGGGCGTCTC GATCACCGAT CTCAACCGCG ACCTCGCCCA GGGCTTCGGC CTCCCTCCGG ATCAGAAGGG CGCCCTCATC CAGGCCGTCG TCCCCCGCGG CCCCGCCGCG AAGGCGGGCG TGCAGCCCGG CGATGTCGTC GTCGCGGTGA ACGGCAAGCC CGTGACCTCC GGCGGCGATC TCACGCGCGC GGTCGCGCTG GTCCAGCCCG GCAGCAAGGT GGATCTCACG GTCGTGCGCA GCGGCCAGAA GAAGCAGTTC AGCTTCGCGG TCGCGCAGCG GCCCGATGAC GAGGAGGCGA TCGCCCGCGG GCAGGGCGGC GAGCAGGAGG AGGGCGGCGA CAAGGCCCCG AAGCTCGGTG TGACCCTCGG TGATCTCACC CCGCAGATCG CGCGGCAGCT CGGGATCGAG CCGGGGGAGG GCGTGCTCGT GCGCGACGTC GCCCCCGCCG GCCCGGCCGG GCGCGCCGGC ATCGAGCCGG GCATGGTCAT CGTCGAGCTG AACCGGAAGC CGGTGAAGAC GGTGCAGGAC GTCGCGCAGG CCATCGCGAA GATGAAGGAC GGCGAGGTCG CGCTCCTGCG CGTTCGCCGC GGTCAGGATC TCTTCTACGT GGCGGTCCCG GTGGGCGGGC GCCAGTAG
|
Protein sequence | MRRIIHRSIA TTFVAALIAC SQDGRAATEP QRPGVNGAPL FQDASQQASP QQPAEAAVPP QGSLAPLIEK VKGAVVNIST TTVIKHPQTR GMPNPHGQAP GGRGGPGDEE FQDFFERFFG GRPAPRMPEE FRGSSLGSGF VISPDGFILT NNHVVQDATD ILVRLTDGRE LKAETVGRDP ATDVALIRLV NPPKDLPNVV LGDSDALRQG DFVLALGSPF GLRDTATLGI VSAKHRREVN PTGTYDDFIQ TDAAINSGNS GGPLFNLRGE VIGINTAIVS PQLGSGVGFA VPINLAKSIL PQLREKGKVT RGYVGVSITD LNRDLAQGFG LPPDQKGALI QAVVPRGPAA KAGVQPGDVV VAVNGKPVTS GGDLTRAVAL VQPGSKVDLT VVRSGQKKQF SFAVAQRPDD EEAIARGQGG EQEEGGDKAP KLGVTLGDLT PQIARQLGIE PGEGVLVRDV APAGPAGRAG IEPGMVIVEL NRKPVKTVQD VAQAIAKMKD GEVALLRVRR GQDLFYVAVP VGGRQ
|
| |
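The sequences above are displayed in space-separated blocks of ten characters. As a hedged sketch of how such a field might be processed, the Python below strips the spacing, computes a GC fraction, and translates codon by codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. Translation table 11 assigns the same amino acids to codons as the standard code; it differs in the permitted start codons, which this sketch does not handle (this gene begins with ATG, so that does not matter here).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (the conventional NCBI layout).
# Tables 1 and 11 share these codon assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)

def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    A trailing partial codon is ignored. Alternative start codons
    (e.g. GTG, TTG) are NOT converted to methionine in this sketch.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three display blocks of the gene sequence from the record.
prefix = clean("ATGCGCCGCA TCATCCACCG AAGCATCGCC")
print(translate(prefix))  # prints: MRRIIHRSIA
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence field through `clean` and `gc_fraction` gives a value that can be compared with the GC content field.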