Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0995 |
Symbol | |
ID | 8413867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 1123190 |
End bp | 1125820 |
Gene Length | 2631 bp |
Protein Length | 876 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 645022584 |
Product | peptidase U32 |
Protein accession | YP_003180015 |
Protein GI | 257784798 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAC GTAATCGTCG TGAGCGAGAA TTTACAGCTC ACGAGTTAAA TCGCATGAAC GAGCTGGAGT GGACGGAGGA AGCCTCTGAC TTAGAGTTTT CATCTCAACC GCTTCCTCGT GCAACACAGA TGGAACTTCT CGCACCAGCG GGAGGACCTG CACCGTTTGC TGCTGCTTTG GCTGGCGGTG CGGACGCTAT TTATTGCGGT TTGGGTAACA ACTTTAACGC GCGCCGTGGC GCAGACAACT TTGATGACGA GTCTTTTGCA CGTGCTTGTA GGCAAGCTCA TCTGGCTGGC GCTCGCGTGT ACGTCACTGT TAATGTTGTT GTTAAGTGGG ATGAAATGCA GCGCGTACTG CGCCTGATTC GTCGTGCATG GATTCTTGGA GCAGATGCTT TTATCATCCA AGACTGGGGT CTTATGGCTC AGGTTAGAAA GACTTGGCCA GAGATTGAAT GTCACGTATC AACGCAGGCA AACATTCACG ATACGCGTGG TGTTTCTGCC TGTAAAAAGC TTGGTGTAGG ACGCGTTACC CTGTCCAGAG AGCTTACTAA AGAAGAGATT TCTACTATTT CCAAGCTTGG TGTTGAACTA GAGTGCTTTG GTCACGGCGC CTTGTGTTTC TGTTATTCGG GTATTTGCCA TATGTCATCC ATGCGTGGAG ACCGTTCTGC TAACCGCGGA GCTTGTGCTC AGCCGTGTCG TCTTCCGTAT GAGCTGCTCA ATTCTAAGCA CGAAGTTGTC TCGATGGGCG GCATTGATAG GCTGCTGTGT CCTAAGGATT ATTGCACTAT TGATGACGTG CCGGACATGA TTGAGGCGGG CGTGGGCTCG CTTAAGATTG AAGGTCGTAT GAAGGCGCCT GAGTACGTGT ACTCTGTTGT GTCTTCGTAT CGCCAGGCAA TTGATGCTGC CGAGAAAGGC GTTGATAACC AGGCCGATGT GGCACGTCGC CATCGTTTGC TTAAACGTTC GTTTAACCGC GGCTTGACTA ATGCATATCT GCACGGCACC GCCGGCAACA AGATGATGAG CTACGAGCGC TCCAATAACC GCGGCGAGGT TGTAGGTGAG GTTACTGGAG GACGTTCGCT TGAAGATGCT CTTGAGCGCA AGAGTGGCTT GAATGGTGGC CGCGTTAAGC TGCGTCGCTA CAAGCAGGCT GACGTTGACC TTATTGCACA TGCGCCTATT GGCAAGAATG ACCTGCTTGA GATTAGACCT TTAGATGAGC CTGACAAGTT CCTGACCGCT CTTTGTCCTA CAGATGTTGA ACCTGGTCAG AAGGTCACCG TCCGCACTTC TCGCGTTATG CAGACTGGTT CTACGGTGCG CATTATTCGC TCTGAGGCAG CACGTGTGGC AGCTGAGCAG ATTTCTTCAC TGGAATATCC TCGCAAACGA GCTGTTGATG TGACTATTAT TGCTCGCATC GGCCAGCCTT TTACCGTGGT GCTTACTACA ACAGATGGAG CGGCAAGTGC GTCTGCTGAA GGCTTTGTGG TTGAGGAAGC TCGCACTAAG GCGGTAACTT CTGACGAGCT TATTGAGCAC GTAGGACGTA TGGGGACCTC TCCATTTGAG GCAGTGAGTT TTGACGTACA GATGGATGAC GCGTGTGGCA TGAGTTTTAG TGCTGTTCAC AAGGTTCGCG CCGCAGCATG TGAGCAGCTT GAGGCTGCGC TTCTGGAGGA GTACCAGGAT CGTGAGTATA AGATTGCTCC GCTTTCACGC CTTGCCTATC AAAAGGAGCG AGAAGCTCAG GATCAAGAGA AACTCTTTGT TTTTGACAAG GCAGCTGCAA AAACAAATGC ATCGCAGGCA GAGATATGTG TTCTGGTTGA GACTCCTGAG CAGGCACGCG TTGCACTCAA GGCGGGAGCA GATCGTCTGT ACGCAACAAC AGATGTACTT ACAGACGCTT CGTGGCCAGA AGATCTTCTC GCCAAGATAG TGCCATGGCT TGACGAGGTG TGCCGCGAGA TTGACCACAA TCGTTTGGAT CCGTACGTCG TATCTGGTAA GCCGATTGCC GTAGGCAACA TCTCTGAGCT AGCGCTGGCC GTTGAGCGTG GAGCTGTCCC AGAGGTGCGT GAGTGTATTC CAATCCACAA CGATTACGCT TTGCAGGCTC TTGCTGACAT GGATGCAGAA GGCGTCTGGC TCAACTCAGA ACTTACCCTC CAAGAGATTT GTCACATGGC GAGAAACGCC TCGATTCCTG TGGGCTATAT GGTTAGCGGC CGTATTCGTA CCATGACAAC TGAGCATTGT ATTTTGATGT CTACGGGTAA GTGCATTCAC GATTGTGATG CCTGTCAGTT GCGCCTTGAG GAGCATACGC TTAGGGGTAT TGATAATGAT TACATGCCCG TAAGAACTGA CAGACACGGT CGCTCAAAGA TTTGGAGTCC TAAACTTTTT GATGGTGTGC CAGAAATCTC TGAGATGCTT TCAGCCGGCG TGAAGCGTTT TATGGTGGAC GCAACACTTT TGAGTGTGGA GCAGACAAGA GAGGCCACTT CTCGCGTAGC TGCAGCGATT GAGGCAACAG CGTCGAGTGC ATCATTGCCT CCACGTCTCA AAGATGCTTC AGTTGGACAT CTTTTCTCGC CAATTGGATA G
|
Protein sequence | MNERNRRERE FTAHELNRMN ELEWTEEASD LEFSSQPLPR ATQMELLAPA GGPAPFAAAL AGGADAIYCG LGNNFNARRG ADNFDDESFA RACRQAHLAG ARVYVTVNVV VKWDEMQRVL RLIRRAWILG ADAFIIQDWG LMAQVRKTWP EIECHVSTQA NIHDTRGVSA CKKLGVGRVT LSRELTKEEI STISKLGVEL ECFGHGALCF CYSGICHMSS MRGDRSANRG ACAQPCRLPY ELLNSKHEVV SMGGIDRLLC PKDYCTIDDV PDMIEAGVGS LKIEGRMKAP EYVYSVVSSY RQAIDAAEKG VDNQADVARR HRLLKRSFNR GLTNAYLHGT AGNKMMSYER SNNRGEVVGE VTGGRSLEDA LERKSGLNGG RVKLRRYKQA DVDLIAHAPI GKNDLLEIRP LDEPDKFLTA LCPTDVEPGQ KVTVRTSRVM QTGSTVRIIR SEAARVAAEQ ISSLEYPRKR AVDVTIIARI GQPFTVVLTT TDGAASASAE GFVVEEARTK AVTSDELIEH VGRMGTSPFE AVSFDVQMDD ACGMSFSAVH KVRAAACEQL EAALLEEYQD REYKIAPLSR LAYQKEREAQ DQEKLFVFDK AAAKTNASQA EICVLVETPE QARVALKAGA DRLYATTDVL TDASWPEDLL AKIVPWLDEV CREIDHNRLD PYVVSGKPIA VGNISELALA VERGAVPEVR ECIPIHNDYA LQALADMDAE GVWLNSELTL QEICHMARNA SIPVGYMVSG RIRTMTTEHC ILMSTGKCIH DCDACQLRLE EHTLRGIDND YMPVRTDRHG RSKIWSPKLF DGVPEISEML SAGVKRFMVD ATLLSVEQTR EATSRVAAAI EATASSASLP PRLKDASVGH LFSPIG
|
| |