Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0804 |
Symbol | |
ID | 8413669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 885786 |
End bp | 888173 |
Gene Length | 2388 bp |
Protein Length | 795 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 645022386 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_003179824 |
Protein GI | 257784607 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.569588 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTAAGA CGATGGAACA GCCCTTTCGT CCTTTTATTC CGTATAGCCT ATGTGCGCTG CTTTTTGTTT GTTTATGGCT TCAAGTTTTT ATTGCAAGAA GCGCAACGCT CGATACAGTT CTGATTCATA TTCTAGGAAT CTGTGCGGTG GTTGCTTTGG GATTTTGGTA TGTAAAGCAT GCGCATACCA GCATTATGAC TGTTGGGGTA GTTTTACTCT TACTTGTAAT TATCCTTATC AGGTCTTTCT TTATATATGA CAGTCAAATG GCGTCAGTGA ATCTTTTAGA GTCAACTTCC GTACATGATT TTGAACTAGT TGTATCAAGA GATTTAGTCT ACAAAAATCA CACATGGATT GGTCAAGCAA ACGTCATCTA CCAAGGTAGA TGCATTGGAT GTGTGGGACT ACATACCAAG GAACAGTTCT TACGAGAAAC ACATCTGTGC TGTGAAGGTA GGTTTACACG ATATAAAGAT AAAGATTTTT CAGAAGAACA ATTTAAGAGG GGAGTACTTG GTTCAATTCA GGTAACCAAG GTGAATTCAA AAACCTATGA AGAGGGAATT GTTGGGGAAG TTGCCCAATT TAGAGAAAAC TGTATTTCTC AACTTCAACC AGAGCTGAGC TTTGGGCGAG CGCTAGCTGC ATGCGGTCTT TGTGCCTATA GGCCCAGCCT GTATAGCTTT AACATTCCGA GAATATTTAT GCGCTGCGGC CTTATGCACC TCATTGCAAT TTCTAGTTGT CACATTGTAA TTCTCTCTGC CTATATTGAT GCTTTGTTTA AAAAACTTAC ATTAAAGCCT CTTTTGAGGG GAGTGTTAAA TTTATTTCTA TTAAGCAGTT ATGCACTTTT CTGTGGAGTG CCTGCGTCTG CATTGAGGGC AATACTCCTT GTTGAGCAAA AATACATTAT GCAGTGTGTT GGAAGGAAGA ATCATACGTT ATCTGCTGTT TCTATTGTTG CATTGCTTAT GCTTTGTGTT GATCCACAAC TGAGCGTAAA TATTGCTTTT ACATTGTCTT TAACTTGTAT TTTAGGAATG AATATATATG GCGGTCTTGC TCAATATTAC GCTAAAACTG CATTTCATTT ATCTGGATAT GGAGCAGTAC AAAAGGTGCT GAGAAAGCCA TTTTCTAGTG TACGAAACAC AATGTGCGCA ACAACAGTTG CTCAGTTTTC ATGCCTTCCA ATTTCATGTG TGTGCTTTGG CCATTTTTCA TTACTCGCAC CTCTTTCAAG TGCACTGATT ACAAGTCCAT TTTCACTTCT CAGTCTATTT GGAATTGGAG CAATAATTTT AAGCAGTATT AAACCAGCTC AAGATACTGT TCTCTTTTTT ATAGATTTTC TTGGGAAGCT TATAGAGACT CTTGCATCTT TCTTATCAGA AAGACCTTTT GCAAGTGTGT TTGTAGCCGA CATTGGTTGG ATAGTCTCAT TAGCTTTTAT TGCAGTAATG GTGGCGCTCT ATGTGTTATG GCCAAAGGGA AGAAGAATAG TTTTAGTTGG CATATGTTTG GCAGTATCTA TGCTTATTGT CGGATTAAGT ATCTATTGGA GGTTTTTCTC GCCAACAAGA ATTTGTGTGA TGGATATAGG TCAGGGAGAC GCCATATTGC TGAGCGATGG CGTGCATTCT CTGCTTGTGG ATACTAGTGC AGGAGATGTG GTTAACGATG CTCTGGAGCG ACAGCACGTC TCATATCTTG ATGCAATTTT GCTTACACAT CTTGATGAAG ACCATGCTGG CGGCGTGAGA TATATGGTTG GCTCAGTAAA GGCAGGCCGT GTATTAGTTG GAGAAGGAAT AACAAAACAA GAAAAGCCTG AGTTACAGAG AGCTATTCAG AGGATTTCTG GTAGTGGTTC TTATGAAGTT TTATATGGCG ACGAATTTGA TGTTGGTCGC TTTCATGTTC GTGTGGTGTG GCCACATAGG GGCTATAAAG CAAAAGAAGC CAATAATGCT TCAGTCGAGC TGTATGTGAC ATATAACGAT GGACAAAATA CGTTAACTAC GCTTTTAACG GGAGATGCTG AAAGAGATCA AACTAAAGAG ACGGTGACAT CGGGTGATGT GGGAGATATT GATTTCTTAA AAGTTGGACA TCATGGTGCA GCAAAGTCAC TCTATCCAGC AACTGCTCAA GTACTAAAAC CTGAGGTAGC GGTGGCAAGT GCTGGAAAGA ATAATCATTA TGGGCATCCT AAACAGGAAG CGATAGATAT TTTAGAGGGT GTTGGTGCAA GGTTTTACTG CACAAAAGAT TATGGAGACG TCACAGTATT TCCTGGAGAA CATGGACCTA AGGTGAGCGT GCAACATGCT AAAGTAGATA CAGATTTAGA GGAGGAAAAA GATGGCGGAG CAGGCTAA
|
Protein sequence | MSKTMEQPFR PFIPYSLCAL LFVCLWLQVF IARSATLDTV LIHILGICAV VALGFWYVKH AHTSIMTVGV VLLLLVIILI RSFFIYDSQM ASVNLLESTS VHDFELVVSR DLVYKNHTWI GQANVIYQGR CIGCVGLHTK EQFLRETHLC CEGRFTRYKD KDFSEEQFKR GVLGSIQVTK VNSKTYEEGI VGEVAQFREN CISQLQPELS FGRALAACGL CAYRPSLYSF NIPRIFMRCG LMHLIAISSC HIVILSAYID ALFKKLTLKP LLRGVLNLFL LSSYALFCGV PASALRAILL VEQKYIMQCV GRKNHTLSAV SIVALLMLCV DPQLSVNIAF TLSLTCILGM NIYGGLAQYY AKTAFHLSGY GAVQKVLRKP FSSVRNTMCA TTVAQFSCLP ISCVCFGHFS LLAPLSSALI TSPFSLLSLF GIGAIILSSI KPAQDTVLFF IDFLGKLIET LASFLSERPF ASVFVADIGW IVSLAFIAVM VALYVLWPKG RRIVLVGICL AVSMLIVGLS IYWRFFSPTR ICVMDIGQGD AILLSDGVHS LLVDTSAGDV VNDALERQHV SYLDAILLTH LDEDHAGGVR YMVGSVKAGR VLVGEGITKQ EKPELQRAIQ RISGSGSYEV LYGDEFDVGR FHVRVVWPHR GYKAKEANNA SVELYVTYND GQNTLTTLLT GDAERDQTKE TVTSGDVGDI DFLKVGHHGA AKSLYPATAQ VLKPEVAVAS AGKNNHYGHP KQEAIDILEG VGARFYCTKD YGDVTVFPGE HGPKVSVQHA KVDTDLEEEK DGGAG
|
| |