Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_1224 |
Symbol | |
ID | 8414103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 1372648 |
End bp | 1375038 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 645022818 |
Product | N-6 DNA methylase |
Protein accession | YP_003180242 |
Protein GI | 257785025 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.708606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAAC TCAAATATCA AGAAAAGGAC GGCAAGGTCT ACTGCCCGCT CAAGGACAAG TGGCTGATTG CCACTCCGGA GGAAAAGGTC AGGCAGCGAT ACGTCTGCAC GCTCGTTAAC GACTTCGGCT ACCAGCTCGA GCAAATGGCG CAGGAACTCA AGGTCACCAA CTCCAAACGA GGCCAAGGCA AGGCGCGTGC GGACATCGTC ATCTGGAAGA GTACAGACGA AAAAGACGAG AGCAAGTCAG CCTTCATCGT AGTCGAATGC AAGGCGGAAA ACGTCAAGAT CCATGTCGAG GACTATTATC AAGGATTTAA CTACGCGTCT TGGGCGCACG CCCAGTTCTT CGTCACGACG AACGAGAAGG AAACAAAATA CTTCAACGTC GACCCAACCT ATCTTCCCCA GAAGCTGGAG GAGGTCGTCG GCATCCCGAC AGCGAAGGAC GTCGATAATG CGAAGAAGAT AGAGCAGATC AAGAACCGCA CGAAGACCTT CACCCGCGAG GAGTTCACGC GGACGCTTCA GGCATGCCAC AACATCATCC GAAACAACGA CAAGCTTTCC CCGGAGGCTG CGTTCGACGA GATCAGCAAG CTGCTATTCA TGAAGATACG CTACGAGCGC CAGCAACGGG GCACTAAGGT GTTCACGAGA AAACAGTACG AGGCGGAGGA GAAGAACTAC GAGGAAAATA TCCGTCCCGG CCTCAAAGGC ACAGTTCTCT ACTCACAGTC GTACATGCAG CGCCTTTTCA GCACCACGAA GGAGGAATTC AAAGACGACC ACCTCTTCGA GGACAGCGAC GAGATCAAGA TCAGGAACAA CAGCTTCATC CAGATCCTCG GAAAGCTTGA AAACTTCAAC CTATCCGATA CGCAAGACGA TGTGAAGGGC ATCGCCTTCG AGCAGTTCCT TGGCACGACA TTCAGAGGCG AGTTGGGTCA GTTCTTCACA CCGCGCACCA TCGTCGATTT CATGACGGAG ATCATCGATC CTCAGGAGGG CGAGATCATC TGCGATCCGA CATGCGGCTC CGGCGGCTTC CTCATCAAGG CTTTCGAGTA CGTCCGCGAG AAGATCGAGG CGGACATCCG CGAGCAGAAG GAGAAGCTGC GCTCGGAGTT TGAAAGCGAC GATTTCGAGA GCAAACCCGA AGACGAGCAG ATCAGAGTCA CCGTCCTTAT CGACAAGATG CAAGCCGTGC TCAACGCAGA GCTTGATACT AGTGCAACCA ATAGTCGCAT GCAGCAGCTT TCCCGCAACT GCATCTACGG CACGGACGCC AACCCGCGCA TGGCGCGAAC GTCCAAGATG AACATGATCA TGCACGGAGA CGGACACGGC GGCGTACACC ACCACGACGG CCTTCTGAAT GTGAATGGCA TCTTCGAGGA GCGTTTCGAC GTGATTCTCA CCAACCCGCC ATTCGGCCAG AACGTCGACC GCAGTCAGAC CATCACGGAC GCCGACCGCT TCACCGACGA GGAGATGAAG AAGAAGTACC GCAACAAGTA CGGCGAGGCA TATGACGAGG CGCTCAAGCA GGTTGACGAC CATATCGGAA AGCCCTTGCT CTCGCTCTAC GATCTCGGCT CCACGAGCAC CCTCACCGAA GTGCTCTTCA TGGAGCGCTG CCTGCGCCTT CTCAAGAAGG GCGGACGCAT GGGCATGGTT CTGCCCGAAG GCGTCCTCAA CAACAAGAAC CTTGCAGCGG TGCGCGAGTA CTTCGAGGGA AAGGCAAAGC TAATCCTCAT CTGCTCCATC CCGCAGGACG TGTTCATTGC GGCAGGTGCG ACAGTAAAGC CGAGCCTCGT TTTCATGCGG AAGTTCACCG CCGAAGAAGA AGCAAAGTAC GCCAAGTGCA AACAGGCCGC AGCAGACGAG GTGGCCGCAC TGCATAAGGA TGAGGTCGAC GAGCTTGAAA AGGCCATAGC CTACTGCACC GCCGTCACCG AAACGCTCAA GGACGATCTC AAAGATGCCC GCAGCAGGCT GAAACAGGCC AAGAAGGACA AGGCGAAAAC CTCAAGTATC AATGCGGAAA TCAATGCCAT CCAGCAAGAG CAGACCGATA ACAAAACAAA GAAGAAGGAG GTGGAGAAGA CGCTCAAGGA TCTGCAAAAA CGGATGATCG AAGAGGTAAA GCCGCTCATC AAGAAGAACT TCGACTACGA CATCCCCATC GCAAAGATTG ACGATGCCGG AATCACAACC ACGGGCGCGG CATCCGAGGG AAACCAACTG CCCGCTCTTG TCGAGGAATA CAAGGCGTAC CGCAAGGAGC ATGCCCTGTG GGAAACGGAC AATCGGGCAT CCCGGTATAT TCCCATAGAT GAGGATAGCT TCCAACGAGT CTTTTTCGAG GAAGGCGAGG AGGTGCACTA A
|
Protein sequence | MAKLKYQEKD GKVYCPLKDK WLIATPEEKV RQRYVCTLVN DFGYQLEQMA QELKVTNSKR GQGKARADIV IWKSTDEKDE SKSAFIVVEC KAENVKIHVE DYYQGFNYAS WAHAQFFVTT NEKETKYFNV DPTYLPQKLE EVVGIPTAKD VDNAKKIEQI KNRTKTFTRE EFTRTLQACH NIIRNNDKLS PEAAFDEISK LLFMKIRYER QQRGTKVFTR KQYEAEEKNY EENIRPGLKG TVLYSQSYMQ RLFSTTKEEF KDDHLFEDSD EIKIRNNSFI QILGKLENFN LSDTQDDVKG IAFEQFLGTT FRGELGQFFT PRTIVDFMTE IIDPQEGEII CDPTCGSGGF LIKAFEYVRE KIEADIREQK EKLRSEFESD DFESKPEDEQ IRVTVLIDKM QAVLNAELDT SATNSRMQQL SRNCIYGTDA NPRMARTSKM NMIMHGDGHG GVHHHDGLLN VNGIFEERFD VILTNPPFGQ NVDRSQTITD ADRFTDEEMK KKYRNKYGEA YDEALKQVDD HIGKPLLSLY DLGSTSTLTE VLFMERCLRL LKKGGRMGMV LPEGVLNNKN LAAVREYFEG KAKLILICSI PQDVFIAAGA TVKPSLVFMR KFTAEEEAKY AKCKQAAADE VAALHKDEVD ELEKAIAYCT AVTETLKDDL KDARSRLKQA KKDKAKTSSI NAEINAIQQE QTDNKTKKKE VEKTLKDLQK RMIEEVKPLI KKNFDYDIPI AKIDDAGITT TGAASEGNQL PALVEEYKAY RKEHALWETD NRASRYIPID EDSFQRVFFE EGEEVH
|
| |