Gene Information |
Locus tag | Apar_0199 |
Symbol | |
ID | 8413047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 238156 |
End bp | 239676 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 645021768 |
Product | N-6 DNA methylase |
Protein accession | YP_003179223 |
Protein GI | 257784006 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.471881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000223426 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATTACCG GAGCAACCAA GAGCAAGGTA GACGACGTTT GGCAGAGGAT GTGGGAAGGC GGCATCACCA ACCCGCAGGA AGTTATCACC CAGCTTACCT ACCTTATGTT CATCCGTTCG CTCGACGATA AGGAACTAGA ATCAGAGCGC ATGGAAGAGC TTGGCATTCC GCAGGAGTAC CTTTTCCCTC AGACGTCAGA GGGGCAGGAG ATGCGCTGGT GCTCCATCAA GAACATGGCG CCGGAAAAAA TGCTCGAAGC CATACGGGAC AAGGTTTTTC CCTTCATCAA AACCTTGCAT GATGACACGC CCTTTGCACG CAGCATGCGT GACGCCACCT TCGGCATCAA CAACCCACGC ACCCTGCAGA AAGCAGTCTC GGGCATCGAC TCTCTCATGA ACGACTTCGA GAACGACATG GATGATCTCG GAGACCTCTA CGAATACATG CTCTCCAAGC TTTCCACCGC TGGCACCAAT GGTCAGTTCC GCACGCCCAA ACACATCCGC GACATGATGG TTGCTATGGT CGATCCCCGT CCCGGTGAGC GCATCTGCGA CCCCGCCATG GGTACGGCGG GCTTTCTCAT CAGTGCCGCC GACCATCTTC GTAATGACTC AGCTATGAAA GACGATGACT GGACAGTCTT TGCCGGTGAG GCGGCGGAGA AGGACGCGGA TGGCAATGTT GTTGCGGAGG GTCGTCATCA GTTCTCTGGA GGCGAGACCG ACCAGACCAT GTTCCGCATC AGCGCTATGA ACCTGATGCT GCACGGCATT AGCCAGCCAG ACATCAAGCT AGTTGATTCG GTAAGTAAGC AAAACACCAC CAGCGACAAG TACGACCTCG TGCTTGCCAA CCCGCCTTTT ACAGGTAGCG TCGACACAGA AGACATCGCA CCAAGCCTCA AGGCGATTTG TAACAGCAAG CAGACCGAAC TGCTCTTTGT GGCGCTCTTT TTGCGTATGC TCAAAGTGGG CGGCCGCTGC GCATGCATCG TTCCCAACGG CGTGCTCTTC CGCACCAATT CCAAGGCGTA TCGTCAGCTG CGCCAAGAGC TCGTTGACAA CCAGCAGCTG CGCGCCATCA TCTACATGCC AAGCGGCGTA TTCAAACCCT ATTCCGGTGT AAGTACCGCT GTACTTGTCT TCACGAAAAC CAATGCCGGT GGCACTGACA AGGTGTGGCT CTACAACATG GAGGGCGACG GCTACACACT CGATGACAAG CGAGATATCG ATGATGCCCA CAACGACGTG CCAGACATTC TGGAGCGTTG GGCTCATTTG GAAAGTGAAG AGAAACGCGA CCGCAAGCAG AAGAGCTTCC TCGTCTCCAA GCAGGACATC ATCGACAACG ACTACGACTT CAGCTTCAAC AAGTACGTTG AGACTGAGTA CGAGCGCATT GAATATCCTC CTACGGAACA GATTGTGGCA GAGCTCGACG AGCTGAATAA GGAAATGGCG ACAGGTCTTG CAGAGCTCAA GAAGGTGCTT GGCGGTGGGC TAGATGACTA G
|
Protein sequence | MITGATKSKV DDVWQRMWEG GITNPQEVIT QLTYLMFIRS LDDKELESER MEELGIPQEY LFPQTSEGQE MRWCSIKNMA PEKMLEAIRD KVFPFIKTLH DDTPFARSMR DATFGINNPR TLQKAVSGID SLMNDFENDM DDLGDLYEYM LSKLSTAGTN GQFRTPKHIR DMMVAMVDPR PGERICDPAM GTAGFLISAA DHLRNDSAMK DDDWTVFAGE AAEKDADGNV VAEGRHQFSG GETDQTMFRI SAMNLMLHGI SQPDIKLVDS VSKQNTTSDK YDLVLANPPF TGSVDTEDIA PSLKAICNSK QTELLFVALF LRMLKVGGRC ACIVPNGVLF RTNSKAYRQL RQELVDNQQL RAIIYMPSGV FKPYSGVSTA VLVFTKTNAG GTDKVWLYNM EGDGYTLDDK RDIDDAHNDV PDILERWAHL ESEEKRDRKQ KSFLVSKQDI IDNDYDFSFN KYVETEYERI EYPPTEQIVA ELDELNKEMA TGLAELKKVL GGGLDD
|
| |
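The length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names (clean_sequence, gene_length_bp, protein_length_aa) are hypothetical helpers introduced for illustration only.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces that split the displayed sequence into blocks of 10."""
    return "".join(grouped.split())


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the record above.
record = {
    "start_bp": 238156,
    "end_bp": 239676,
    "gene_length_bp": 1521,
    "protein_length_aa": 506,
}

computed_gene_len = gene_length_bp(record["start_bp"], record["end_bp"])
computed_protein_len = protein_length_aa(computed_gene_len)

print(computed_gene_len == record["gene_length_bp"])
print(computed_protein_len == record["protein_length_aa"])
```

Under these assumptions both comparisons agree with the listed Gene Length and Protein Length. The clean_sequence helper can be applied to the Gene sequence or Protein sequence field before any further processing, since both are displayed in space-separated blocks.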