Gene Apar_1224 details


Gene Information

Locus tag: Apar_1224
Symbol:
ID: 8414103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1372648
End bp: 1375038
Gene Length: 2391 bp
Protein Length: 796 aa
Translation table: 11
GC content: 55%
IMG OID: 645022818
Product: N-6 DNA methylase
Protein accession: YP_003180242
Protein GI: 257785025
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
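
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the record states the gene is not spliced; the helper names are hypothetical and not part of the source database).

```python
# Values copied from the record above.
START_BP = 1372648
END_BP = 1375038


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
print(gene_length)                  # 2391
print(protein_length(gene_length))  # 796
```

Both values agree with the Gene Length and Protein Length fields in the record.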


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.708606
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAAC TCAAATATCA AGAAAAGGAC GGCAAGGTCT ACTGCCCGCT CAAGGACAAG 
TGGCTGATTG CCACTCCGGA GGAAAAGGTC AGGCAGCGAT ACGTCTGCAC GCTCGTTAAC
GACTTCGGCT ACCAGCTCGA GCAAATGGCG CAGGAACTCA AGGTCACCAA CTCCAAACGA
GGCCAAGGCA AGGCGCGTGC GGACATCGTC ATCTGGAAGA GTACAGACGA AAAAGACGAG
AGCAAGTCAG CCTTCATCGT AGTCGAATGC AAGGCGGAAA ACGTCAAGAT CCATGTCGAG
GACTATTATC AAGGATTTAA CTACGCGTCT TGGGCGCACG CCCAGTTCTT CGTCACGACG
AACGAGAAGG AAACAAAATA CTTCAACGTC GACCCAACCT ATCTTCCCCA GAAGCTGGAG
GAGGTCGTCG GCATCCCGAC AGCGAAGGAC GTCGATAATG CGAAGAAGAT AGAGCAGATC
AAGAACCGCA CGAAGACCTT CACCCGCGAG GAGTTCACGC GGACGCTTCA GGCATGCCAC
AACATCATCC GAAACAACGA CAAGCTTTCC CCGGAGGCTG CGTTCGACGA GATCAGCAAG
CTGCTATTCA TGAAGATACG CTACGAGCGC CAGCAACGGG GCACTAAGGT GTTCACGAGA
AAACAGTACG AGGCGGAGGA GAAGAACTAC GAGGAAAATA TCCGTCCCGG CCTCAAAGGC
ACAGTTCTCT ACTCACAGTC GTACATGCAG CGCCTTTTCA GCACCACGAA GGAGGAATTC
AAAGACGACC ACCTCTTCGA GGACAGCGAC GAGATCAAGA TCAGGAACAA CAGCTTCATC
CAGATCCTCG GAAAGCTTGA AAACTTCAAC CTATCCGATA CGCAAGACGA TGTGAAGGGC
ATCGCCTTCG AGCAGTTCCT TGGCACGACA TTCAGAGGCG AGTTGGGTCA GTTCTTCACA
CCGCGCACCA TCGTCGATTT CATGACGGAG ATCATCGATC CTCAGGAGGG CGAGATCATC
TGCGATCCGA CATGCGGCTC CGGCGGCTTC CTCATCAAGG CTTTCGAGTA CGTCCGCGAG
AAGATCGAGG CGGACATCCG CGAGCAGAAG GAGAAGCTGC GCTCGGAGTT TGAAAGCGAC
GATTTCGAGA GCAAACCCGA AGACGAGCAG ATCAGAGTCA CCGTCCTTAT CGACAAGATG
CAAGCCGTGC TCAACGCAGA GCTTGATACT AGTGCAACCA ATAGTCGCAT GCAGCAGCTT
TCCCGCAACT GCATCTACGG CACGGACGCC AACCCGCGCA TGGCGCGAAC GTCCAAGATG
AACATGATCA TGCACGGAGA CGGACACGGC GGCGTACACC ACCACGACGG CCTTCTGAAT
GTGAATGGCA TCTTCGAGGA GCGTTTCGAC GTGATTCTCA CCAACCCGCC ATTCGGCCAG
AACGTCGACC GCAGTCAGAC CATCACGGAC GCCGACCGCT TCACCGACGA GGAGATGAAG
AAGAAGTACC GCAACAAGTA CGGCGAGGCA TATGACGAGG CGCTCAAGCA GGTTGACGAC
CATATCGGAA AGCCCTTGCT CTCGCTCTAC GATCTCGGCT CCACGAGCAC CCTCACCGAA
GTGCTCTTCA TGGAGCGCTG CCTGCGCCTT CTCAAGAAGG GCGGACGCAT GGGCATGGTT
CTGCCCGAAG GCGTCCTCAA CAACAAGAAC CTTGCAGCGG TGCGCGAGTA CTTCGAGGGA
AAGGCAAAGC TAATCCTCAT CTGCTCCATC CCGCAGGACG TGTTCATTGC GGCAGGTGCG
ACAGTAAAGC CGAGCCTCGT TTTCATGCGG AAGTTCACCG CCGAAGAAGA AGCAAAGTAC
GCCAAGTGCA AACAGGCCGC AGCAGACGAG GTGGCCGCAC TGCATAAGGA TGAGGTCGAC
GAGCTTGAAA AGGCCATAGC CTACTGCACC GCCGTCACCG AAACGCTCAA GGACGATCTC
AAAGATGCCC GCAGCAGGCT GAAACAGGCC AAGAAGGACA AGGCGAAAAC CTCAAGTATC
AATGCGGAAA TCAATGCCAT CCAGCAAGAG CAGACCGATA ACAAAACAAA GAAGAAGGAG
GTGGAGAAGA CGCTCAAGGA TCTGCAAAAA CGGATGATCG AAGAGGTAAA GCCGCTCATC
AAGAAGAACT TCGACTACGA CATCCCCATC GCAAAGATTG ACGATGCCGG AATCACAACC
ACGGGCGCGG CATCCGAGGG AAACCAACTG CCCGCTCTTG TCGAGGAATA CAAGGCGTAC
CGCAAGGAGC ATGCCCTGTG GGAAACGGAC AATCGGGCAT CCCGGTATAT TCCCATAGAT
GAGGATAGCT TCCAACGAGT CTTTTTCGAG GAAGGCGAGG AGGTGCACTA A
 
Protein sequence
MAKLKYQEKD GKVYCPLKDK WLIATPEEKV RQRYVCTLVN DFGYQLEQMA QELKVTNSKR 
GQGKARADIV IWKSTDEKDE SKSAFIVVEC KAENVKIHVE DYYQGFNYAS WAHAQFFVTT
NEKETKYFNV DPTYLPQKLE EVVGIPTAKD VDNAKKIEQI KNRTKTFTRE EFTRTLQACH
NIIRNNDKLS PEAAFDEISK LLFMKIRYER QQRGTKVFTR KQYEAEEKNY EENIRPGLKG
TVLYSQSYMQ RLFSTTKEEF KDDHLFEDSD EIKIRNNSFI QILGKLENFN LSDTQDDVKG
IAFEQFLGTT FRGELGQFFT PRTIVDFMTE IIDPQEGEII CDPTCGSGGF LIKAFEYVRE
KIEADIREQK EKLRSEFESD DFESKPEDEQ IRVTVLIDKM QAVLNAELDT SATNSRMQQL
SRNCIYGTDA NPRMARTSKM NMIMHGDGHG GVHHHDGLLN VNGIFEERFD VILTNPPFGQ
NVDRSQTITD ADRFTDEEMK KKYRNKYGEA YDEALKQVDD HIGKPLLSLY DLGSTSTLTE
VLFMERCLRL LKKGGRMGMV LPEGVLNNKN LAAVREYFEG KAKLILICSI PQDVFIAAGA
TVKPSLVFMR KFTAEEEAKY AKCKQAAADE VAALHKDEVD ELEKAIAYCT AVTETLKDDL
KDARSRLKQA KKDKAKTSSI NAEINAIQQE QTDNKTKKKE VEKTLKDLQK RMIEEVKPLI
KKNFDYDIPI AKIDDAGITT TGAASEGNQL PALVEEYKAY RKEHALWETD NRASRYIPID
EDSFQRVFFE EGEEVH
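
The gene and protein listings above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that spacing, computes a GC fraction, and translates a nucleotide string codon by codon. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code for every codon; the function names are hypothetical, and handling of alternative start codons is omitted because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (each base position cycles T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and the final line of the gene sequence listed above.
print(translate("ATGGCAAAAC TCAAATATCA"))  # MAKLKY
print(translate("GAGGATAGCT TCCAACGAGT CTTTTTCGAG GAAGGCGAGG AGGTGCACTA A"))  # EDSFQRVFFEEGEEVH
```

The two outputs match the first six and the last sixteen residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field in the record.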