Gene Apar_0199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0199 
Symbol 
ID8413047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp238156 
End bp239676 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content54% 
IMG OID645021768 
ProductN-6 DNA methylase 
Protein accessionYP_003179223 
Protein GI257784006 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.471881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000223426 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGATTACCG GAGCAACCAA GAGCAAGGTA GACGACGTTT GGCAGAGGAT GTGGGAAGGC 
GGCATCACCA ACCCGCAGGA AGTTATCACC CAGCTTACCT ACCTTATGTT CATCCGTTCG
CTCGACGATA AGGAACTAGA ATCAGAGCGC ATGGAAGAGC TTGGCATTCC GCAGGAGTAC
CTTTTCCCTC AGACGTCAGA GGGGCAGGAG ATGCGCTGGT GCTCCATCAA GAACATGGCG
CCGGAAAAAA TGCTCGAAGC CATACGGGAC AAGGTTTTTC CCTTCATCAA AACCTTGCAT
GATGACACGC CCTTTGCACG CAGCATGCGT GACGCCACCT TCGGCATCAA CAACCCACGC
ACCCTGCAGA AAGCAGTCTC GGGCATCGAC TCTCTCATGA ACGACTTCGA GAACGACATG
GATGATCTCG GAGACCTCTA CGAATACATG CTCTCCAAGC TTTCCACCGC TGGCACCAAT
GGTCAGTTCC GCACGCCCAA ACACATCCGC GACATGATGG TTGCTATGGT CGATCCCCGT
CCCGGTGAGC GCATCTGCGA CCCCGCCATG GGTACGGCGG GCTTTCTCAT CAGTGCCGCC
GACCATCTTC GTAATGACTC AGCTATGAAA GACGATGACT GGACAGTCTT TGCCGGTGAG
GCGGCGGAGA AGGACGCGGA TGGCAATGTT GTTGCGGAGG GTCGTCATCA GTTCTCTGGA
GGCGAGACCG ACCAGACCAT GTTCCGCATC AGCGCTATGA ACCTGATGCT GCACGGCATT
AGCCAGCCAG ACATCAAGCT AGTTGATTCG GTAAGTAAGC AAAACACCAC CAGCGACAAG
TACGACCTCG TGCTTGCCAA CCCGCCTTTT ACAGGTAGCG TCGACACAGA AGACATCGCA
CCAAGCCTCA AGGCGATTTG TAACAGCAAG CAGACCGAAC TGCTCTTTGT GGCGCTCTTT
TTGCGTATGC TCAAAGTGGG CGGCCGCTGC GCATGCATCG TTCCCAACGG CGTGCTCTTC
CGCACCAATT CCAAGGCGTA TCGTCAGCTG CGCCAAGAGC TCGTTGACAA CCAGCAGCTG
CGCGCCATCA TCTACATGCC AAGCGGCGTA TTCAAACCCT ATTCCGGTGT AAGTACCGCT
GTACTTGTCT TCACGAAAAC CAATGCCGGT GGCACTGACA AGGTGTGGCT CTACAACATG
GAGGGCGACG GCTACACACT CGATGACAAG CGAGATATCG ATGATGCCCA CAACGACGTG
CCAGACATTC TGGAGCGTTG GGCTCATTTG GAAAGTGAAG AGAAACGCGA CCGCAAGCAG
AAGAGCTTCC TCGTCTCCAA GCAGGACATC ATCGACAACG ACTACGACTT CAGCTTCAAC
AAGTACGTTG AGACTGAGTA CGAGCGCATT GAATATCCTC CTACGGAACA GATTGTGGCA
GAGCTCGACG AGCTGAATAA GGAAATGGCG ACAGGTCTTG CAGAGCTCAA GAAGGTGCTT
GGCGGTGGGC TAGATGACTA G
 
Protein sequence
MITGATKSKV DDVWQRMWEG GITNPQEVIT QLTYLMFIRS LDDKELESER MEELGIPQEY 
LFPQTSEGQE MRWCSIKNMA PEKMLEAIRD KVFPFIKTLH DDTPFARSMR DATFGINNPR
TLQKAVSGID SLMNDFENDM DDLGDLYEYM LSKLSTAGTN GQFRTPKHIR DMMVAMVDPR
PGERICDPAM GTAGFLISAA DHLRNDSAMK DDDWTVFAGE AAEKDADGNV VAEGRHQFSG
GETDQTMFRI SAMNLMLHGI SQPDIKLVDS VSKQNTTSDK YDLVLANPPF TGSVDTEDIA
PSLKAICNSK QTELLFVALF LRMLKVGGRC ACIVPNGVLF RTNSKAYRQL RQELVDNQQL
RAIIYMPSGV FKPYSGVSTA VLVFTKTNAG GTDKVWLYNM EGDGYTLDDK RDIDDAHNDV
PDILERWAHL ESEEKRDRKQ KSFLVSKQDI IDNDYDFSFN KYVETEYERI EYPPTEQIVA
ELDELNKEMA TGLAELKKVL GGGLDD