Gene Emin_0224 details


Gene Information

Locus tag: Emin_0224
Symbol:
ID: 6263151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 239287
End bp: 241257
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 39%
IMG OID: 642610687
Product: DNA methylase N-4/N-6 domain-containing protein
Protein accession: YP_001875123
Protein GI: 187250641
COG category: [L] Replication, recombination and repair
COG ID: [COG2189] Adenine specific DNA methylase Mod
TIGRFAM ID:
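
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are inclusive, 1-based coordinates and that the gene ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 239287
end_bp = 241257

# Assumption: coordinates are inclusive, so the span includes both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which does not encode an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1971 bp) and protein length (656 aa).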


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0000113178
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCAGACA AATTTAATAA AGAATCTTTA AATATAACAG AAGAAAAGCT AACTCAGTTT 
AAACAACTAT TCCCAGAAGT TTTTAGTGAG GGCAAGGTGG ACTTTGAACG CCTTAAGCTT
ACCCTTGGGG AACACACTGC AATATCTAAT GAACGCTACG TGCTAAACTG GGCTAACAAG
AGTGATGCCT TTACCGCCAT ACAAACCCCA ACAACCAAAA CCCTTTACCC TGCCGTCAAA
GAATCAGTCA ATTTTGATAC CACTCAAAAT GTGTTTATAG AGGGTGAGAA CCTGGAAGTG
TTAAAGATAC TCCAAAAGTC CTACTTTGGC AAAATTAAGA TGATATACAT TGACCCGCCC
TATAACACTG GCAATGACAA TTTTATTTAC CCAGACAAAT TTGCCGAAAC TAAAGAAGAG
TACCTTAAAA AAATTAAAGA AAAAGATGAA GAGGGCTACC TGCTAAAAGA AGGCCTCTTC
CGCAAAAACT CCAAAGAAAA CGGGCAGTTC CACAGCAACT GGCTTAACAT GATGTATCCC
AGACTGTTCC TAGCTAAAAA CCTGCTTAAA GACGATGGTG TTATTTTTGT GTCCATAGAT
GACAATGAGG TGCATAACCT GCGCCTGTTA ATGAATGAAG TGTTTGGAGA AGATAATTTC
AAAGCAATTT TCCCCAGAGT TACCAAAAAG GGTGGGAAGT CCTCAGAAGC AACCGCCAAG
AACCATGACT ACGTCTTGAT GTATTCAAAA AATAATTCTT TTGCAGATAT TATAGGGATA
GCACATAATG ATGATGGTTA TAGTAACCAG GATGAGTTTT TCGAAGAAAG GGGATTTTAT
AAATTAAATC AGACCCTAGA TTATGATTCT TTGGGATGGG TAAAGTCTCT TGACTACCCA
ATAGAAATTG GAGAGCATGT TTACTATGCT GGCGGGTCAA AGGAGAAATA TGAAGAGCGG
CATAGTGGCA AGCACGGACG TGCGGACTGG GGATGGAGAT GGAGTAAAGA CCTATTTGAG
TTTGGTTTTA GTAACGGCTT TGTTGAATTA AAAGACTCTG GGAGTAGACC TAGGATATAT
ACAAAAACTT ACCAGAATGT AAAAATTGAG AAGGTCGGCA ATAAATATGA AATTATAAAC
ATAGACCGCT CAAAGCCTTT GTCAACACTT GAGTTTATTG AAAATAAATA CTCCAACGAT
AATGCTACAA AGGTGATTGA CGGCGTTATT GGGAAAGGCA TTTTTGAATA CACTAAGCCT
CCAGAACTCA TATCACAATT AGCACATTTA ATAAATGAAA AAGATTTCTT TGTTTTGGAC
TTTTTTGCGG GTTCTGGTAC TACTGCTCAG GCTGTAATGG AGTTAAATAA AGAAGACGGT
GGGAAGAGAA AATTTATTTG TGTTCAACTG CCAGAAAAAA CAGAAGAGAC CTCAGAAGCT
TTTAGGGCAG GATACAAAAC TATATCAGAA ATAAGTGCTG AGCGTATACG CCGTGCCATC
AAAAAAATAG AAACAGAAAC TAAAGCAGAT AATACCATGT TTAAGGCCGA AGGCAAGCTG
GACCTTGGGT TCAAGTTTTA TAAATTAAAA GAATCTAACT TTAAAACATG GCGCAGTGAT
ATGGTGTCCA CTAAAGAAGA CCTGGCAACC ACTATCAATA TGTTTGAAGA CGCATTAAAG
GACGGTGCAA AAGAAACCAA TATTTTATGT GAACTATTGC TAAAACGGGG CTATGACCTT
AACGTCCCTG TGGAAACAGC GGAAGTGGAC AAAACCAAAA TATACATAGT TAACAACGGG
GAGCTGGTGG TATGCCTGGA GAAACTGTCA GACAAAGTTA TATCCAAAAT ACTAGAGATT
AAGCCCCAAA GGTGTTTAAT GCTAGATAGT TTATTTGACG GCAAAGACAG CTTAAAAACC
AACACCGTCT TACAACTCCA AGACGCAGAA ATAGACCTAA CGGTGGTATG A
 
Protein sequence
MADKFNKESL NITEEKLTQF KQLFPEVFSE GKVDFERLKL TLGEHTAISN ERYVLNWANK 
SDAFTAIQTP TTKTLYPAVK ESVNFDTTQN VFIEGENLEV LKILQKSYFG KIKMIYIDPP
YNTGNDNFIY PDKFAETKEE YLKKIKEKDE EGYLLKEGLF RKNSKENGQF HSNWLNMMYP
RLFLAKNLLK DDGVIFVSID DNEVHNLRLL MNEVFGEDNF KAIFPRVTKK GGKSSEATAK
NHDYVLMYSK NNSFADIIGI AHNDDGYSNQ DEFFEERGFY KLNQTLDYDS LGWVKSLDYP
IEIGEHVYYA GGSKEKYEER HSGKHGRADW GWRWSKDLFE FGFSNGFVEL KDSGSRPRIY
TKTYQNVKIE KVGNKYEIIN IDRSKPLSTL EFIENKYSND NATKVIDGVI GKGIFEYTKP
PELISQLAHL INEKDFFVLD FFAGSGTTAQ AVMELNKEDG GKRKFICVQL PEKTEETSEA
FRAGYKTISE ISAERIRRAI KKIETETKAD NTMFKAEGKL DLGFKFYKLK ESNFKTWRSD
MVSTKEDLAT TINMFEDALK DGAKETNILC ELLLKRGYDL NVPVETAEVD KTKIYIVNNG
ELVVCLEKLS DKVISKILEI KPQRCLMLDS LFDGKDSLKT NTVLQLQDAE IDLTVV
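
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal sketch. It assumes the sequence is read in frame from its first base and uses the codon assignments of translation table 11, which are the same as those of the standard code (the two tables differ in permitted start codons). The names `CODON_TABLE`, `clean`, `translate`, and `gc_percent` are hypothetical helpers written for this example, not part of the database.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
first_block = "ATGGCAGACA AATTTAATAA AGAATCTTTA"
print(translate(first_block))  # MADKFNKESL
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the listed protein sequence and GC content.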