Gene Elen_1517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1517 
Symbol 
ID8415815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1811011 
End bp1813086 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content64% 
IMG OID645024485 
ProductN-6 DNA methylase 
Protein accessionYP_003181874 
Protein GI257791268 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.140297 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATA AGACCGCGTT CGACTACGTT AATGAGATAT GGAGCATCGC GAACTACGTG 
CGCGACGTCA TCCGCCCCGC CGACTACAAC AAGCTCATCC TTCCCTTCGC TGTGCTGAGG
CGCTTCGAGT GCGCCCTCGA GCCTACGCGG GCGGCCGTGA GCAGACAGGC CGCCAAGGGA
GTCTGGGACG ACGACGACCC GAAGTACTGC GCCCTCTCCG GCCACTGCTT CTACAACGTC
ACGAGCTTCA CGCTCTCCAA CCTCGGGGCG ACGAAGACCT GTGACGCCCT CATGGCCTAC
ATCAACGGCT TCTCCGTGAA CGCCCGCGAG GTCCTGCAGC GCTTCGAGAT GCGCCAGACC
TGCGAGAAGC TTGATGAGAA GGGCATGCTC TACGAGGTGT GCACACGCTT CTCGGGCTTC
GACCTCGGGC CCGAGACGGT CTCCGACCGC ATGATGACCG ACATCTACGA GCACCTCATC
CAGCGATACG GCGAGGAGAT CTCGCAGGAC GCCGAGGACT TCATGACGCC GAAGGACGTG
GCGAGGCTCG CCACGGCGCT CCTGTTCGCG AACGAGGACA CGCTCCTCAA CGCCGACAAC
GGCGACATCC GCACGCTCTA CGACGGCAGC TGCGGCACCT GCGGCTTCAT CTGCGACGCG
CTCGACCAGC TCGACGAGTG GCACGACAAG GGGCACTTCA AGAGCCCAAC CAAGATTGTC
CCCTACGGCC AGGAGCTCGA GGACGCGACC TGGGCCATGG GCAAGGCCGC GCTCATGCTG
CGCAATATCG CCGGGGGCTC GGGAGACGTG CTCGACCAGA TGACCGACCT GTCGGCCGGC
ATCATGCTCG GCGACACCCT CGACGACGAC AGGTTCGAGG GACGCACCTT CAACTACCAG
CTCACGAACC CGCCCTACGG CAAGGAGTGG AAGAAGGAGA AGGACGCCGT CCTCGAGGAG
ATGGGCCGTG GGTTCGATGG CAGGTTCGGC GCCGGCAAGC CCGACATCGA CGACGGCAGC
ATGCTCTTCA TGCAGAACGT CGCGGCGAAG ATGGCCCCTC CCAAGGAGGG CGGCGGGAAG
GCCGCCATCG TGCTCTCGGG CTCCCCGCTG TTCAACGGGG ATGCCGGCAG CGGGCCATCG
GGCATCCGCC GCTGGCTGTT CAGCGAGGAC CTCGTCGACT GCATCGTGAA GCTGCCGACC
GAGATCTTCT ACCGCACGGG CATCGCAACC TACATCTGGG TGCTCAACAA CCACAAGCCG
GAGAACCGCA AGGGGTACGT GCAGCTCATC GACGCCTCGG AGGAGAAGAC CGCCCTGCGC
AAGAGCCAAG GGAACAAGCG CTACGAGATC GGCGAGGACC AGGCCGCGTG GATCGTGCGC
ACCTACGTAG ACGGGCACGA CCACGGCAGG TCGGTCATCG TCCCGGTCGA GAACTTCATG
TACCGCAAGG TGACCACGCA GCGCCCGCTG CGCGTTGTCA TTGAGCCCTC GGTCGATGGG
CTGGATGCCC TCTTTACCCT CAGCAAGCCC ATGGAGAAGC TGTCGGACGC GAGCCGCGCC
GCCATTCGCT CCTGGGTTGA GAAGAACGAG GGCGCCTCGC TCACATACAG CGAGGTCCTG
GCTGCGACCG AGAAGCTTCA CAAAGCGATC GAAAAGCCCA AGCCACAGAA GGCCGCCCTC
GCCGACGCCC TCGTCAAGGT CTTCGGACGT CGAGACCCGT CCGCGACTCC CGCAATCGAT
GCGAAGGGCA ATCCGGTGTT CGACCCCGAG CTCAAGGACA CGGAGAACGT TCCAATCGGC
ATGGAAATCA ACGACTACAT GGCGACCGAG GTGCTGCCCT ACGCTCCCGA CGCCGTGGTG
GACGAGTCCG TAAAGGACGA GCCGAAGTAC GACGCCAAGA GCGGGCTCAC GGCCAACCCG
CTCGGCGACG GCGGGGTTGG CGTCGTCGGA ACGACCATCT CGTTCAACAG GTATTTCTAC
AAGTACGAGA AGCCTCGCGA CCCGCAAGTC ATCGCTAAGG AGATCCTCGA GCTCGAGGAC
GGCCTCGGCG AGCTCATGAG GGGGTTCCTG GCATGA
 
Protein sequence
MADKTAFDYV NEIWSIANYV RDVIRPADYN KLILPFAVLR RFECALEPTR AAVSRQAAKG 
VWDDDDPKYC ALSGHCFYNV TSFTLSNLGA TKTCDALMAY INGFSVNARE VLQRFEMRQT
CEKLDEKGML YEVCTRFSGF DLGPETVSDR MMTDIYEHLI QRYGEEISQD AEDFMTPKDV
ARLATALLFA NEDTLLNADN GDIRTLYDGS CGTCGFICDA LDQLDEWHDK GHFKSPTKIV
PYGQELEDAT WAMGKAALML RNIAGGSGDV LDQMTDLSAG IMLGDTLDDD RFEGRTFNYQ
LTNPPYGKEW KKEKDAVLEE MGRGFDGRFG AGKPDIDDGS MLFMQNVAAK MAPPKEGGGK
AAIVLSGSPL FNGDAGSGPS GIRRWLFSED LVDCIVKLPT EIFYRTGIAT YIWVLNNHKP
ENRKGYVQLI DASEEKTALR KSQGNKRYEI GEDQAAWIVR TYVDGHDHGR SVIVPVENFM
YRKVTTQRPL RVVIEPSVDG LDALFTLSKP MEKLSDASRA AIRSWVEKNE GASLTYSEVL
AATEKLHKAI EKPKPQKAAL ADALVKVFGR RDPSATPAID AKGNPVFDPE LKDTENVPIG
MEINDYMATE VLPYAPDAVV DESVKDEPKY DAKSGLTANP LGDGGVGVVG TTISFNRYFY
KYEKPRDPQV IAKEILELED GLGELMRGFL A