Gene Elen_0369 details


Gene Information

Locus tag: Elen_0369
Symbol:
ID: 8414653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 474348
End bp: 476642
Gene Length: 2295 bp
Protein Length: 764 aa
Translation table: 11
GC content: 62%
IMG OID: 645023346
Product: N-6 DNA methylase
Protein accession: YP_003180749
Protein GI: 257790143
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
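
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are chosen here for illustration only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 474348
end_bp = 476642
gene_length_bp = 2295
protein_length_aa = 764

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon
# that does not contribute an amino acid to the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

length_matches = span_bp == gene_length_bp
in_frame = span_bp % 3 == 0
protein_matches = expected_protein_aa == protein_length_aa

print("span (bp):", span_bp)
print("gene length consistent:", length_matches)
print("whole number of codons:", in_frame)
print("protein length consistent:", protein_matches)
```

Under these assumptions the three checks agree with the values reported in the record.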


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGATTA AGATGAAGCG CCTGGAAGAG GTTGCCTGCG TATTCGATGA TCGGTGCGCG 
CCCGTGCGAG GGGCGCAGCG GTTGCTGCGG AAGGGGCCGT ACCGGCTTTA TGTCGAGACG
GGTTTCATCC CCTTCGACGA TTATGCTTTC GATGGACGGT TCCTCTTGCT GGGATCAGTG
TGCAACGTGG AGGCTCCCAG CGGGTGTTTG CAGGTGACGG AAGCACGTGG AAAGTTTTCG
GCGACCGATT TGTACCATGT GGTTGCATGC GACGACGATG CCGACACGGT TTATCTGCGT
CACGTCCTAT CGCGCATCCC AGCTGCCAAG CATGCGGACA TGAGTGGGCA TACCGTGCGG
CTCACGGAGA GCAGTCTGCG ACATATTTCC GTGCCGTGGC CTGATGCGGA CGTTCGTCGC
GCCGTGGCGC GGTATCTGGA CGAGTGCGAA TCGCGCTGCC GTGACCTTGC GGCACGTAAT
CGAAGCTTGT TCGAGGAGGG GGTCGAAGCT TATCGTGAAG CCGCGAGACG CTCTTCGAAG
ACGATGAAGC TGGGGAACGC GTGCGCTATG CGCGAGGGCT CGTTCCTTCC GGCTGAAAAG
CGCAGTGCGA AGGGCGCGCT CCCGGCCGTC TCTTCGCAGG GCGTTATGGC GTACACCGAC
GAGGAGGGCG TACGAGAGCA GTGCGTAGTC GTAGGGCAGG CGGGGCAATA CCTTGTGGCC
CGCATGATGC CTGAAGGCGC GTATCCTTTG GTCGACACGA TTGCCTTGAC GACGGATGCG
TCTGACCCTT TGACGGTGGA TGCGCTCGTG TTCGCGCTCG CCTCGCTCGG TATTCGCCCG
CGTCTTCGCG TTGTCGATCG TGCGGTGGAG GCGCTGGCGC TTCCGCTCGA GGAGCTTGTC
GCGTTAGAGA TCCCCTTGAT CGAGGAGGGC GAGCGGGACG CGCGATATTC CGAGATGCGG
GCGATTCTGG AATCGATCGA AAAAGGAGAG CGCGAGGCGA AAGAGGCGCA CGCAGCCGCC
AAGGTGCTGG TTGACGGCTT GTTTGCGGGA CGCGAAGAGG CGCTCAAGCG CTTTGTGGAA
CCCGCGCCGC ATGAGGTGCT TGAAGCGCTC GTTCAGGATG TGCGATCGGA TCTCGCCCAT
GTCGAGGGCG TTGCGGCGTC GGCGTTCGAT GCGGCGTGGG AGGTGCTGCC GTTGCTGTTC
GTGCGCTTGG TTGACGACGG CGCGGCTTGG GCGCGCGTTA TCGCAGCCGA GGATACGCCG
GCTCAGATCG ATGTGGAACT GGAGCGTTTT GCGGCCCAGG ACGAGGGGCT GTCGTTCTTG
AGCGGTTTCG CCTTGAGCGC ATCGTCGTTG GACGAGTCTT CGCAACGACG CATGATCGAT
CGGATAGGCG ATCTGCGGCT TGACGGCTAT AACGGGGAAT TGCTCCGTTG GCTCGCTCTT
GGAAACGAAC CCGAGCCGGA CGCGCCGTGC CCCGCTGCCG TGAGCGATCT CATGGCGCGT
ATTGCGCTAG CGTTCAATCC CTCTGCTGCG CAAGCATACG ATCCTTGCTT GGGTGTCGGT
GATACGCTGG CGGCGCTTCG GCGTTTTGCC CCGACGATTC GCTGCGGCGG CCAGACCGTC
CGTTTCCCCG ATGCGCTTGT GGCGAAGCTG GCTGCGCGCT GCGAAGGGTG GTTCTTCGAC
GACGGTGCGT TGGCGGTGGG ATCCGCGCTG GTCGAAGACG AACTCGCGGG TAAGCTCGCT
GATGTGATCG TGTCGGTGCT GCCTCCCAAT CAGGGAGAAT GGACCGACCA TGCACCCGAT
CCCAGCGACA CGCGTTGGGC GTTCGGCGTT CCTCCGCGGA ATAAGGCGAA CCTTGCGTGG
GTGCAGCAGG CGTTTGCCCA TCGAGCACCG GGCGGCATCG CGGTTCTGGC GGCCAGTAAT
GCCGTGCTGC ACGAATCGCG AGGCTGCGAG CCTGGAGTAC GTGCCGCCCT GATCGAATCG
GGATGCGTTC GTGCAGTCGT TTCACTTCCA GGGGGTCTGT TCAGCGACGG GAGGGTTCCT
TTCAGCATCA TCGTTTTGGG CGACAAGCGC TCAGTGCCTT TTGAAACCCT GTTCGTCAAT
GCCCTGGAAT ACGGCGTGCC GAACGTGACG AGGGCGGGGC GCGGGCTTCC GATGGATGCG
CGCGATCGCG TCGTGTCGAC GGTGGAGCGG TGGATTGCGA CCGGGTCGAG CGTATTCATC
CCTGGATTTG CGCGCAGCGT GCCGGAAAGT GAGATTGTTG CGCTGGGAGA CCTGACGCCT
TGGTCGTACG TATAG
 
Protein sequence
MSIKMKRLEE VACVFDDRCA PVRGAQRLLR KGPYRLYVET GFIPFDDYAF DGRFLLLGSV 
CNVEAPSGCL QVTEARGKFS ATDLYHVVAC DDDADTVYLR HVLSRIPAAK HADMSGHTVR
LTESSLRHIS VPWPDADVRR AVARYLDECE SRCRDLAARN RSLFEEGVEA YREAARRSSK
TMKLGNACAM REGSFLPAEK RSAKGALPAV SSQGVMAYTD EEGVREQCVV VGQAGQYLVA
RMMPEGAYPL VDTIALTTDA SDPLTVDALV FALASLGIRP RLRVVDRAVE ALALPLEELV
ALEIPLIEEG ERDARYSEMR AILESIEKGE REAKEAHAAA KVLVDGLFAG REEALKRFVE
PAPHEVLEAL VQDVRSDLAH VEGVAASAFD AAWEVLPLLF VRLVDDGAAW ARVIAAEDTP
AQIDVELERF AAQDEGLSFL SGFALSASSL DESSQRRMID RIGDLRLDGY NGELLRWLAL
GNEPEPDAPC PAAVSDLMAR IALAFNPSAA QAYDPCLGVG DTLAALRRFA PTIRCGGQTV
RFPDALVAKL AARCEGWFFD DGALAVGSAL VEDELAGKLA DVIVSVLPPN QGEWTDHAPD
PSDTRWAFGV PPRNKANLAW VQQAFAHRAP GGIAVLAASN AVLHESRGCE PGVRAALIES
GCVRAVVSLP GGLFSDGRVP FSIIVLGDKR SVPFETLFVN ALEYGVPNVT RAGRGLPMDA
RDRVVSTVER WIATGSSVFI PGFARSVPES EIVALGDLTP WSYV
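
The gene and protein sequences above can be spot-checked against each other by translating a few codons. The sketch below is a minimal illustration rather than the method used to produce this record. It builds the codon-to-amino-acid mapping from the standard NCBI ordering (bases in the order T, C, A, G), on the assumption that translation table 11 assigns the same amino acids as the standard code and differs only in which codons may start translation. The function name `translate` and the two test fragments (the first 60 bases and the last 15 bases of the gene sequence) are chosen here for illustration.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third fastest, bases T, C, A, G.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Drop the spaces and line breaks used to group the sequence in blocks of ten.
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases and last 15 bases of the gene sequence, copied from this record.
first_60 = "ATGTCGATTA AGATGAAGCG CCTGGAAGAG GTTGCCTGCG TATTCGATGA TCGGTGCGCG"
last_15 = "TGGTCGTACG TATAG"

n_terminal = translate(first_60)
c_terminal = translate(last_15)

print(n_terminal)
print(c_terminal)
```

The first fragment should reproduce the opening 20 residues of the protein sequence, and the second should give its final four residues followed by a stop symbol.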