Gene Elen_1515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1515 
Symbol 
ID8415813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1806396 
End bp1809824 
Gene Length3429 bp 
Protein Length1142 aa 
Translation table11 
GC content67% 
IMG OID645024483 
Productprotein of unknown function DUF450 
Protein accessionYP_003181872 
Protein GI257791266 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.695629 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAGG TCGATATGCA TATCCCAGGC AAATCCGATC TCGACGTCAG TTTCGACGGC 
AGCCGGAAGA TCCATCAGGA GGCGCCGTTC GAGCGCTACA TATCTCGCAA GGTCGCAGGG
CTGGAGCATT CGGGCTGGGC CCTCTCACCT GACGATACGG GGTTCGACCC GAACGACGCA
CTCGTGTTCG ACGACTTCGT CGCCTGGCTC GAGGCCACCG CGCCGGAGAA GCTCGACAAG
ATGCGCCGAG AGAAGGGCGC CCGCTGGGCC GACCAGCTGA AGCGCGCCCT GGTGAAGAGT
CTCGAGAGGA ACGGCACCGT CCTCACGCTG CGCAAGGGCT TCCAGATGGC GGGCTATCAG
ACCATCGAGT GCATGGCGGG CTACCCGGAC GACGAGCGCG TGCCCGGGGC GAGGGAGCGC
TACGACGCCA ACATCCTGCG CTTCATGCAC CAGGTCCACT ACCAGACCGC CGGGAGCAAG
TCTCTCGACT TCGTCCTGTT CGTCAACGGC ATCCCCGTCG CCACCGGCGA GGTCAAGACC
GAGCTCACCC AGACCGTGCG TGACGCCATC GACGAGTACG CCGACGAACG TAAGCCGGTG
GAGCCCGACG GCGGCAGGAA GAACCCGCTG CTCATGTACA AGAGGGGCGC CGTGGTCCAC
TTCGCCGTCT CCGAGGACGA GGTCTGGATG TGCACGAACC TCGGCCCCTT CCCGAGCAAG
TCGGCGAGGC CCCGCTTCCT CCCGTTCAAC CTGGGACGCG ACGGCGGGGC GGGCAACCCG
GACGCGCCGG AGGGCGAGTA CAGGACCCAC TACCTCTGGG ACACCATCCT CCAGCGCGAC
AACTGGCTGT CCATCTTCGA CCGCTTCGTC TTCGAGGAGG TCGAGGACAG GCAGGACGCG
AGCGGCCGGT GGCGCAGACA GGCGACCCAG ATCTTCCCCC GCTACCACCA GTTCGACGCC
GAGAGGAAGG TCCTGGCCGA CGCGCGCGAG CACGGGGCGG GGCGCCGCTA CCTCATCGAG
CACTCGGCCG GCTCCGGCAA GACCGAGACC ATCTCATGGG CTGCGCACGA CCTCGCGAGG
CTCCGCCGCG CGGACGGCGA GAAGATGTTC TCGAGCGTCG TGGTGGTCAC CGACAGGCTG
TCGCTCGACC AGAACATAAA GAAGACCATC TCCCAGCTCT CCAAGGTGAC GGGACAGGTC
GTCATGATCG GCCGCGACGA GAGGGGCAAC GCCGTCTCGG ACGGCTCAAA GAGCTCCCAG
CTCGTCGAGG CCCTGCGCCA GAAGAGGGAG ATCGTCGTGG TCACCATCCA GACGTTCCTC
TACGCCTGGC CGGACATCGC GGTGCTGCCC GAGCTCGACG GCGCCGGCTT CGCGGTGATC
GTCGACGAGG CGCACAGCTC CCAGGAGGGC AGCTCGGCGG CGTCCCTCAA GACCGCCTTC
AACGCGGCGG CGGACAAGCT GAAGTTCGAG AGGATGCTCG ACTTCGACGA CGACTCGGGC
GACGACGGGT CGGTCATGGA CGCCTACTTC GCCCAGATGC AGGCGAGCAA CCTCATGCCG
CCGAACGTCT CCTTCCTCGC GTTCACCGCG ACCCCCAAGG CCGAGACCAA GACCCTCTTC
GGCACGCCGA CGGGCGAGGA GGACGAGGAC GGCAACCCGG TCATGGGCAG CTTCCACCTC
TACCCGATGC GGCAGGCGAT CGAGGAGGGC TACATCATCG ACCCGCTCTC GGGATACGTG
CCCTTGAAGA CCCTCACGAG AATCGAGGAC GAGTCCGAGG ACGCCGACGG GAGACTCGTC
GACGAGCGCC GCGCCAGGAG GAAGATCGCC AAGTGGCGCT CGCTCCACCC GACGAACGTC
ATGGAGAAGG CCAAGTGGAT CATCGACCAC TTCGTCGACA ACGTGGCGCC GCTCCTGAAC
GGCGAGGCCA AGGCCATGAT CGTCACCTCC GGGCGTCCCG AGGTCGTGCG CTACAAGTAC
GCGATAGAGG CGTACCTCCG CGCCCGCCCG GACCTCGACC CCGCCAAGGT CGAGCCGAGG
CTGCGCTTCA AGGTGCCCGG CGAGCCGCTC GTCGCCTTCT CCCAGAAGGT CCTGGGCGAC
AGGTGCGTCC TGCCCGAGGA CGAGTACCTC GAGGACAACC CGTTCGCCCT CATCGAGCGC
GGATACGAGT ACACCGAGGC CAACATGAAC CCGACCGGGC AGGGCCCGGT CGAGAGGGCG
TTCGACCGCC CCGAGAGCAG GCTGCTCATC GTGGCCAACA AGTTCCAGAC CGGGTTCGAC
CAGAAGAAGC TAGTCGCCCT CTACGTCGAC AAGCCGCTCG GCAACGCCAT CGAGATCGTG
CAGACCTACT CGCGCGTCAA CAGGACGTGC TCCGGCAAGG ACAGGGTCTT CGTCGTCGAC
TTCGTGAACG ACCCCGAGAC GGTTCTCGCC GCCTTCAAGA CCTACGACAA GGGCGCGACG
ATGAGCGCGG CCCAGGACCC GAACGTCGTC TACGACCTGA AGGACGCCCT CGACGCGGCC
GACGTGTTCG CGACGCGAGA CGTCGAGGGG TTCAAGGAGG CGCTCTACAG GTCCAAGGGC
CTCGCAGCCG GCGGGGAGAG CGATGCCTAC AGGACCGCCC TCTACTCCGC CGTCTCCGGG
CCCGCCGAGA TCTTCCGCGA GAGGTTCGCG GCGGCCAGGG ACTCCTACGA GACTTGGCTG
GAGTGCGCCA ACCGGGCGGA GGCGGCCGGC GAGGCCGAGG AGGCGGAGAG GGCCAAGAGG
AGCGCGGACG AGGCGGCGGG ACGCGTCGCG GAGCTGATGA CCTTCAGGAA GAAGCTCTCC
AAGTACTGCT CCGCCTACAC GTTCCTGTCC CAGGCGCTCG ACTTCGGCGA CCCCGACCTC
GAGGTGTTCT ACAGCTTCGC AAAGCTGCTC GGCCACAGGA TCGCGGGCAC CTCGCTCGAC
GACGTCGACG TGCGCGGCCT CGTGCTCCGG GACTTCCGCA TATCGGCCCA GAAGGTCCCC
GAGGGGGTCG AGGGCGGCGA GCTCATGCCG ATGGGCGCGG GCGGCTCAGG CGCCGCTCCC
AAGCGCGACA CCATGGCGAG GATCGTGGAG AGGCTGAACC GCACGTGGGG CGACGAGGGC
GACCCCCTCG TGAAGGCCCG AGCCGTCAAT GCGGTCTCCG ACGCCGTGGC GGCAGACCCC
GTGACCAACA CGCAGATCAC GAACACGAGC AACACGAAGG AGGCTGTCCT CGCGGACGGG
CGCCTGCGCA ACATCGTCAT CAGGGCCCTC ATGTCGATGA TGAGCAACGA GCTGGGAGAC
CTCGCCGAGC AGGCGCTCGA CGACCCGCAG GCGATCGAGC CCCTGGCCGA CCAGGTGTAC
GACCTCATCT CGCAGGGGAA GCGCTACGAC ATCGCCGAGC TCGCCTCCTA CCTCTCGAAG
GGGGAATAG
 
Protein sequence
MTKVDMHIPG KSDLDVSFDG SRKIHQEAPF ERYISRKVAG LEHSGWALSP DDTGFDPNDA 
LVFDDFVAWL EATAPEKLDK MRREKGARWA DQLKRALVKS LERNGTVLTL RKGFQMAGYQ
TIECMAGYPD DERVPGARER YDANILRFMH QVHYQTAGSK SLDFVLFVNG IPVATGEVKT
ELTQTVRDAI DEYADERKPV EPDGGRKNPL LMYKRGAVVH FAVSEDEVWM CTNLGPFPSK
SARPRFLPFN LGRDGGAGNP DAPEGEYRTH YLWDTILQRD NWLSIFDRFV FEEVEDRQDA
SGRWRRQATQ IFPRYHQFDA ERKVLADARE HGAGRRYLIE HSAGSGKTET ISWAAHDLAR
LRRADGEKMF SSVVVVTDRL SLDQNIKKTI SQLSKVTGQV VMIGRDERGN AVSDGSKSSQ
LVEALRQKRE IVVVTIQTFL YAWPDIAVLP ELDGAGFAVI VDEAHSSQEG SSAASLKTAF
NAAADKLKFE RMLDFDDDSG DDGSVMDAYF AQMQASNLMP PNVSFLAFTA TPKAETKTLF
GTPTGEEDED GNPVMGSFHL YPMRQAIEEG YIIDPLSGYV PLKTLTRIED ESEDADGRLV
DERRARRKIA KWRSLHPTNV MEKAKWIIDH FVDNVAPLLN GEAKAMIVTS GRPEVVRYKY
AIEAYLRARP DLDPAKVEPR LRFKVPGEPL VAFSQKVLGD RCVLPEDEYL EDNPFALIER
GYEYTEANMN PTGQGPVERA FDRPESRLLI VANKFQTGFD QKKLVALYVD KPLGNAIEIV
QTYSRVNRTC SGKDRVFVVD FVNDPETVLA AFKTYDKGAT MSAAQDPNVV YDLKDALDAA
DVFATRDVEG FKEALYRSKG LAAGGESDAY RTALYSAVSG PAEIFRERFA AARDSYETWL
ECANRAEAAG EAEEAERAKR SADEAAGRVA ELMTFRKKLS KYCSAYTFLS QALDFGDPDL
EVFYSFAKLL GHRIAGTSLD DVDVRGLVLR DFRISAQKVP EGVEGGELMP MGAGGSGAAP
KRDTMARIVE RLNRTWGDEG DPLVKARAVN AVSDAVAADP VTNTQITNTS NTKEAVLADG
RLRNIVIRAL MSMMSNELGD LAEQALDDPQ AIEPLADQVY DLISQGKRYD IAELASYLSK
GE