Gene Elen_3112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_3112 
Symbol 
ID8417448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3619594 
End bp3622494 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content57% 
IMG OID645026092 
ProductDNA polymerase III, epsilon subunit 
Protein accessionYP_003183443 
Protein GI257792837 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
[TIGR01407] DnaQ family exonuclease/DinG family helicase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000116759 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000648685 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATTTCG GCGATCTCGA CCATAACGTC GTGGTTCTGG ACACCGAGAC CACCGGCTTC 
TCTTTCAACC ATGATGAGCT GACGCAGATC GCTGCAGCTC GCATGGAACA AGGCGAAATA
GTCGAATGGT TCATCACGTT CGTGAACCCG GGAAAGCCCA TTCCGGAAGA TGTCGCGCAC
CTGACCGATA TCCACGATAG AGATGTGGCC GATGCTCCGC TTCCCGCCGA TGCGCTGGCC
AAGCTTGTGG AATTCATTGG CGATGCGAAG GTGGTCGCGC ATAACGCAGA GTTCGATCGT
ACGTTCACCA CGCGTCATCC CAGCGGTTAT CCCCTGTTGG AGAATACCTG GATCGATTCG
CTCGATCTTG CGCGAATCGC GCTTCCCCGC ATGAAATCGC ATCGCTTGCT CGATCTTGTG
CGGGCTTTCG GCGCGCCTTT GTCGACACAT CGCGCCGATG CCGACGTCGA GGCAACCTGC
GCCATCTTCC GCATCCTGCT TGCAGGAGTC GCCGCGATGC CGCCGGCGCT TGTGTGCGAG
ATCGCCCATA TGGCGACTCC TGACGAGTGG TCGACGAGGA TGGTATTCGA GCAGTTCGCC
CGAAAGTACG AGGTGGAAGC GAACCAAGAT GTTTCACGTG AAACACCAAG TTTGAGTTTC
TCTTTGCGCG CCCTTCGTCG TGACCGCTTG GGCAAGCTGG AGCGCACGGC GAAACTGGAT
GCCGACGATA TCGCGGCCGA TCCGCAACGC TCGCTTGCGT TCCCTTCAGA TGCGGATATC
GTTCAAGCCT TCTCGGAAAC CGGGCTTGTG GGGTCGTTGT ACGAAGAGTT CGAGCCTCGC
ATAGAGCAGG TCGCTATGGC AGAGGCAGTG AGAAAGGCAT TTTCTTCTTC GGAGAACCTG
TTGGTCGAAG CGGGAACGGG CGTTGGAAAG TCCATGGCGT ACCTGGTTCC CGCCGCGTTG
ACGGCGCGGG CCAACAACAT AGCCGTCGGC GTTGCAACGA AGACTAATAC CCTGCTCGAT
CAGTTGGTTT ACCATGAGCT GCCTGCATTG GAAAAAGCGC TTCGGTTGGC CGACCCTGAC
AGGCCGTCTC TGACGTATGC TCCGCTCAAG GGTTTCTCGC ATTATCCTTG TTTGCGTAAG
ATCGGCCATA TCGTCGAGGA AGGCGCGCAG ACGAAGCTGT TCGGCAACAA AGAGCAGACC
CAGGCTCCCT CCCTTGCCGC CTTGCTGTCG TTTATCGAGC AGACTGAATA CGATGACATG
GACAGCTTGA AAATCGACTA TCGCGTGTTG CCTCGTCGTC TTATCACTAC GACGGCGAAC
GATTGTCTAC GACGTAAGTG TCCTTATTTC GGTACTTCAT GCTTTGTGCA CGGTTCGCGC
CGGCGCGCCG AGGCGGCCGA TATCGTGGTC ACCAACCATA GTTTGCTGTT CTGCGATCTC
GTAGCAGATG GTGGATTGCT GCCGCCGATC CGCTATTGGG TGGTGGATGA GGCCCATGGA
GCCGAAGCCG AAGCTCGTCG GGCTTTCTCG TTGTCTCTTT CGGCCGAGGA CATAACGCGC
CTGGCCAACC GTGTCGGTGC CGATGAGGCT TCCCGTAACG TGTTCGTGCG TGCCGAGCGC
CGTGTGGTGG TGCCCGGCAT GGAAGAGGGT TCTTCGCTGT TCTACGCCCT TACCGGCAAG
GCGCGCTCTG CGGGCAAGGC CTACGCTGAT GCTGCACGGG AATTCTCCGC TCACCTCAAG
GACCTTCTGT TCTTCGATCA GAACAAAAAA GGAAAAGGCT ACGAGATCGT CGAGCTGTGG
ATCAATTCAG ATGTGCGTTC CAGCGCAACG TTTGCCCAGG TAGAAAGCTA TGGTCGAGCG
ATGACCGAAG CAGCGGAGAA GCTGGTGCGC GTGTGTCAGG AGCTCGTGGG ATATCTGGAA
GAGCTCGAAG GCGCGGCAGA GATCCAGCGC GAGATAGCAT CGACCGCGAT GGAGCTCAAA
GACCAGATGA ACGCCGCCGA CATCATCTTG GACAGGGCTC CGGAAACGTA TGCCTATGCG
GCCACTTTGA GTCGCAAGAA AGATCGCGTC GCGGAAAAGC TGGAAGCGCT GCTGATCAAC
GTGGGCGAAG CGATGAACGA GACGTTGTTC GAGCGCACGC ACTCGACGGT GTTCGCGTCT
GCCACGTTGG CTGTGGACGA TAAATTCGAC GCATTCGAAA GCGCCCTGGG ATTGAACGCT
TCGGAGCATT CGACCTGTCA GATGTGCAAG CTGGATTCAA GCTATGATTT CGACGCTCAT
ATGACAGTGT ACGTGGCCAG CGATATGCCG GAACCGAACG AAGCCGCGTA TCTTGCCGCG
CTGCAGCGCC TGCTGGTGGA CGTTCACCGT GCGCAGAACG GCTCGATGCT GACCTTGTTC
ACGAACCGTC GCGAGATGGA GCGTTGCTTT GACGAGGTAC AACCCCAACT CAAAGTCGAC
GACCTGCGCG TCGTGTGTCA AAAATGGGGT GTGTCGGTAA AAGGTTTGCG CGATGATTTT
CTGGCGGACG AGCATCTTTC GCTCTTCGCG CTCAAAAGCT TTTGGGAGGG ATTCGATGCT
CCGGGCGCTA CGCTCAAAGG GGTCGTCATT CCCAAGCTGC CCTTCGCGAA GCCGACCGAT
CCTTTGTCGT GCGAACGGGC GGCTCGCGAC GATCAGGCGT GGAGGCGCTA CGTGCTACCC
GCCGCCGTTC TTGAGACGAA ACAGGCTGCT GGTCGACTTA TCAGGAAGGC CGATGATACG
GGCGTGCTTA TTCTCGCCGA TCGTCGCTTG TTGACGAAAA GCTACGGCAA AGCGTTCCTT
AATTCCCTTC CGAGCAGGAC CATCAAGGTG ATGACGGCGG CTGAGATTGT AGCCGATCTC
GAGCGCTCCA ACAACGGGTA A
 
Protein sequence
MDFGDLDHNV VVLDTETTGF SFNHDELTQI AAARMEQGEI VEWFITFVNP GKPIPEDVAH 
LTDIHDRDVA DAPLPADALA KLVEFIGDAK VVAHNAEFDR TFTTRHPSGY PLLENTWIDS
LDLARIALPR MKSHRLLDLV RAFGAPLSTH RADADVEATC AIFRILLAGV AAMPPALVCE
IAHMATPDEW STRMVFEQFA RKYEVEANQD VSRETPSLSF SLRALRRDRL GKLERTAKLD
ADDIAADPQR SLAFPSDADI VQAFSETGLV GSLYEEFEPR IEQVAMAEAV RKAFSSSENL
LVEAGTGVGK SMAYLVPAAL TARANNIAVG VATKTNTLLD QLVYHELPAL EKALRLADPD
RPSLTYAPLK GFSHYPCLRK IGHIVEEGAQ TKLFGNKEQT QAPSLAALLS FIEQTEYDDM
DSLKIDYRVL PRRLITTTAN DCLRRKCPYF GTSCFVHGSR RRAEAADIVV TNHSLLFCDL
VADGGLLPPI RYWVVDEAHG AEAEARRAFS LSLSAEDITR LANRVGADEA SRNVFVRAER
RVVVPGMEEG SSLFYALTGK ARSAGKAYAD AAREFSAHLK DLLFFDQNKK GKGYEIVELW
INSDVRSSAT FAQVESYGRA MTEAAEKLVR VCQELVGYLE ELEGAAEIQR EIASTAMELK
DQMNAADIIL DRAPETYAYA ATLSRKKDRV AEKLEALLIN VGEAMNETLF ERTHSTVFAS
ATLAVDDKFD AFESALGLNA SEHSTCQMCK LDSSYDFDAH MTVYVASDMP EPNEAAYLAA
LQRLLVDVHR AQNGSMLTLF TNRREMERCF DEVQPQLKVD DLRVVCQKWG VSVKGLRDDF
LADEHLSLFA LKSFWEGFDA PGATLKGVVI PKLPFAKPTD PLSCERAARD DQAWRRYVLP
AAVLETKQAA GRLIRKADDT GVLILADRRL LTKSYGKAFL NSLPSRTIKV MTAAEIVADL
ERSNNG