Gene Information |
Locus tag | Elen_3112 |
Symbol | |
ID | 8417448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3619594 |
End bp | 3622494 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 645026092 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_003183443 |
Protein GI | 257792837 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative |
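The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions, and that the reported protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3619594
end_bp = 3622494
gene_length_bp = 2901
protein_length_aa = 966

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
codons = span // 3
expected_protein = codons - 1

print("span (bp):", span)
print("span divisible by 3:", span % 3 == 0)
print("expected protein length (aa):", expected_protein)
print("matches record:", span == gene_length_bp and expected_protein == protein_length_aa)
```

Under these assumptions, the span equals the listed Gene Length, and the codon count minus one stop codon equals the listed Protein Length.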
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000116759 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000000000648685 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATTTCG GCGATCTCGA CCATAACGTC GTGGTTCTGG ACACCGAGAC CACCGGCTTC TCTTTCAACC ATGATGAGCT GACGCAGATC GCTGCAGCTC GCATGGAACA AGGCGAAATA GTCGAATGGT TCATCACGTT CGTGAACCCG GGAAAGCCCA TTCCGGAAGA TGTCGCGCAC CTGACCGATA TCCACGATAG AGATGTGGCC GATGCTCCGC TTCCCGCCGA TGCGCTGGCC AAGCTTGTGG AATTCATTGG CGATGCGAAG GTGGTCGCGC ATAACGCAGA GTTCGATCGT ACGTTCACCA CGCGTCATCC CAGCGGTTAT CCCCTGTTGG AGAATACCTG GATCGATTCG CTCGATCTTG CGCGAATCGC GCTTCCCCGC ATGAAATCGC ATCGCTTGCT CGATCTTGTG CGGGCTTTCG GCGCGCCTTT GTCGACACAT CGCGCCGATG CCGACGTCGA GGCAACCTGC GCCATCTTCC GCATCCTGCT TGCAGGAGTC GCCGCGATGC CGCCGGCGCT TGTGTGCGAG ATCGCCCATA TGGCGACTCC TGACGAGTGG TCGACGAGGA TGGTATTCGA GCAGTTCGCC CGAAAGTACG AGGTGGAAGC GAACCAAGAT GTTTCACGTG AAACACCAAG TTTGAGTTTC TCTTTGCGCG CCCTTCGTCG TGACCGCTTG GGCAAGCTGG AGCGCACGGC GAAACTGGAT GCCGACGATA TCGCGGCCGA TCCGCAACGC TCGCTTGCGT TCCCTTCAGA TGCGGATATC GTTCAAGCCT TCTCGGAAAC CGGGCTTGTG GGGTCGTTGT ACGAAGAGTT CGAGCCTCGC ATAGAGCAGG TCGCTATGGC AGAGGCAGTG AGAAAGGCAT TTTCTTCTTC GGAGAACCTG TTGGTCGAAG CGGGAACGGG CGTTGGAAAG TCCATGGCGT ACCTGGTTCC CGCCGCGTTG ACGGCGCGGG CCAACAACAT AGCCGTCGGC GTTGCAACGA AGACTAATAC CCTGCTCGAT CAGTTGGTTT ACCATGAGCT GCCTGCATTG GAAAAAGCGC TTCGGTTGGC CGACCCTGAC AGGCCGTCTC TGACGTATGC TCCGCTCAAG GGTTTCTCGC ATTATCCTTG TTTGCGTAAG ATCGGCCATA TCGTCGAGGA AGGCGCGCAG ACGAAGCTGT TCGGCAACAA AGAGCAGACC CAGGCTCCCT CCCTTGCCGC CTTGCTGTCG TTTATCGAGC AGACTGAATA CGATGACATG GACAGCTTGA AAATCGACTA TCGCGTGTTG CCTCGTCGTC TTATCACTAC GACGGCGAAC GATTGTCTAC GACGTAAGTG TCCTTATTTC GGTACTTCAT GCTTTGTGCA CGGTTCGCGC CGGCGCGCCG AGGCGGCCGA TATCGTGGTC ACCAACCATA GTTTGCTGTT CTGCGATCTC GTAGCAGATG GTGGATTGCT GCCGCCGATC CGCTATTGGG TGGTGGATGA GGCCCATGGA GCCGAAGCCG AAGCTCGTCG GGCTTTCTCG TTGTCTCTTT CGGCCGAGGA CATAACGCGC CTGGCCAACC GTGTCGGTGC CGATGAGGCT TCCCGTAACG TGTTCGTGCG TGCCGAGCGC CGTGTGGTGG TGCCCGGCAT GGAAGAGGGT TCTTCGCTGT TCTACGCCCT TACCGGCAAG GCGCGCTCTG CGGGCAAGGC CTACGCTGAT GCTGCACGGG AATTCTCCGC TCACCTCAAG GACCTTCTGT TCTTCGATCA GAACAAAAAA GGAAAAGGCT ACGAGATCGT CGAGCTGTGG 
ATCAATTCAG ATGTGCGTTC CAGCGCAACG TTTGCCCAGG TAGAAAGCTA TGGTCGAGCG ATGACCGAAG CAGCGGAGAA GCTGGTGCGC GTGTGTCAGG AGCTCGTGGG ATATCTGGAA GAGCTCGAAG GCGCGGCAGA GATCCAGCGC GAGATAGCAT CGACCGCGAT GGAGCTCAAA GACCAGATGA ACGCCGCCGA CATCATCTTG GACAGGGCTC CGGAAACGTA TGCCTATGCG GCCACTTTGA GTCGCAAGAA AGATCGCGTC GCGGAAAAGC TGGAAGCGCT GCTGATCAAC GTGGGCGAAG CGATGAACGA GACGTTGTTC GAGCGCACGC ACTCGACGGT GTTCGCGTCT GCCACGTTGG CTGTGGACGA TAAATTCGAC GCATTCGAAA GCGCCCTGGG ATTGAACGCT TCGGAGCATT CGACCTGTCA GATGTGCAAG CTGGATTCAA GCTATGATTT CGACGCTCAT ATGACAGTGT ACGTGGCCAG CGATATGCCG GAACCGAACG AAGCCGCGTA TCTTGCCGCG CTGCAGCGCC TGCTGGTGGA CGTTCACCGT GCGCAGAACG GCTCGATGCT GACCTTGTTC ACGAACCGTC GCGAGATGGA GCGTTGCTTT GACGAGGTAC AACCCCAACT CAAAGTCGAC GACCTGCGCG TCGTGTGTCA AAAATGGGGT GTGTCGGTAA AAGGTTTGCG CGATGATTTT CTGGCGGACG AGCATCTTTC GCTCTTCGCG CTCAAAAGCT TTTGGGAGGG ATTCGATGCT CCGGGCGCTA CGCTCAAAGG GGTCGTCATT CCCAAGCTGC CCTTCGCGAA GCCGACCGAT CCTTTGTCGT GCGAACGGGC GGCTCGCGAC GATCAGGCGT GGAGGCGCTA CGTGCTACCC GCCGCCGTTC TTGAGACGAA ACAGGCTGCT GGTCGACTTA TCAGGAAGGC CGATGATACG GGCGTGCTTA TTCTCGCCGA TCGTCGCTTG TTGACGAAAA GCTACGGCAA AGCGTTCCTT AATTCCCTTC CGAGCAGGAC CATCAAGGTG ATGACGGCGG CTGAGATTGT AGCCGATCTC GAGCGCTCCA ACAACGGGTA A
|
Protein sequence | MDFGDLDHNV VVLDTETTGF SFNHDELTQI AAARMEQGEI VEWFITFVNP GKPIPEDVAH LTDIHDRDVA DAPLPADALA KLVEFIGDAK VVAHNAEFDR TFTTRHPSGY PLLENTWIDS LDLARIALPR MKSHRLLDLV RAFGAPLSTH RADADVEATC AIFRILLAGV AAMPPALVCE IAHMATPDEW STRMVFEQFA RKYEVEANQD VSRETPSLSF SLRALRRDRL GKLERTAKLD ADDIAADPQR SLAFPSDADI VQAFSETGLV GSLYEEFEPR IEQVAMAEAV RKAFSSSENL LVEAGTGVGK SMAYLVPAAL TARANNIAVG VATKTNTLLD QLVYHELPAL EKALRLADPD RPSLTYAPLK GFSHYPCLRK IGHIVEEGAQ TKLFGNKEQT QAPSLAALLS FIEQTEYDDM DSLKIDYRVL PRRLITTTAN DCLRRKCPYF GTSCFVHGSR RRAEAADIVV TNHSLLFCDL VADGGLLPPI RYWVVDEAHG AEAEARRAFS LSLSAEDITR LANRVGADEA SRNVFVRAER RVVVPGMEEG SSLFYALTGK ARSAGKAYAD AAREFSAHLK DLLFFDQNKK GKGYEIVELW INSDVRSSAT FAQVESYGRA MTEAAEKLVR VCQELVGYLE ELEGAAEIQR EIASTAMELK DQMNAADIIL DRAPETYAYA ATLSRKKDRV AEKLEALLIN VGEAMNETLF ERTHSTVFAS ATLAVDDKFD AFESALGLNA SEHSTCQMCK LDSSYDFDAH MTVYVASDMP EPNEAAYLAA LQRLLVDVHR AQNGSMLTLF TNRREMERCF DEVQPQLKVD DLRVVCQKWG VSVKGLRDDF LADEHLSLFA LKSFWEGFDA PGATLKGVVI PKLPFAKPTD PLSCERAARD DQAWRRYVLP AAVLETKQAA GRLIRKADDT GVLILADRRL LTKSYGKAFL NSLPSRTIKV MTAAEIVADL ERSNNG
|
| |