Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2014 |
Symbol | |
ID | 8416325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2359599 |
End bp | 2362226 |
Gene Length | 2628 bp |
Protein Length | 875 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645024991 |
Product | DNA polymerase I |
Protein accession | YP_003182367 |
Protein GI | 257791761 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000063172 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000000000410772 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGCCGAAGA AGATCGCCGT CATCGACGGC AACTCGCTTA TGCACCGCGC CTACCATGCG GTGCCCCAGA CGATGAACGC TCCCGACGGG CGGCCCACCA ACGCCGTGTT CGGCTTCGTG GCCATGCTGC TCAAGTTCAT CGACATCGCG AATCCCGACG CGCTCATCTG CGCGTTCGAT GCGGGTCGTC CGGCGTTTCG CATGGAGGCG CTCGAACAGT ACAAGGCACA GCGCCCTCCC ATGGACGACG ACCTCAAGGT GCAGTTCCCC ATCGTCGAGG AGCTGCTGGA GGCCATGAAC GTGCCCGTGG TGCGCATCAA GGGCTGGGAG GGCGACGACG TGCTGGGCAC CATCGCCGCG CGCGACGAGG AGCTGGGCTA CGAGACGCTG CTCGTGACGG GCGATAAGGA CGCCTACCAG CTGGCAACCG ACAAGACGCG CATCGTCACC ACGAAGAAGG GCATCACCGA CGTGGCCATC TACGGCCCGG CCGAGGTGCT CGAGCGCTAC GGCGTGCGCC CCGACCAGTT CATCGACTTC CTCGGCCTCA AGGGCGACTC GTCCGACAAC ATCCCCGGCG TGCCCGGCAT CGGCGACAAG ACCGCCGCGA AGCTGCTGCA GACCTACGGC AACCTCGAGG GCATCTACGA GCACGTGGAC GATCTCAAGG GCAAGCAGAA AGAGAAGATC GTCGACAACA AGGACATGGC GTATCTGAGC CGCGAGGTGG CCACCATCGT GCGCGACCTC GACTTCCCGC TCGACCTCGA AGCGTGCTCG TTCCCGTCGT TCGATTCCGA CAAGGTGACC GAGGCGTTCA AGAGCGTGCA GTTCAACGCG CACCTCAGCC GCGTGCTCAA GCTCGTGGGC AAGGAGCTTG AGAAGAAGGC GGCCCCGCTC GTGGTGGAGC CGGTGGTCTC GGGGCCCGAG GCGCAGGCGC TCGTCGACGC GGCCATCGCG CGCGGCGAGA CGGTGGGCGT GGCGTTCATC GAGCCCGAGC AGGTGTCGCT GTTCAACGCC GGCCTGCACG GCGCGGTGAA CACGAGCGAG GGCACGGCGG TGTTCGAGGA CGACGAGGGC CGCGAGGCGT TCGCGCGCAT CGTGCGCGCC GGCTCGTTCG CCGCGCTCGA CGTCAAGCGC GAGGTGCATC GCATCTATCC CGCCGATACG GCCGAGGCCG CGCTCGTGGA GGACGCCGAG CTCATGGGCA TGCGCGCGTT CGACCTGGGG CTGGCCGGCT ACGTGCTGAA CTCGTCGGTG TCCGAGTACT CCTTCGACGC GCTGCTCGAC GCCTACTACG GCGGCGTGCT GCCCGAGACG AAGGACGAGG CCGGCGCCGT GGCGGCCCAG GCCGCGGCGG CGCGCATGCT GGTGGGCCCG CTCACCGACG CGCTCGGGCG CGACGAGAGC AAGCGCGCCT ACTTCGACAT CGACCTGCCG CTCGTGGCGG TGCTCGCCAT CGTCGAGCGC ACGGGCGCCG CGGTGGATTG CGACCGCCTA GCCGAGCTGG GGGCCACGAC GCAAGCCGAG CTCGACGAGC TGCGTGCGCG CATCATCGAG ATCGCGGGGG AGGAGTTCAA CCTCGACAGC CCCAAGCAGC TCGCGCACAT CCTGTTCGAG GTGCTGGGCC TGCGCACGCT CAGGAAGAAC CAGCGCGGCT ACTCCACCGA CGCCGCCGTG CTCAAGGAGC TGTCGAACGA CCACGAGCTG CCGGCGCTCG TCCTGCGCTA CCGCGAGCTG GCGAAGATCA AGTCCACCTA CATCGACGCG CTGCCCCGCA TGCGCGCCGA CGACGGGCGC GTGCACACGA GCTTCAACGA GACGGTGACC ACCACGGGGC GCCTGTCGTC GTCCGAGCCG AACCTGCAGA ACATCCCCGT GCGCACCGAG TTCGGCCGCC AGATCCGCGA ATGCTTCGTG CCGCTCGAGG AGGGCCACGC GTTCCTCTCG GCCGACTACT CGCAGATCGA GCTGCGCCTG CTCGCGCACC TGTCGAACGA CGAGCACCTC GTGGCCGCAT TCTGCTCCGG CGCCGATTTC CACGCCGCCA CGGCCAGCCG CGTGTTCGGC CTGCCCGTGG AGGACGTTAG CCCTGAGCTG CGCAGCCGCG CGAAGGCCGT GAACTTCGGC ATCGTGTACG GCCAGCAGGC GTTCGGCCTG TCACAGAGCC TGGGCATCCC GTTCGGCGAG GCCAAGGAGA TGATCGAGCG CTACTTCGAG GCCTACCCCG GCGTGCGCGC CTACCTCGAC CGCACCATCG CCGAAGCGAA GGAGAAGGGC TATGCCGAGA CGATGTTCGG CCGCAAGCGC CATATCCCCG AGCTCAAGGC CGCGAACGCC ACGCAGCGCG GCTTCGGCGA GCGCACGGCC ATGAACCACC CCATGCAGGG CAGCGCGGCC GACATCATCA AGCTGGCCAT GACCGAGGTG CAGCGCCGCA TCATGGAGCG CGGCTTCGAA GCGAAGCTGC TGTTGCAGGT GCACGACGAG CTGGACTTCA GCGTGCCGGA AGGCGAGATC GAGGAGCTGT CGGCCATCGT GAAAGACGTG ATGGAGCACA TCGTGGATCT GCGCGTGCCT CTCGACGTCG ACGTCTCCTA TGCCGATAAC TGGGCGGAAG CGCACTGA
|
Protein sequence | MPKKIAVIDG NSLMHRAYHA VPQTMNAPDG RPTNAVFGFV AMLLKFIDIA NPDALICAFD AGRPAFRMEA LEQYKAQRPP MDDDLKVQFP IVEELLEAMN VPVVRIKGWE GDDVLGTIAA RDEELGYETL LVTGDKDAYQ LATDKTRIVT TKKGITDVAI YGPAEVLERY GVRPDQFIDF LGLKGDSSDN IPGVPGIGDK TAAKLLQTYG NLEGIYEHVD DLKGKQKEKI VDNKDMAYLS REVATIVRDL DFPLDLEACS FPSFDSDKVT EAFKSVQFNA HLSRVLKLVG KELEKKAAPL VVEPVVSGPE AQALVDAAIA RGETVGVAFI EPEQVSLFNA GLHGAVNTSE GTAVFEDDEG REAFARIVRA GSFAALDVKR EVHRIYPADT AEAALVEDAE LMGMRAFDLG LAGYVLNSSV SEYSFDALLD AYYGGVLPET KDEAGAVAAQ AAAARMLVGP LTDALGRDES KRAYFDIDLP LVAVLAIVER TGAAVDCDRL AELGATTQAE LDELRARIIE IAGEEFNLDS PKQLAHILFE VLGLRTLRKN QRGYSTDAAV LKELSNDHEL PALVLRYREL AKIKSTYIDA LPRMRADDGR VHTSFNETVT TTGRLSSSEP NLQNIPVRTE FGRQIRECFV PLEEGHAFLS ADYSQIELRL LAHLSNDEHL VAAFCSGADF HAATASRVFG LPVEDVSPEL RSRAKAVNFG IVYGQQAFGL SQSLGIPFGE AKEMIERYFE AYPGVRAYLD RTIAEAKEKG YAETMFGRKR HIPELKAANA TQRGFGERTA MNHPMQGSAA DIIKLAMTEV QRRIMERGFE AKLLLQVHDE LDFSVPEGEI EELSAIVKDV MEHIVDLRVP LDVDVSYADN WAEAH
|
| |