Gene Elen_2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2014 
Symbol 
ID8416325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2359599 
End bp2362226 
Gene Length2628 bp 
Protein Length875 aa 
Translation table11 
GC content68% 
IMG OID645024991 
ProductDNA polymerase I 
Protein accessionYP_003182367 
Protein GI257791761 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000063172 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000000410772 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCCGAAGA AGATCGCCGT CATCGACGGC AACTCGCTTA TGCACCGCGC CTACCATGCG 
GTGCCCCAGA CGATGAACGC TCCCGACGGG CGGCCCACCA ACGCCGTGTT CGGCTTCGTG
GCCATGCTGC TCAAGTTCAT CGACATCGCG AATCCCGACG CGCTCATCTG CGCGTTCGAT
GCGGGTCGTC CGGCGTTTCG CATGGAGGCG CTCGAACAGT ACAAGGCACA GCGCCCTCCC
ATGGACGACG ACCTCAAGGT GCAGTTCCCC ATCGTCGAGG AGCTGCTGGA GGCCATGAAC
GTGCCCGTGG TGCGCATCAA GGGCTGGGAG GGCGACGACG TGCTGGGCAC CATCGCCGCG
CGCGACGAGG AGCTGGGCTA CGAGACGCTG CTCGTGACGG GCGATAAGGA CGCCTACCAG
CTGGCAACCG ACAAGACGCG CATCGTCACC ACGAAGAAGG GCATCACCGA CGTGGCCATC
TACGGCCCGG CCGAGGTGCT CGAGCGCTAC GGCGTGCGCC CCGACCAGTT CATCGACTTC
CTCGGCCTCA AGGGCGACTC GTCCGACAAC ATCCCCGGCG TGCCCGGCAT CGGCGACAAG
ACCGCCGCGA AGCTGCTGCA GACCTACGGC AACCTCGAGG GCATCTACGA GCACGTGGAC
GATCTCAAGG GCAAGCAGAA AGAGAAGATC GTCGACAACA AGGACATGGC GTATCTGAGC
CGCGAGGTGG CCACCATCGT GCGCGACCTC GACTTCCCGC TCGACCTCGA AGCGTGCTCG
TTCCCGTCGT TCGATTCCGA CAAGGTGACC GAGGCGTTCA AGAGCGTGCA GTTCAACGCG
CACCTCAGCC GCGTGCTCAA GCTCGTGGGC AAGGAGCTTG AGAAGAAGGC GGCCCCGCTC
GTGGTGGAGC CGGTGGTCTC GGGGCCCGAG GCGCAGGCGC TCGTCGACGC GGCCATCGCG
CGCGGCGAGA CGGTGGGCGT GGCGTTCATC GAGCCCGAGC AGGTGTCGCT GTTCAACGCC
GGCCTGCACG GCGCGGTGAA CACGAGCGAG GGCACGGCGG TGTTCGAGGA CGACGAGGGC
CGCGAGGCGT TCGCGCGCAT CGTGCGCGCC GGCTCGTTCG CCGCGCTCGA CGTCAAGCGC
GAGGTGCATC GCATCTATCC CGCCGATACG GCCGAGGCCG CGCTCGTGGA GGACGCCGAG
CTCATGGGCA TGCGCGCGTT CGACCTGGGG CTGGCCGGCT ACGTGCTGAA CTCGTCGGTG
TCCGAGTACT CCTTCGACGC GCTGCTCGAC GCCTACTACG GCGGCGTGCT GCCCGAGACG
AAGGACGAGG CCGGCGCCGT GGCGGCCCAG GCCGCGGCGG CGCGCATGCT GGTGGGCCCG
CTCACCGACG CGCTCGGGCG CGACGAGAGC AAGCGCGCCT ACTTCGACAT CGACCTGCCG
CTCGTGGCGG TGCTCGCCAT CGTCGAGCGC ACGGGCGCCG CGGTGGATTG CGACCGCCTA
GCCGAGCTGG GGGCCACGAC GCAAGCCGAG CTCGACGAGC TGCGTGCGCG CATCATCGAG
ATCGCGGGGG AGGAGTTCAA CCTCGACAGC CCCAAGCAGC TCGCGCACAT CCTGTTCGAG
GTGCTGGGCC TGCGCACGCT CAGGAAGAAC CAGCGCGGCT ACTCCACCGA CGCCGCCGTG
CTCAAGGAGC TGTCGAACGA CCACGAGCTG CCGGCGCTCG TCCTGCGCTA CCGCGAGCTG
GCGAAGATCA AGTCCACCTA CATCGACGCG CTGCCCCGCA TGCGCGCCGA CGACGGGCGC
GTGCACACGA GCTTCAACGA GACGGTGACC ACCACGGGGC GCCTGTCGTC GTCCGAGCCG
AACCTGCAGA ACATCCCCGT GCGCACCGAG TTCGGCCGCC AGATCCGCGA ATGCTTCGTG
CCGCTCGAGG AGGGCCACGC GTTCCTCTCG GCCGACTACT CGCAGATCGA GCTGCGCCTG
CTCGCGCACC TGTCGAACGA CGAGCACCTC GTGGCCGCAT TCTGCTCCGG CGCCGATTTC
CACGCCGCCA CGGCCAGCCG CGTGTTCGGC CTGCCCGTGG AGGACGTTAG CCCTGAGCTG
CGCAGCCGCG CGAAGGCCGT GAACTTCGGC ATCGTGTACG GCCAGCAGGC GTTCGGCCTG
TCACAGAGCC TGGGCATCCC GTTCGGCGAG GCCAAGGAGA TGATCGAGCG CTACTTCGAG
GCCTACCCCG GCGTGCGCGC CTACCTCGAC CGCACCATCG CCGAAGCGAA GGAGAAGGGC
TATGCCGAGA CGATGTTCGG CCGCAAGCGC CATATCCCCG AGCTCAAGGC CGCGAACGCC
ACGCAGCGCG GCTTCGGCGA GCGCACGGCC ATGAACCACC CCATGCAGGG CAGCGCGGCC
GACATCATCA AGCTGGCCAT GACCGAGGTG CAGCGCCGCA TCATGGAGCG CGGCTTCGAA
GCGAAGCTGC TGTTGCAGGT GCACGACGAG CTGGACTTCA GCGTGCCGGA AGGCGAGATC
GAGGAGCTGT CGGCCATCGT GAAAGACGTG ATGGAGCACA TCGTGGATCT GCGCGTGCCT
CTCGACGTCG ACGTCTCCTA TGCCGATAAC TGGGCGGAAG CGCACTGA
 
Protein sequence
MPKKIAVIDG NSLMHRAYHA VPQTMNAPDG RPTNAVFGFV AMLLKFIDIA NPDALICAFD 
AGRPAFRMEA LEQYKAQRPP MDDDLKVQFP IVEELLEAMN VPVVRIKGWE GDDVLGTIAA
RDEELGYETL LVTGDKDAYQ LATDKTRIVT TKKGITDVAI YGPAEVLERY GVRPDQFIDF
LGLKGDSSDN IPGVPGIGDK TAAKLLQTYG NLEGIYEHVD DLKGKQKEKI VDNKDMAYLS
REVATIVRDL DFPLDLEACS FPSFDSDKVT EAFKSVQFNA HLSRVLKLVG KELEKKAAPL
VVEPVVSGPE AQALVDAAIA RGETVGVAFI EPEQVSLFNA GLHGAVNTSE GTAVFEDDEG
REAFARIVRA GSFAALDVKR EVHRIYPADT AEAALVEDAE LMGMRAFDLG LAGYVLNSSV
SEYSFDALLD AYYGGVLPET KDEAGAVAAQ AAAARMLVGP LTDALGRDES KRAYFDIDLP
LVAVLAIVER TGAAVDCDRL AELGATTQAE LDELRARIIE IAGEEFNLDS PKQLAHILFE
VLGLRTLRKN QRGYSTDAAV LKELSNDHEL PALVLRYREL AKIKSTYIDA LPRMRADDGR
VHTSFNETVT TTGRLSSSEP NLQNIPVRTE FGRQIRECFV PLEEGHAFLS ADYSQIELRL
LAHLSNDEHL VAAFCSGADF HAATASRVFG LPVEDVSPEL RSRAKAVNFG IVYGQQAFGL
SQSLGIPFGE AKEMIERYFE AYPGVRAYLD RTIAEAKEKG YAETMFGRKR HIPELKAANA
TQRGFGERTA MNHPMQGSAA DIIKLAMTEV QRRIMERGFE AKLLLQVHDE LDFSVPEGEI
EELSAIVKDV MEHIVDLRVP LDVDVSYADN WAEAH