Gene Elen_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1050 
Symbol 
ID8415340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1267027 
End bp1270488 
Gene Length3462 bp 
Protein Length1153 aa 
Translation table11 
GC content63% 
IMG OID645024013 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_003181410 
Protein GI257790804 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000216104 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.00986832 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCATTCG TCCACCTGCA CAACCATACC GAATACTCCC TGCTCGACGG CCACACCCAC 
ATCTACGACA TGGTGAAACG GGCCGCCGAC CTCGACATGC CCGCGGTGGC CATCTCGGAT
CACGGCGTGA TGTCGGGCGT GCCGCAGCTG TGTGAGATGT GCGACAAGGT GGAAGCCGAG
ACGGGCAAGC GCGTGAAGCC CATCTACGGC TGCGAGGTGT ACTTCACCAC CGACGAAGAG
CTGCGCAAGG ACACGAAGCC GAAGCTGTAC CACCTGCTGC TGCTGGCCAA GACGAACGAG
GGCTACCACA ACCTGGTGAA GCTGGTCAGC GAGTCGCACG TGGACAACTT CTACTACAAG
CCGCGCACCA CCTTCAGCAT GCTGCAGAAG TACGGCAAGG GCATCATCGG CTCGTCGGCT
TGCATCGCCG GCATCATTCC GAAGCTGCTC GACAACCGAC AGGTAGACGA GGCGGTCGAA
TGGGCCAAGA AGTTCGCCAG CTGCTTCGAA CCGGGCGATT TCTACATCGA GCTGCAGAAC
CAGGGCATCC GCACCGACGC CGGGTTCACG CAGACCGAGC TCAACCACAT GCTGACCGAC
GTGGCCAAGG CCGCCGGCCT CAAGACCATC GCCACGAACG ACTTCCACTA TCTCACGCGC
GAGGACGCGC GCGCCCAGGA CTACATGCTG TGCATCGGCA CGGGCGCGGC GTTCAACGAC
GCCAACCGCA TGCGCTTCGA GAACGACCAG TTCTACATGA AGACCGAGGA GGAGATGCGC
GAGGCCCTCA AGGACTTCCC CGAGGCCTGC GACACCACGG TGGAAGTGGC CGAGAAGGTG
AACGTGGTGC TGGAGCGCGA CTCTATCCTT CCGCGTTTCC CGTTGCCCGA GGGCGAGACC
GAGGAAAGCT ACTTCCGCAA GCGCGTGCAG GAGGGCTTGG TCAAGCACTA CGGCAATCCT
GTTCCTCAGG AGGCGCAGGA GCGCGCCGAC TACGAGATGG GCATCATCAT CCAGCAGGGC
TTCCCGGCGT ACTTCCTCAT CGTGCAGGAG TACATCGAAT GGGCGCGCAG CCAGGGCATC
GGCGTGGGTC CGGGTCGCGG TTCGGCTGCA GGCGCCATCG TGGCGTACGC CATGGACATC
ACCGCGCTTG ACCCGCTGTC CAACGGCCTG CTGTTCGAGC GATTCCTGTC GCCCGAGCGC
GTGGAGATGC CCGATATCGA CGTCGACTTC GAGCAGGGCC GTCGCGAAGA AGTGATCAGC
CACATCAAGG ACGTGTACGG CGAGGATCAC GTGTCGCAGG TCATCACGTT CGGCACCCTG
CAGGCCAAGA ACGCCGTGCG CGACGCCGCG CGCGTGCTGG ACTACCCGTA CAGCACCGGA
GACAAGATCA CGAAGATGAT CGGCGACGAG CTGGGCATCA CCATCGACAA GGCGCTGGCC
ACGAACCCCG ACCTCAAGAA GGCCTACGAG ACCGAAGAGG ACGTGAAGGC CGTCATCGAC
GCCGCGCTGT CCATCGAAGG CCACGTGCGC GGCGAGGGCG TGCACGCGTG CGCCACCATC
ATCTGCCGCG ACCCCATGGC CGATCACGTG CCCATGAAGC GCGACACCAA GGGCGGCGGC
ATCATCACCC AGTACGACGG CCATTACACG CCCGAGCTGG GTCTGCTGAA AATGGACTTC
CTCGGCCTGC GCACGCTCGA CGTGCTCACC ATCGCGTGCC GCAACATCGA GCAGCGCTTC
GGCACGAAGG TGATTCCCGA GGACATCCCC ATCGACGACG AGGGCGCCTT CAAGCTCATG
CAGTCCGGCA ACATGGACGG CCTGTTCCAG GTGGAGGGCG CGCTGTACGT CAGCCTGTTC
GCGCGCCTGC CTCCCACGCG CTTCTCCGAC ATCGTCGCCT CGATCGCGCT CAACCGTCCG
GGCCCGCTGG AATCCGGCAT GGTCGAGGAC TACATCAAGG TGGCCAGCGG CAAGACCTCC
ATCCACTACT ACGACGAGCG CCTCCGCCCC ATCCTGGAGG AGACGTACGG CACCATGGTC
TACCAAGAGC AGATCATGCA GATATCCATG GCTATGAGCG GTTTCTCGGC GGGCAAGGCC
GACAAGCTGC GCAAGGCTAT GGGCAAGAAG AAGCTCGACG TCATGCGCGC GTTGCAGGAA
GACTGGAACA CCGGCGCGGT GGAGAACGGC TATCCGCTGG AAATTGCGAA GCAGATCTGG
GAAGACGCGG AGAAATTCGC TAAGTACGCG TTCAACAAAT CGCACTCGGC CGCCTATGCC
ATCCTGGTCA TGCGCACGGC GTACCTGAAG GCGCACTACC CGAACGACTT CATGGCGGCC
GTGCTGTCGT CCTACATGGG CAACACCGAC CGTTTGATCC GTTACATCGC AAGCTGCAAC
CACTCGGGCA TTCCCGTGTT GCCGCCGGAC ATCAACTCGT CCAACGCCGA GTTCACGCCC
ACCGACGAGG GCGTGCGCTT CGGCCTGGTG GGCGTGCGCG GCGTGGGCGC GAACGTGGCC
GAGGCCATCA TCGAGGAGCG CGAGGCGAAC GGGCCGTTCA CGTCGCTGCA CGACTTCGTG
AACCGCCTGG ACGCGAAATG CTACAACCGC AAGACGCTGG AAGCGCTCAT CAAGGGCGGG
GCGTTCGACT CGACGGGCTA CACGCGCAAG CAGCTCATGT ACTTCGTTGA CGAGACGCCG
CTGCTGGAAA GCGCCTCGAA GCGCCAGAAG GATCGTGAGA GCGGCCAGGT GTCGATGTTC
GACCTGTTCG GCGACGATCC CGATTCGGGC TTCGAGGAGG AGGTTCCGGA ACCGGACGGC
GTGGAGTGGC CGAAGCGTCA GCTGTTGACG TTCGAGAAGG AGATCATGAA GATCTACGTC
TCGGACCACC CGCTGCGCCC GTACGAGGGC ACCATCGCGC GTATGACCAA GTTCTCGCTG
GGAGACCTGG CCGAGCGCAC GAAGGAAATC AAGTCCGCGG TGTTCGTGGG CATGATCTCG
AACGTGGTGA CGAAGCTGAC GAAGCGCGGC ACGAAGATGG CCACGTTCAC CTTGGAGGAT
ACGACGGGCC ACGTGGAGTG CATCTGCTTC AAGTACGACG ACAACGCCGA GGCCATCCAG
GAAGACGCCA TCGTGAAGGT GAAGGGCAAG TTCGAGGCGA ACGACCGCGG CAATCAGATC
ATGGCGTTCG AGGTGGAGGT CATCGAGCTG AACGAGGCCG ATGCGCGTCC GTCGCACCTC
GAGCTGAAAG TTGCGTCCTC GGACTTCGAC CAGTCGAAGT CGCTTCGGCT CAACCGCATC
TTGAAGTCGT ATCCGGGTCG CGACGGCGTG GTTCTGCTCG TGCAGCAGAG CGATGGCCGC
AAGTTCCGCG CCGAACTGCC CGTGTCGGTG GATTCCCGCA GCCCCGTCAT GCGTTCCGAG
ATCCAGGATC TGTTCGGTTC GCAGGTGTGG AGGGCTTCTT GA
 
Protein sequence
MAFVHLHNHT EYSLLDGHTH IYDMVKRAAD LDMPAVAISD HGVMSGVPQL CEMCDKVEAE 
TGKRVKPIYG CEVYFTTDEE LRKDTKPKLY HLLLLAKTNE GYHNLVKLVS ESHVDNFYYK
PRTTFSMLQK YGKGIIGSSA CIAGIIPKLL DNRQVDEAVE WAKKFASCFE PGDFYIELQN
QGIRTDAGFT QTELNHMLTD VAKAAGLKTI ATNDFHYLTR EDARAQDYML CIGTGAAFND
ANRMRFENDQ FYMKTEEEMR EALKDFPEAC DTTVEVAEKV NVVLERDSIL PRFPLPEGET
EESYFRKRVQ EGLVKHYGNP VPQEAQERAD YEMGIIIQQG FPAYFLIVQE YIEWARSQGI
GVGPGRGSAA GAIVAYAMDI TALDPLSNGL LFERFLSPER VEMPDIDVDF EQGRREEVIS
HIKDVYGEDH VSQVITFGTL QAKNAVRDAA RVLDYPYSTG DKITKMIGDE LGITIDKALA
TNPDLKKAYE TEEDVKAVID AALSIEGHVR GEGVHACATI ICRDPMADHV PMKRDTKGGG
IITQYDGHYT PELGLLKMDF LGLRTLDVLT IACRNIEQRF GTKVIPEDIP IDDEGAFKLM
QSGNMDGLFQ VEGALYVSLF ARLPPTRFSD IVASIALNRP GPLESGMVED YIKVASGKTS
IHYYDERLRP ILEETYGTMV YQEQIMQISM AMSGFSAGKA DKLRKAMGKK KLDVMRALQE
DWNTGAVENG YPLEIAKQIW EDAEKFAKYA FNKSHSAAYA ILVMRTAYLK AHYPNDFMAA
VLSSYMGNTD RLIRYIASCN HSGIPVLPPD INSSNAEFTP TDEGVRFGLV GVRGVGANVA
EAIIEEREAN GPFTSLHDFV NRLDAKCYNR KTLEALIKGG AFDSTGYTRK QLMYFVDETP
LLESASKRQK DRESGQVSMF DLFGDDPDSG FEEEVPEPDG VEWPKRQLLT FEKEIMKIYV
SDHPLRPYEG TIARMTKFSL GDLAERTKEI KSAVFVGMIS NVVTKLTKRG TKMATFTLED
TTGHVECICF KYDDNAEAIQ EDAIVKVKGK FEANDRGNQI MAFEVEVIEL NEADARPSHL
ELKVASSDFD QSKSLRLNRI LKSYPGRDGV VLLVQQSDGR KFRAELPVSV DSRSPVMRSE
IQDLFGSQVW RAS