Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1555 |
Symbol | |
ID | 8415853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1847982 |
End bp | 1849250 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645024523 |
Product | DNA-directed DNA polymerase |
Protein accession | YP_003181912 |
Protein GI | 257791306 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.558973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0210398 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCTCG AGCCGTGGGA GGGCCCGGCC ATCCTGCTCG TGGACCTCGA CGCGTTCTTC GCCTCGGTCG AGCAGCTGGA CCATCCCGCC TGGCGCGGCA AGCCCGTCAT CGTGGGCGGC GACGCCGACA AGCACGGCGT GGTGTCCACG GCGTCCTACG AGGCGCGCCC GTACGGCGTG CGAAGCGCCA TGCCCTCTTC CACGGCGAAG CGCCTGTGCC CTCACGCCAT CTGGACGCAC GGCCGCTTCG ACCGCTACCG CGAGATGTCG AACGCCATCA TGGACATACT GCGCGCCGAA ACGCCGCACG TGCAGCAGGT CAGCATCGAC GAGGCGTTCA TGGACGTGTC TCCCACGTCG GTGAACCGAG AGCATCCCGT GCGCGTCGCG CAGCGTATCC AGCAGCGCGT CGAGGAGCTG GGCGTCACGT GCTCCATCGG GGTGGGCACG TCGAAGACCG TCGCGAAGAT AGCCTCGGAC ATGGACAAGC CGCGCGGGCT GACCGTGGTG TACCCCGGAG GCGAGCGGGA TTTCCTCGCG CCCCTGCCCG TGCGCACGAT GAGCGGCATC GGAGCCGCAG CCGAGGAGAA GCTGCACTCC CGCGGCATAC GCACCCTCGG CCAGCTGGCC GATGCGGACG AAGGAATGCT GCTGCGCGCG TTCGGCAAGA ACGGCCGCGT CATGCACGTG CGGGCGAACG GGGGCGACGA CGCGCCCGTG GAGCAGGACG ACACGGTGAA ATCGGTGTCG AACGAGATGA CGTTCGCGGT GGACCTCACC ACGCGCGAAG ACGTCGAGGG CGCCATCGCC ACCATCGCCG CGAAGGTGGG GCGCCGCCTG CGACGCAAGG GGCTGCGCGG CCGCACCCTG GGCCTGCGCA TGCGCTACGA CGACCGCAGC GTGCGCTCGG TGCAGCGTCA GCTGCCGGCG CCCAGCGACG ACGAGCTTTC CTACACGCCG CTTCTGTACC GCATGGCCGA CGAGCTGTGG CGCCCCGGCA TGCCCGTGCG CCTCATCGGC GTGGCGATGA CCGGGTTCGG CGGAGGCGAA AGCGTGCAGG ACAGCCTGTT CGACATCGCC GAGGCTGCTC CCAGCGATGA CGATGTGGAC CCCGTCATCA AAGACGAGGC GAAGCGGCGC GGACTCATCG AGGCAACCGA CCTCGTCAAG GACAAGTTCG GCGAATCCGC GGTCCGCTTC GGCCGCGAGC TGCGAGGCGA GGGCAACACC ACCGGCTCCG CCAGCAAGAA CCCGGCCGAC TACAAGTAG
|
Protein sequence | MPLEPWEGPA ILLVDLDAFF ASVEQLDHPA WRGKPVIVGG DADKHGVVST ASYEARPYGV RSAMPSSTAK RLCPHAIWTH GRFDRYREMS NAIMDILRAE TPHVQQVSID EAFMDVSPTS VNREHPVRVA QRIQQRVEEL GVTCSIGVGT SKTVAKIASD MDKPRGLTVV YPGGERDFLA PLPVRTMSGI GAAAEEKLHS RGIRTLGQLA DADEGMLLRA FGKNGRVMHV RANGGDDAPV EQDDTVKSVS NEMTFAVDLT TREDVEGAIA TIAAKVGRRL RRKGLRGRTL GLRMRYDDRS VRSVQRQLPA PSDDELSYTP LLYRMADELW RPGMPVRLIG VAMTGFGGGE SVQDSLFDIA EAAPSDDDVD PVIKDEAKRR GLIEATDLVK DKFGESAVRF GRELRGEGNT TGSASKNPAD YK
|
| |