Gene Information |
Locus tag | Moth_1841 |
Symbol | |
ID | 3831701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1897551 |
End bp | 1900208 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637829772 |
Product | DNA polymerase I |
Protein accession | YP_430684 |
Protein GI | 83590675 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
| |
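The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes that `Start bp` and `End bp` are 1-based, inclusive positions and that the CDS ends in a stop codon not counted in the protein length. Both assumptions agree with the values listed above, but the record itself does not state them. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


# Values taken from the record for Moth_1841.
length = gene_length(1897551, 1900208)
print(length, protein_length(length))
```

With the record's coordinates this reproduces the listed gene length of 2658 bp and protein length of 885 aa.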
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0941024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCACCA AAACCGCTAG ACTTCTCCTG GTCGACGGCA ACAGCGTCAT TCACCGGGCT TTTCATGCCC TGCCCCCCCT GCAGACCAGG GAAGGTATTC GCACCAATGC CGTCTACGGC TTTGCAACCA TGTTGCAAAA GGCCAGGGAG ATGTTTAAAC CCGATTATAT TATTGTCGCC TTTGACCATA GTAAGGTTAC CTTTCGTAAC GAACTGTACG ATGAATATAA AGGAACCCGG CCGGAGACCG ACCCTGAGCT CAGGCCCCAG TTTGCCCTGG TCAAACGCCT CCTGGCAGCC TGGAACCTGG CCAGTTGTGA GGTTGAGGGC TACGAAGCCG ACGACCTTAT CGGCACCTTA AGCCGCCAGG GCGCCGCCAC GGGCCTGGAA GTCCTCATCC TCACCGGGGA CCGCGACGCC CTGCAACTAG TGGGAGAACG GGTGAAGGTC CTCCTTATGC GGCGCGGCCT TTCCCAGGTA GAGGTCATCG ACCGGGAAGC AATCAAGAAA AACTATGGCC TGGAACCGGA GCAGCTCATT GACGTCAAGG CCCTGATGGG CGACGCCTCG GATAATATTC CCGGCGTACC GGGGGTGGGG GAGAAAACGG CCGTCCAGCT CGTCCGCCAG TACGGCGACC TGGAAGGAGT CCTGGCCCAC AGCGGGGAGA TAAAAGGACG CCGGGTAGCG GAGAACCTGG TGACCTTCGC CGACCAGGCG CGCCTGGCCC GGCGCCTGGC CACAATTGAC TGCCAGGCTC CTGTGACCCT CGACCTGGCA GGGTGCTGCA ACCAGTCGCC GGACTACGAG GCCGTCCTGG CCCTTTATAA AGAACTGGAG TTCCACAGTC TGGTCAAGGA CGTCCTCAGG GCCATGGAAC AGGAAGGCAA GAAGGCCTCG CAAGAAACGA CTGCTGTCCG GGGACTGTCG CTGCCGGACC CACTCACCCT GGACAGCCTG GAAGAACTGG CGGAACTGGT GGCCAGGTTG GTCGGGAAGA CGGATGTGGC CCTGGAATTG ATTCTTAATA ATCCCAGCTA CCTGGAGGCC GCAGCCGTCG CCGTAGGGCT GGCCTGGGAG GATGGGGTGG CGGTCCTGGG TACAGCCGGG ATAGAGCCTG CCGCCCTGGC CGGTACCCTG GAACCATTGC TCCGGGTCAA CCCCATCTTC CACGACGCCA AAAGGGCCCT GGTCTGGTTC AGCAATGCCG GGGCGGGGGT TGCCGACCCC GGCGGCGATA CTATGGTGGC CGGTTATCTC TTAAACCCCT CGGCCTCGCG CCATGACCTG CCCGAACTCT GCCTGGAACA CTTGAACCTG GCCCTGGTGG AGGGGGATTC CCCGCAACTG GCAGCGGCCC GGCGGGCCGC TGTCATCAGG TTGCTCCACC GGGAACTGGC CAGTAAACTC CAGGTTGCCG GGATGGAGAA CCTCTACCGG CGGGTGGAGT TGCCCCTGAC CCGGGTCCTA GGGGCCATGG AAAGCTACGG CGTGGCCGTC AACATGGAAA CCCTGGATCT CATGGGAATA GAGCTGGAGG GTGGGCTGGC CGCCCTCACA GAGGCCATCT ACGAGCTGGC CGGGGAAGAG TTTAACCTGA ATTCCCCCAA GCAGCTGGCC GTTATACTCT TTGAAAAGCT CGGACTGCCG CCGGTAAAGC GTACCAAGAC CGGTTTTTCT ACCGATGCCG CCGTCCTGGA GGAACTGGCC TGCAGGCACC CCATAGCCGC CAAACTGGTC GAGTACCGCC AGCTGGCCAA ACTGAAATCG ACCTATGTGG ATGGTCTTAA ACCCCTGGTC 
AATCCCCGCA CGGGGAGCCT GCATACCAGC TTTAACCAGA CGGTGACGGC CACCGGCCGC CTCTCCAGCA GCGAGCCCAA TCTCCAGAAT ATCCCGGTGC GCCTGGAACT GGGCCGGCGC CTGCGCAAGG CCTTTGTACC CCACGGTCCC GGCCGGTTGC TCCTGGCCGC CGACTACTCC CAGATCGAGC TGCGCATCCT GGCCCATATT TCCGGCGATG AAGCCATGAT TGCAGCCTTT CGCCGGGGGG AGGATATCCA CGCCCGGACT GCGGCCGAGG TCTTCGGCGT CCCCCTTGGT GAAGTGACGC CGGCTATGCG CCGCAGCGCC AAGGCGGTGA ACTTCGGTAT TGTCTACGGC ATCAGCGACT ACGGCCTGAG CCGGGATCTG GGGATAAGCC GCAGCGAAGC CCACGATTAT ATCGAACGTT ACTTTCGGCG TTACCGGGGC GTCAAGGCCT ATCTGGAGGA GATCGTAGCC CGGGCGCGGC AGGAGGGCTA TGTCACCACC CTCCTGGGCC GCCGCCGTTA TCTGCCGGAC CTCTTTAGCT CCAACCGCAA TGTCCGCAGC TTTGGCGAGC GCACGGCCAT GAATACGCCT ATCCAGGGCA CGGCCGCCGA CATCATCAAG ATGGCCATGG TGAAAATCTT TCGCCTCCTG GAGGCACAAT ACCCGGCCGC GCGTATGATC CTCCAGGTCC ACGACGAACT CATCTTTGAT GTCCCGGATG ACGACCTGCC GGCCGTGGCC GGCCTGGTCA AGGATACCAT GGAGCATACC CTGGAACTCC AGGTCCCCCT CCAGGTAGAT TTAAAGGCCG GGCCCAACTG GTATGACCTG GAGCCGTATA AGGAGTAA
|
Protein sequence | MPTKTARLLL VDGNSVIHRA FHALPPLQTR EGIRTNAVYG FATMLQKARE MFKPDYIIVA FDHSKVTFRN ELYDEYKGTR PETDPELRPQ FALVKRLLAA WNLASCEVEG YEADDLIGTL SRQGAATGLE VLILTGDRDA LQLVGERVKV LLMRRGLSQV EVIDREAIKK NYGLEPEQLI DVKALMGDAS DNIPGVPGVG EKTAVQLVRQ YGDLEGVLAH SGEIKGRRVA ENLVTFADQA RLARRLATID CQAPVTLDLA GCCNQSPDYE AVLALYKELE FHSLVKDVLR AMEQEGKKAS QETTAVRGLS LPDPLTLDSL EELAELVARL VGKTDVALEL ILNNPSYLEA AAVAVGLAWE DGVAVLGTAG IEPAALAGTL EPLLRVNPIF HDAKRALVWF SNAGAGVADP GGDTMVAGYL LNPSASRHDL PELCLEHLNL ALVEGDSPQL AAARRAAVIR LLHRELASKL QVAGMENLYR RVELPLTRVL GAMESYGVAV NMETLDLMGI ELEGGLAALT EAIYELAGEE FNLNSPKQLA VILFEKLGLP PVKRTKTGFS TDAAVLEELA CRHPIAAKLV EYRQLAKLKS TYVDGLKPLV NPRTGSLHTS FNQTVTATGR LSSSEPNLQN IPVRLELGRR LRKAFVPHGP GRLLLAADYS QIELRILAHI SGDEAMIAAF RRGEDIHART AAEVFGVPLG EVTPAMRRSA KAVNFGIVYG ISDYGLSRDL GISRSEAHDY IERYFRRYRG VKAYLEEIVA RARQEGYVTT LLGRRRYLPD LFSSNRNVRS FGERTAMNTP IQGTAADIIK MAMVKIFRLL EAQYPAARMI LQVHDELIFD VPDDDLPAVA GLVKDTMEHT LELQVPLQVD LKAGPNWYDL EPYKE
|
| |
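The sequences above are shown in space-separated blocks of ten, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute a GC fraction, and take a reverse complement. The gene sequence starts with ATG and ends with TAA, which suggests it is displayed on the coding strand; since the record gives the strand as `-`, mapping it back to the replicon's forward strand would then need a reverse complement. That reading is an assumption, and the helper names are hypothetical rather than part of any database API.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Translation map for unambiguous DNA bases only (no IUPAC ambiguity codes).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a cleaned DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two blocks of the gene sequence, used here only as a short example.
fragment = clean_sequence("ATGCCCACCA AAACCGCTAG")
print(len(fragment), gc_fraction(fragment), reverse_complement(fragment[:10]))
```

Applied to the full gene sequence, `gc_fraction` gives a value that can be compared with the listed GC content of 62%, assuming that figure refers to this gene rather than to the whole replicon.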