Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1045 |
Symbol | |
ID | 3831851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1073699 |
End bp | 1077343 |
Gene Length | 3645 bp |
Protein Length | 1214 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637828973 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_429902 |
Protein GI | 83589893 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2176] DNA polymerase III, alpha subunit (gram-positive type) |
TIGRFAM ID | [TIGR01405] DNA polymerase III, alpha chain, Gram-positive type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0186954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000541415 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTCTACAA AGGAAAACAT CTGGCTGCAA CTACTTCAGG GCGGGGCGGT GCGCAAAGTA GAGATTACCC GCCGGGAGGG CAGGTGTTGC CTCTGGCTCT GTCCACCCCA TCCCCTGGAG CCGGAAGACA TCGCCGCCTG CACTCGCTTC CTGGCCGGGG AATTTCCCGG CTTTACTTTT AGCATCAAAA CCCTTGACCA GGCCGACAGC GGCGACCTGG CAGCCCTGAT CCGGGAAAAG CGGGGTGCGA TCCTGGACCA GGTGGCCGGT ATCCTTGGAA ACGGCAGTTC TGCCTGGCTG GCCGGCGCCA GGCTGGAACT GCAGGGCCGG CACCTGAAGC TGGTTTTGCC GGCTTCCCTG GCCGTCGAGG CCCTGAAGGC CCGCCGGGGG GATGTGGTTT TACAGGAAGT ACTGGGCCAT TACGGTCATC AGGTGACTGT CGACCTGGTG TCAGATAAGG ATTACCAGGA AGAACTCCTC TCGGCTTCCC GCCAGCTCCA GGCCCGGCAA CTGGAAGGGG TGCGGCGGGA AGTCCCTGCC GAGGTTAAAA CACCGGCTGG CACCAAGAGT ACTTCCACTG ACGGCCAGTT GCTGGGTAAA AAAATCACCG CTGCTCCCCG GCCCCTGAAG GATGTCCAGG AAGAGGAGAA GCAGGTGGCC GTCCAGGGCG AGGTGTTAAA GTTTGAGTCC CGCCAGCTAA AATCGGGCCG CTGGTTGATT ACCTTCGATA TAACCGATTA CACCGATTCC CTGACGGTGA AAGCCTTTGT TGATAAAGGC CCCTTAATTG AAGGTGGCCT GGCAGAGGGG GACTGGGTCC TGGTCCAGGG GCAGGCCCAG TATGATCGCT ACAGTCAGGA GTTAATACTT CTGGCCGACG CTGTGGCCCG CGGCCAGCGG CCCACCCGGG AAGACAGGGC AGCGGAGAAA AGGGTGGAGT TGCATTTACA TACCAAAATG AGCGCCATGG ACGGTGTTAC TGAGGTGGCC GCGGTGGTCC AGCAGGCAGC CCGTTGGGGC CACGGAGCGG TAGCCATCAC CGACCACGGG GTGGTCCAGG CCTTCCCGGC AGCGGCTGAG GCCGGCCGCA AGTATGGCGT CAAGATCATC TACGGGGTGG AAGGATATCT CTTCGACCAG GATAACCAGC CTCCCGACCA CCAGCGTACC TATCACATTA TTATACTGGT TAAAAATAAG CAGGGCCTGG CCAACCTTTA CCGCCTTATA TCCCGGGCTC ACCTGGATTT CTTTTATCGC AAGCCCCGCC TGCCGCGCCA CCTCATCCAG GAGTACCGCG AAGGGCTCCT CTTAGGCACG GCCTGCGAAG CCGGCGAGTT GATCCGGGGT TACCTGGCCG GGGCCGATCA GACCCGCCTG GAAGAGATCG CTTCTTTCTA CGACTTCCTG GAAATCCAGC CCCTGGCCAA TAATGACTTT TTAATTCGCC AGGGACAGGT GGCCGACCGC CAGGCCCTCA TGGATATGAA CCGCCAGATT ATCGCCCTAG GACAAAAATT GGGGAAGCCA GTGGTGGCCA CCGGGGACGT CCATTTCCTT AACCCTGAGG ACGCCATCTA CCGCCAGATC CTCCTGGCCG GCCAGGGCTA TGCCGACGAA GTCCAGGCCC CCCTGTACTA CCGGACTACG GAGGAGATGC TGGCCGAATT TGACTATCTT GACGGGGAAA CGGCCCACCA GGTAGTTATA ACCAACCCCC GGTTGATAGC GGAGCAGGTG GAAGAGTTAA AGCCCATTCC CGACGAATTC TACCCGCCGG AGATCCCGGG AGCAGAAGAA GAGTTAACTC GCATCGTAAC TACCCGGGCC AAGGAGTGGT ACGGCGACCC CCTGCCGGAA ATAGTTCGGG CCCGCCTGGA CAAAGAAATG CAGGCCATTA TCGGCCACGG CTTTGCCGTC CTGTACCTCA TAGCCCACAA GCTGGTGCAC AAGTCCAACG AAGACGGCTA TCTGGTTGGC TCCCGGGGTT CGGTGGGGTC CTCCCTGGTG GCGACCATGG CCGGGATTAC CGAGGTTAAC CCCCTGCCGC CCCATTACCG CTGCCCATCC TGCCGTTACA GCGAGTTTAT TAGCGACGGC AGCGCCAAGT GCGGTGCCGA TCTCCCGGCC AGGGATTGTC CCCGATGCGG GACCAGGCTC TTGAAGGACG GCCACGATAT CCCCTTTGAA GTCTTCCTGG GCTTTAAAGG GGACAAGGTA CCCGATATTG ATCTCAACTT TTCCGGGGAA TATCAACCCC GGGCCCATCA GTATACTGAG ACCATTTTCG GTAAAGACCA TGTTTTCCGG GCCGGGACTA TCGCTACCCT GGCGGAGCGA ACGGCCTTCG GTTTTGTCCA TAAGTACCTG GAGGAACGGG GCCTTAAAGC TCGACAGGCG GAAATAAACT GCCTTGTAAA GGGCATCACC GGCGTCAAAA GGACTACCGG CCAGCATCCA GGCGGATTGA TGGTAGTTCC CAAAGGAGTG GACATGCACC TCTTCACACC CCTCCAGCAC CCGGCCGACG ATACCGGCAG CGGCACCATC ACCACCCACT TTGATTACCA CTCCATCAGC AGCCGCCTGG TTAAATTAGA CCTCCTGGGC CACGACGACC CGACGGTCAT TAAAATGCTG GAAGATCTCA CCGGTGTCAA TGCCAGGGAG ATCCCCCTGG ACGAGCCCCG GACCATGTCC CTTTTTTCCA GCGTCGAGGC CCTTGGCATC AGGCCGGAAG ACATCGGCTC CCAGGTGGGG ACCTTAGGCA TCCCCGAATT TGGCACCCGC TTTGTTCGCC AGATGCTGGA GGATACCAGG CCGCGGACCT TTGCCGAGCT GGTACGCATC AGTGGTTTTT CCCACGGTAC CGATGTTTGG TTAAACAACG CCCAGGATCT GATTAAAAGC GGCACGGCGA AACTGAGCGA AGCCATCTCT ACCCGGGACG ACATCATGAA CTACCTCATG CAGCACGGTG TGGTGGCCGA CATCGCCTTC CGCACCATGG AGGATGTGCG TAAGGGTAAA GGAGTGAAAA AAGAGTACGA AGAAGCCATC CGGGCCGCGG GCGTACCCGA GTGGTTCATC CAGTCCTGCA AAAAAATCAG CTACCTTTTC CCCAAGGCTC ACGCTGTGGC TTACGTGACC ATGGCCTTCC GCATCGCTTA TTTTAAGGTC TATTACCCGG AGGCCTTCTA TGCATCCTTT TTCAGCATCC GCGCCGACGA ATTCGACGCC GACGTGGTTG CCGCCGGGTT GCCCCGCATT CAGGAAGAGA TTGCCGCCCT GGAGCGCAAA GGCAACGAAG CTACCGCCAG GGAGAAGAAC ATCCTTACCA TCCTGGAGGT AGCCAGGGAG ATGTATTGCC GGGGCATCAC CCTGGAACGT ATCGACCTGC AAAAGGCGGA CGCCAGCCGT TTTCTGGTGG AGCCGGGTAA GCTCTTGCCA CCCCTGGCGG CCTTGCCCGG GGTTGGCCGG GCGGCGGCCG AGGCCATCGT CCGCGCCCGC CAGGAACGCC CCTTTACTTC CGTCGAAGAT TTGCAGTACC GTTCCCGGGT CAGCAAGACG GTCATCGAAG CCCTGGAAAA GCACGGCGCC CTGGCGGATC TACCGCCCTC GGACCAACTG GTGTTTTTCG GGTAA
|
Protein sequence | MSTKENIWLQ LLQGGAVRKV EITRREGRCC LWLCPPHPLE PEDIAACTRF LAGEFPGFTF SIKTLDQADS GDLAALIREK RGAILDQVAG ILGNGSSAWL AGARLELQGR HLKLVLPASL AVEALKARRG DVVLQEVLGH YGHQVTVDLV SDKDYQEELL SASRQLQARQ LEGVRREVPA EVKTPAGTKS TSTDGQLLGK KITAAPRPLK DVQEEEKQVA VQGEVLKFES RQLKSGRWLI TFDITDYTDS LTVKAFVDKG PLIEGGLAEG DWVLVQGQAQ YDRYSQELIL LADAVARGQR PTREDRAAEK RVELHLHTKM SAMDGVTEVA AVVQQAARWG HGAVAITDHG VVQAFPAAAE AGRKYGVKII YGVEGYLFDQ DNQPPDHQRT YHIIILVKNK QGLANLYRLI SRAHLDFFYR KPRLPRHLIQ EYREGLLLGT ACEAGELIRG YLAGADQTRL EEIASFYDFL EIQPLANNDF LIRQGQVADR QALMDMNRQI IALGQKLGKP VVATGDVHFL NPEDAIYRQI LLAGQGYADE VQAPLYYRTT EEMLAEFDYL DGETAHQVVI TNPRLIAEQV EELKPIPDEF YPPEIPGAEE ELTRIVTTRA KEWYGDPLPE IVRARLDKEM QAIIGHGFAV LYLIAHKLVH KSNEDGYLVG SRGSVGSSLV ATMAGITEVN PLPPHYRCPS CRYSEFISDG SAKCGADLPA RDCPRCGTRL LKDGHDIPFE VFLGFKGDKV PDIDLNFSGE YQPRAHQYTE TIFGKDHVFR AGTIATLAER TAFGFVHKYL EERGLKARQA EINCLVKGIT GVKRTTGQHP GGLMVVPKGV DMHLFTPLQH PADDTGSGTI TTHFDYHSIS SRLVKLDLLG HDDPTVIKML EDLTGVNARE IPLDEPRTMS LFSSVEALGI RPEDIGSQVG TLGIPEFGTR FVRQMLEDTR PRTFAELVRI SGFSHGTDVW LNNAQDLIKS GTAKLSEAIS TRDDIMNYLM QHGVVADIAF RTMEDVRKGK GVKKEYEEAI RAAGVPEWFI QSCKKISYLF PKAHAVAYVT MAFRIAYFKV YYPEAFYASF FSIRADEFDA DVVAAGLPRI QEEIAALERK GNEATAREKN ILTILEVARE MYCRGITLER IDLQKADASR FLVEPGKLLP PLAALPGVGR AAAEAIVRAR QERPFTSVED LQYRSRVSKT VIEALEKHGA LADLPPSDQL VFFG
|
| |