Gene Moth_1045 details

Gene Information

Locus tag: Moth_1045
Symbol:
ID: 3831851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1073699
End bp: 1077343
Gene Length: 3645 bp
Protein Length: 1214 aa
Translation table: 11
GC content: 59%
IMG OID: 637828973
Product: DNA polymerase III, alpha subunit
Protein accession: YP_429902
Protein GI: 83589893
COG category: [L] Replication, recombination and repair
COG ID: [COG2176] DNA polymerase III, alpha subunit (gram-positive type)
TIGRFAM ID: [TIGR01405] DNA polymerase III, alpha chain, Gram-positive type
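
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1073699, 1077343)
print(length)                              # 3645
print(expected_protein_length_aa(length))  # 1214
```

Both results agree with the Gene Length and Protein Length fields listed in the record.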


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0186954
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000541415
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGTCTACAA AGGAAAACAT CTGGCTGCAA CTACTTCAGG GCGGGGCGGT GCGCAAAGTA 
GAGATTACCC GCCGGGAGGG CAGGTGTTGC CTCTGGCTCT GTCCACCCCA TCCCCTGGAG
CCGGAAGACA TCGCCGCCTG CACTCGCTTC CTGGCCGGGG AATTTCCCGG CTTTACTTTT
AGCATCAAAA CCCTTGACCA GGCCGACAGC GGCGACCTGG CAGCCCTGAT CCGGGAAAAG
CGGGGTGCGA TCCTGGACCA GGTGGCCGGT ATCCTTGGAA ACGGCAGTTC TGCCTGGCTG
GCCGGCGCCA GGCTGGAACT GCAGGGCCGG CACCTGAAGC TGGTTTTGCC GGCTTCCCTG
GCCGTCGAGG CCCTGAAGGC CCGCCGGGGG GATGTGGTTT TACAGGAAGT ACTGGGCCAT
TACGGTCATC AGGTGACTGT CGACCTGGTG TCAGATAAGG ATTACCAGGA AGAACTCCTC
TCGGCTTCCC GCCAGCTCCA GGCCCGGCAA CTGGAAGGGG TGCGGCGGGA AGTCCCTGCC
GAGGTTAAAA CACCGGCTGG CACCAAGAGT ACTTCCACTG ACGGCCAGTT GCTGGGTAAA
AAAATCACCG CTGCTCCCCG GCCCCTGAAG GATGTCCAGG AAGAGGAGAA GCAGGTGGCC
GTCCAGGGCG AGGTGTTAAA GTTTGAGTCC CGCCAGCTAA AATCGGGCCG CTGGTTGATT
ACCTTCGATA TAACCGATTA CACCGATTCC CTGACGGTGA AAGCCTTTGT TGATAAAGGC
CCCTTAATTG AAGGTGGCCT GGCAGAGGGG GACTGGGTCC TGGTCCAGGG GCAGGCCCAG
TATGATCGCT ACAGTCAGGA GTTAATACTT CTGGCCGACG CTGTGGCCCG CGGCCAGCGG
CCCACCCGGG AAGACAGGGC AGCGGAGAAA AGGGTGGAGT TGCATTTACA TACCAAAATG
AGCGCCATGG ACGGTGTTAC TGAGGTGGCC GCGGTGGTCC AGCAGGCAGC CCGTTGGGGC
CACGGAGCGG TAGCCATCAC CGACCACGGG GTGGTCCAGG CCTTCCCGGC AGCGGCTGAG
GCCGGCCGCA AGTATGGCGT CAAGATCATC TACGGGGTGG AAGGATATCT CTTCGACCAG
GATAACCAGC CTCCCGACCA CCAGCGTACC TATCACATTA TTATACTGGT TAAAAATAAG
CAGGGCCTGG CCAACCTTTA CCGCCTTATA TCCCGGGCTC ACCTGGATTT CTTTTATCGC
AAGCCCCGCC TGCCGCGCCA CCTCATCCAG GAGTACCGCG AAGGGCTCCT CTTAGGCACG
GCCTGCGAAG CCGGCGAGTT GATCCGGGGT TACCTGGCCG GGGCCGATCA GACCCGCCTG
GAAGAGATCG CTTCTTTCTA CGACTTCCTG GAAATCCAGC CCCTGGCCAA TAATGACTTT
TTAATTCGCC AGGGACAGGT GGCCGACCGC CAGGCCCTCA TGGATATGAA CCGCCAGATT
ATCGCCCTAG GACAAAAATT GGGGAAGCCA GTGGTGGCCA CCGGGGACGT CCATTTCCTT
AACCCTGAGG ACGCCATCTA CCGCCAGATC CTCCTGGCCG GCCAGGGCTA TGCCGACGAA
GTCCAGGCCC CCCTGTACTA CCGGACTACG GAGGAGATGC TGGCCGAATT TGACTATCTT
GACGGGGAAA CGGCCCACCA GGTAGTTATA ACCAACCCCC GGTTGATAGC GGAGCAGGTG
GAAGAGTTAA AGCCCATTCC CGACGAATTC TACCCGCCGG AGATCCCGGG AGCAGAAGAA
GAGTTAACTC GCATCGTAAC TACCCGGGCC AAGGAGTGGT ACGGCGACCC CCTGCCGGAA
ATAGTTCGGG CCCGCCTGGA CAAAGAAATG CAGGCCATTA TCGGCCACGG CTTTGCCGTC
CTGTACCTCA TAGCCCACAA GCTGGTGCAC AAGTCCAACG AAGACGGCTA TCTGGTTGGC
TCCCGGGGTT CGGTGGGGTC CTCCCTGGTG GCGACCATGG CCGGGATTAC CGAGGTTAAC
CCCCTGCCGC CCCATTACCG CTGCCCATCC TGCCGTTACA GCGAGTTTAT TAGCGACGGC
AGCGCCAAGT GCGGTGCCGA TCTCCCGGCC AGGGATTGTC CCCGATGCGG GACCAGGCTC
TTGAAGGACG GCCACGATAT CCCCTTTGAA GTCTTCCTGG GCTTTAAAGG GGACAAGGTA
CCCGATATTG ATCTCAACTT TTCCGGGGAA TATCAACCCC GGGCCCATCA GTATACTGAG
ACCATTTTCG GTAAAGACCA TGTTTTCCGG GCCGGGACTA TCGCTACCCT GGCGGAGCGA
ACGGCCTTCG GTTTTGTCCA TAAGTACCTG GAGGAACGGG GCCTTAAAGC TCGACAGGCG
GAAATAAACT GCCTTGTAAA GGGCATCACC GGCGTCAAAA GGACTACCGG CCAGCATCCA
GGCGGATTGA TGGTAGTTCC CAAAGGAGTG GACATGCACC TCTTCACACC CCTCCAGCAC
CCGGCCGACG ATACCGGCAG CGGCACCATC ACCACCCACT TTGATTACCA CTCCATCAGC
AGCCGCCTGG TTAAATTAGA CCTCCTGGGC CACGACGACC CGACGGTCAT TAAAATGCTG
GAAGATCTCA CCGGTGTCAA TGCCAGGGAG ATCCCCCTGG ACGAGCCCCG GACCATGTCC
CTTTTTTCCA GCGTCGAGGC CCTTGGCATC AGGCCGGAAG ACATCGGCTC CCAGGTGGGG
ACCTTAGGCA TCCCCGAATT TGGCACCCGC TTTGTTCGCC AGATGCTGGA GGATACCAGG
CCGCGGACCT TTGCCGAGCT GGTACGCATC AGTGGTTTTT CCCACGGTAC CGATGTTTGG
TTAAACAACG CCCAGGATCT GATTAAAAGC GGCACGGCGA AACTGAGCGA AGCCATCTCT
ACCCGGGACG ACATCATGAA CTACCTCATG CAGCACGGTG TGGTGGCCGA CATCGCCTTC
CGCACCATGG AGGATGTGCG TAAGGGTAAA GGAGTGAAAA AAGAGTACGA AGAAGCCATC
CGGGCCGCGG GCGTACCCGA GTGGTTCATC CAGTCCTGCA AAAAAATCAG CTACCTTTTC
CCCAAGGCTC ACGCTGTGGC TTACGTGACC ATGGCCTTCC GCATCGCTTA TTTTAAGGTC
TATTACCCGG AGGCCTTCTA TGCATCCTTT TTCAGCATCC GCGCCGACGA ATTCGACGCC
GACGTGGTTG CCGCCGGGTT GCCCCGCATT CAGGAAGAGA TTGCCGCCCT GGAGCGCAAA
GGCAACGAAG CTACCGCCAG GGAGAAGAAC ATCCTTACCA TCCTGGAGGT AGCCAGGGAG
ATGTATTGCC GGGGCATCAC CCTGGAACGT ATCGACCTGC AAAAGGCGGA CGCCAGCCGT
TTTCTGGTGG AGCCGGGTAA GCTCTTGCCA CCCCTGGCGG CCTTGCCCGG GGTTGGCCGG
GCGGCGGCCG AGGCCATCGT CCGCGCCCGC CAGGAACGCC CCTTTACTTC CGTCGAAGAT
TTGCAGTACC GTTCCCGGGT CAGCAAGACG GTCATCGAAG CCCTGGAAAA GCACGGCGCC
CTGGCGGATC TACCGCCCTC GGACCAACTG GTGTTTTTCG GGTAA
 
Protein sequence
MSTKENIWLQ LLQGGAVRKV EITRREGRCC LWLCPPHPLE PEDIAACTRF LAGEFPGFTF 
SIKTLDQADS GDLAALIREK RGAILDQVAG ILGNGSSAWL AGARLELQGR HLKLVLPASL
AVEALKARRG DVVLQEVLGH YGHQVTVDLV SDKDYQEELL SASRQLQARQ LEGVRREVPA
EVKTPAGTKS TSTDGQLLGK KITAAPRPLK DVQEEEKQVA VQGEVLKFES RQLKSGRWLI
TFDITDYTDS LTVKAFVDKG PLIEGGLAEG DWVLVQGQAQ YDRYSQELIL LADAVARGQR
PTREDRAAEK RVELHLHTKM SAMDGVTEVA AVVQQAARWG HGAVAITDHG VVQAFPAAAE
AGRKYGVKII YGVEGYLFDQ DNQPPDHQRT YHIIILVKNK QGLANLYRLI SRAHLDFFYR
KPRLPRHLIQ EYREGLLLGT ACEAGELIRG YLAGADQTRL EEIASFYDFL EIQPLANNDF
LIRQGQVADR QALMDMNRQI IALGQKLGKP VVATGDVHFL NPEDAIYRQI LLAGQGYADE
VQAPLYYRTT EEMLAEFDYL DGETAHQVVI TNPRLIAEQV EELKPIPDEF YPPEIPGAEE
ELTRIVTTRA KEWYGDPLPE IVRARLDKEM QAIIGHGFAV LYLIAHKLVH KSNEDGYLVG
SRGSVGSSLV ATMAGITEVN PLPPHYRCPS CRYSEFISDG SAKCGADLPA RDCPRCGTRL
LKDGHDIPFE VFLGFKGDKV PDIDLNFSGE YQPRAHQYTE TIFGKDHVFR AGTIATLAER
TAFGFVHKYL EERGLKARQA EINCLVKGIT GVKRTTGQHP GGLMVVPKGV DMHLFTPLQH
PADDTGSGTI TTHFDYHSIS SRLVKLDLLG HDDPTVIKML EDLTGVNARE IPLDEPRTMS
LFSSVEALGI RPEDIGSQVG TLGIPEFGTR FVRQMLEDTR PRTFAELVRI SGFSHGTDVW
LNNAQDLIKS GTAKLSEAIS TRDDIMNYLM QHGVVADIAF RTMEDVRKGK GVKKEYEEAI
RAAGVPEWFI QSCKKISYLF PKAHAVAYVT MAFRIAYFKV YYPEAFYASF FSIRADEFDA
DVVAAGLPRI QEEIAALERK GNEATAREKN ILTILEVARE MYCRGITLER IDLQKADASR
FLVEPGKLLP PLAALPGVGR AAAEAIVRAR QERPFTSVED LQYRSRVSKT VIEALEKHGA
LADLPPSDQL VFFG
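
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first block of the gene. This is a minimal sketch, not the pipeline that produced the record: it assumes the amino-acid assignments of NCBI translation table 11, which match the standard code, and it ignores alternative start codons. The `translate` helper is hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with the amino-acid string
# shared by translation tables 1 and 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCTACAA AGGAAAACAT CTGGCTGCAA"))  # MSTKENIWLQ
```

The output matches the first ten residues of the protein sequence listed above.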