Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1872 |
Symbol | dnaE |
ID | 3831503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1933856 |
End bp | 1937326 |
Gene Length | 3471 bp |
Protein Length | 1156 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637829804 |
Product | DNA polymerase III DnaE |
Protein accession | YP_430715 |
Protein GI | 83590706 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCTT TCGTCCACCT CCATGTCCAC AGTGAATACA GCCTCCTGGA TGGAGCCGGC CGCATCAAGG ATCTGGTCCG GGCCGCCGGG GAAATGGGTA TGCCCGCCCT GGCCCTCACG GATCACGGCG TCATGTACGG CGCCGTGGAG TTCTATAAAG CGGCCCGGGA ACAAGGCATC AAGCCCATTA TCGGCTGCGA GGTTTATGTG GCTCCCCGCT CCCGCCACGA CCGGGAGCCC CACCGGGATG ATTACCAGTA CCACCTGGTC CTCCTGGCCA CCGACGCCAC GGGTTACCGC AACCTCACGG CCCTGGTCTC GGCTGCCTTC CTGGAAGGTT TCTATTACAA GCCCCGGGTG GACCGGGAGC TTTTGAGCCG CCACAGCCAG GGTCTCATAG CCCTGAGCGC CTGCCTGGCT GGGGAAGTGC CAGCCCACCT TTTAAAGAAC CGGGAAGATG CGGCCTATGA GGCGGCTACC TGGCTGCGGG AGGTCTTTGG CCCGGAGAAT TTTTACCTGG AACTCCAGGA CCAGGGCCTG CCGGAACAGC GGCAGCTCAA CCGCCGGCTG GTAGACCTGG CAGCCAGATT AAAAATTCCC CTGGTCGCCA CCAATGACGT CCACTACATT CGCCAGGACC AGGCCCGGGC CCATGACGTC CTCCTCTGTA TCCAGACAGG CAAGACCCTG GACGATCCTA ACCGCCTGCG TTTTCCCACG GCCCAGTTTT ATTTAAAATC TCCTGCCGAG ATGGACAACC TCTTCCGGGA GGTACCCTCA GCCCTGGCCA ACACCCTGGC CATCGCCGAG CGCTGCCATT TCGACTTCAC CTTCGGCCGG TTACACCTGC CGGCCTACCA GGTGCCGGCA GGAGAGGACG CAGCCAGTTA TCTCCGCCGC CTGAGTTACG AGGGCCTTAA ACGGCGTTAC CCCCATGACG ATGGAACGGC CCGGCAGCGC CTGGAGTACG AGCTGGGTAT AATCGAGCAA ATGGGCTATC CGGGCTATTT CTTGATAGTT TGGGATATGG TCAATTTCGC CCGCCGGCGG GGGATCCCGG TGGGGCCGGG TCGGGGTTCG GCCGCCGGCA GCCTGGTGGC CTACTGCCTG GGGATAACCT CCATTGATCC CCTGCGCTAC AACCTCCTCT TTGAACGTTT TTTAAACCCC GAACGGGTCA GTATGCCGGA CATCGACATG GACTTCTGCT TCGAACGGCG GGATGAAGTC ATCCAGTATG TCCTGGAGAA GTACGGCCGT GACCATGTGG CCCAGATCAT TACCTTCGGT ACCATGGCCG CCCGGGCGGC CGTCCGGGAT GTGGGCCGTG TCCTGGGTTT GCCCCTGGGT GAGGTGGACA GGATCGCCAA GATGGTGCCC ATGGAGCTGG GTATTACCCT GGAGAGGGCC CTGGCCACCA GCCCGGATTT AAAAGAGAGC TACGAAGGCA ACCGGGAGGT GCGGGAACTC CTGGATACGG CCCGGGCCAT CGAGGGTATG CCCCGCCATG CCTCCATCCA TGCCGCGGGG ATTGTCATCA CCCGGGAGCC TTTAATTCAT TATCTGCCCC TGCAAAAGAC AGGGGATGCC GTTACCACCC AGTACCCCAT GCAGGCTGTG GAGGAATTGG GGCTATTGAA GATGGACCTG CTGGGCCTCC GCACCCTGAC GGTTATCGGC CACGCCCGGG AGGCCATCCG GCGCAACCAT GGCCGGGAAC TGGACCTGGA AAGCCTGCCC CTGGATGACC GGCCAACCTA CCAGCTTCTG GCCAGCGGTG AAACCAGCGG CATCTTCCAG CTAGAAAGTA GCGGCATGCG GGCCATCCTG AAAGAACTCA AGCCGGAGCG CTTTGAAGAT ATCATCGCCC TGGTGGCCCT CTACCGGCCC GGGCCACTGG GAAGCGGTAT GGTCGAAGAT TTTATTAAGC GCAAACACGG GGTTACCCCC ATCAGTTACC TGCACCCGGC CCTGGAACCG ATACTCAAAG ACACCTACGG GGTTATCCTC TACCAGGAGC AGGTCATGCG CATTGCCAGC GAGTTGGCCG GTTTTACCCT GGGCCAGGCC GACCTCCTGC GCCGGGCCAT GGGGAAAAAG AAGCCCGAGG TCCTGGCGGC CCAGCGGGAT CGCTTCCTGG CCGGGGCAGA GGCCAAGGCT ATACCCCGGG ATGTCGCGCA GAAAATTTTT GAGCTGATGG AATACTTCGC CGGCTACGGC TTCAATGCCA GCCATTCGGC GGCCTACGCC CTGGTAGCCT ATCAGACGGC CTACCTGAAG GCCCATTACC CGGCCGAGCT TATGGGCGCC CTTCTTTCCA GCGTGGCCGA GCACCTGGAC AAGATGGAGC CCTACCTGGC TGAGTGCCAG CGCCTGGGGA TCGCCGTCCT GCCGCCGGAT GTCAATGAAT CAGGGGTCGA TTTCACCGTC ACTGGGAGGC AGATCCGCTT TGGCCTGGCG GCCGTTAAAA ACGCTGGCCG GGCCGCCGCC CTGGCCATTA TTGCCGCCCG GGAAGCAGGC GGTCCCTTTA CCTCCCTCCT GGACTTCTGC CGGCGGGTGG ACAGCCGCCA GGCCAATAAA AGGGTGGTGG AGAGCCTCAT CCGCTGCGGC GCCTTTAATT CTATCAACCC CAACCGCCGC CAGCTCCTGG CCGGCCTGGA CGAGTGCCTG GAAGTGGCCG CCAGGCGTCA GGAAGACCGC CGCAGCGGCC AGGTTTCCCT CATGGACCTG GTGCCTGAAG AGGTTAACGA TCCCCCCCTG CCCCAGGTAA CCGACTTCTC CCGGGCCGAT ATACTGGCCA TGGAAAAGGA ACTCCTGGGC TTTTACCTGA GCGGCCATCC CCTGGAACCC TATGCTCCGG TATTAAAAAA GCTGACCTCC CATTCCCTGG CCGATCTGGC GGAGATAGCC GATGGCAGCC AGGTTACCAT CGGCGGCCTG GTCAGCGGCC TGCGCCGGCT GGTAACCCGT AAGGGTGATC CCATGGCCGT TCTCACCCTG GAGGATTTCA GCGGTCAGGT AGAAGCTGTC CTCTTTCCCC GGATGTACAG CCAGGTACGT TCCTGGCTCG CTCCCGACCG GGCCGTCCTG GTCCAGGGGC AGGTCGATAA ACAGGAAGAA GGGGTCCAGG TCCTGGCCAA CCTGGTCCGG CCCCTGGCGC CGGAGGCCGG CGGGGGAGAC CGGACCCTCC CTCCTCCGGC AGGGACCCAC GGGACATTAC CGGAGGCCCG GCAGGGTGGG GGACGTCTCT ACCTCAAGCT ATCTAGTGAG GGACAGCTAA CAGCCCTGAG GGATACTTTA GCCGCCCACC CGGGTCCCTG TCAGGTATAT CTCTATCTGG CCGACAGCCG TAGGACTATC GCCTTACACA GGCGTTATTG GATAGAACCG GTGCCGGAGT TGATGGCAAG TTTGGCCGGG CTTTTAGGCG GGTATGATAA AATTAAGCTG GTGCAGGAAA ATAGAATTTA A
|
Protein sequence | MKSFVHLHVH SEYSLLDGAG RIKDLVRAAG EMGMPALALT DHGVMYGAVE FYKAAREQGI KPIIGCEVYV APRSRHDREP HRDDYQYHLV LLATDATGYR NLTALVSAAF LEGFYYKPRV DRELLSRHSQ GLIALSACLA GEVPAHLLKN REDAAYEAAT WLREVFGPEN FYLELQDQGL PEQRQLNRRL VDLAARLKIP LVATNDVHYI RQDQARAHDV LLCIQTGKTL DDPNRLRFPT AQFYLKSPAE MDNLFREVPS ALANTLAIAE RCHFDFTFGR LHLPAYQVPA GEDAASYLRR LSYEGLKRRY PHDDGTARQR LEYELGIIEQ MGYPGYFLIV WDMVNFARRR GIPVGPGRGS AAGSLVAYCL GITSIDPLRY NLLFERFLNP ERVSMPDIDM DFCFERRDEV IQYVLEKYGR DHVAQIITFG TMAARAAVRD VGRVLGLPLG EVDRIAKMVP MELGITLERA LATSPDLKES YEGNREVREL LDTARAIEGM PRHASIHAAG IVITREPLIH YLPLQKTGDA VTTQYPMQAV EELGLLKMDL LGLRTLTVIG HAREAIRRNH GRELDLESLP LDDRPTYQLL ASGETSGIFQ LESSGMRAIL KELKPERFED IIALVALYRP GPLGSGMVED FIKRKHGVTP ISYLHPALEP ILKDTYGVIL YQEQVMRIAS ELAGFTLGQA DLLRRAMGKK KPEVLAAQRD RFLAGAEAKA IPRDVAQKIF ELMEYFAGYG FNASHSAAYA LVAYQTAYLK AHYPAELMGA LLSSVAEHLD KMEPYLAECQ RLGIAVLPPD VNESGVDFTV TGRQIRFGLA AVKNAGRAAA LAIIAAREAG GPFTSLLDFC RRVDSRQANK RVVESLIRCG AFNSINPNRR QLLAGLDECL EVAARRQEDR RSGQVSLMDL VPEEVNDPPL PQVTDFSRAD ILAMEKELLG FYLSGHPLEP YAPVLKKLTS HSLADLAEIA DGSQVTIGGL VSGLRRLVTR KGDPMAVLTL EDFSGQVEAV LFPRMYSQVR SWLAPDRAVL VQGQVDKQEE GVQVLANLVR PLAPEAGGGD RTLPPPAGTH GTLPEARQGG GRLYLKLSSE GQLTALRDTL AAHPGPCQVY LYLADSRRTI ALHRRYWIEP VPELMASLAG LLGGYDKIKL VQENRI
|
| |