Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0003 |
Symbol | |
ID | 3831313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1894 |
End bp | 3018 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637827930 |
Product | DNA polymerase III, beta subunit |
Protein accession | YP_428886 |
Protein GI | 83588877 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0592] DNA polymerase sliding clamp subunit (PCNA homolog) |
TIGRFAM ID | [TIGR00663] DNA polymerase III, beta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00459981 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000116577 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATATCC TTTGTCCTCA ACCCCAACTT GTTAATGCTG TGCAAAAGGT ATACCGGGCG GTAGCCACAA CGACAACCTA TCACGCTATT ACCGGGATTC TATTGCAGGC CCATGAAAAT ACCTTGACCC TCCAGGGTAC CGATCTTGAT CTGGGAATTA TTTATACCTT TCCTGTTGAG GTTATCGAAG AAGGCGAGCT CTTACTGCCG GCACGTATCT TTACCGAGAT GGTCCGGCGC CTGCCGCCTA CCTCCCTTTC TTTACAGAGT TTACAGGATA ACACCGTGGA GATCGCTTAC CAGCAGTCCA AAGTCCAACT TAACAGCATT GACGCCAGCC AGTTTCCGCT CCTGCCGCCG GTAGAAGGTA ACTTCTCCTT TACAGTGGCC ATTACCGCCC TCAAGGATGC CATCCGTAAG GTAACAATTG CCGCCGGTAA TGACGACCTG CGCAGCATTT TCAATGGTGT TCTCTGGGAA TTAGAACCCG GGGAAAACAG GTTTAACCTG GTGGCCACCG ATACCCATCG TCTGGCTGTC TACCACGGCC AACCAGAAGA TTCCACGAGT AACGAAACGG CTACCGCCCT GGTACCATGC CGGGCTATGA ATGAACTGGC GCGTTTACTC CCCGGAGAAG ATGGTTTAGT AAAAATAACC ATCGGTGAAA GTCAGATCTA CGCCCAGCAC GAGGGCTTAA CGTTATACAC CCGATTATTG AATGGTAAAT TTCCTCATTA CCAGCAAGTT ATCCCAACTG ATCATATAAC TACCATAGAA ATAGCCACCC GGGATCTCCT GGACACCGTT GAACGGGCTA CCTTACTGGC CCGGGATGAG AATAAAGCCA GGGCCCATAT TATTATTTTG CAGGTAGGGG AAAAATCTTT AAAAATAACC AGTGAAGCTG CCGAGATAGG CCACCTGGAA GAGGAGTTAA CGGCAGAAAT AGCAGGACAA CCCCTGGAAC TAGCTTTGAA CGGGCGCTAC CTGCTGGAAA CCCTGCGGGT AATTGATACC GAAAACGTAA TTCTGGAACT CCTGGCCCCG TTGAAACCCG TTGTTGTCAG GCCGGCCGGC CAGGAAAACT ACTTCTGCCT TATCCTACCG GTCAGGATTG GCTAA
|
Protein sequence | MHILCPQPQL VNAVQKVYRA VATTTTYHAI TGILLQAHEN TLTLQGTDLD LGIIYTFPVE VIEEGELLLP ARIFTEMVRR LPPTSLSLQS LQDNTVEIAY QQSKVQLNSI DASQFPLLPP VEGNFSFTVA ITALKDAIRK VTIAAGNDDL RSIFNGVLWE LEPGENRFNL VATDTHRLAV YHGQPEDSTS NETATALVPC RAMNELARLL PGEDGLVKIT IGESQIYAQH EGLTLYTRLL NGKFPHYQQV IPTDHITTIE IATRDLLDTV ERATLLARDE NKARAHIIIL QVGEKSLKIT SEAAEIGHLE EELTAEIAGQ PLELALNGRY LLETLRVIDT ENVILELLAP LKPVVVRPAG QENYFCLILP VRIG
|
| |