Gene Information |
Locus tag | Moth_1154 |
Symbol | |
ID | 3833122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1184542 |
End bp | 1187307 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829085 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_430011 |
Protein GI | 83590002 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative |
|
|
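Note: the coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS coordinates include the stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the record for Moth_1154.
length_bp = gene_length(1184542, 1187307)
print(length_bp)                  # 2766
print(protein_length(length_bp))  # 921
```

Both results agree with the Gene Length and Protein Length fields in the record.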
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCCCC AGACTTATGT CGTCCTGGAT GTGGAAACCA CCGGCCTGGA TCCAGCCCGG GACAGAATAA TTGAGATTGC TGCCGTCCGC CTGGAAGGAG GCAACATTAC CCGCCAGTTT CAAACCCTGG TGAACCCGGG CCGGCCTATA CCCCCGGCCA TCGAGAGGTT GACGGGTATT AGTGACGCCA TGGTCCGCGA GGCGCCCCCC CTTCCTGAAG TTTTGCCCGG GCTGCTGGAT TTATTCAGGG ATGCCATCCC AGTAGGCCAT AACGGCACCT TTGACCTGGC CTTCCTAAAC CAAGCCCTGG GCCACGGCTG GCATTCCCCG CTGCTGGATA CCCTTGCCCT GAGCCGGATT CTCTTTCCCT GCCTGGCTTC TCACCGCCTG GATTATATGA GCAAGTACCT GACCCTCGAA GCTACCGGCC ATCATCGCGC CCTGGATGAT GTGTTAACTA CCGCCCGCCT CCTGGAGAAC CTCTGGCAGG CCACCCTGGA ACTGGACAAA AACCTGCTGA CAAAACTCCT GAACCTAGCC CCGGTCGGCC TCCAGTCCTG GTTCCGGGCT GCTCTGGTTC AGGGAACGCC AGCCAGCCAT TTTGAAGTGG CCGCTACCGG GTTATTTGCC CCGACCAGGG TACCCTCCCC GCAGTCTGGA ACCTTGCCGG CCTTTAATGT CGACGACCTG GTAGCCATGC TGGACCACAG GGGTCTGCTG GCCGAGCAGA TACCAGGTTA TGAATACCGT CCCCAGCAGG TGGAAATGCT CAGGGCTGTA GCCTCGGCCC TGGCCGGCAA TCACTACCTG ACAGTAGAAG CCGGCACAGG GACCGGCAAA TCTCTGGCCT ACCTTTTGCC GGCGATTTAC TGGGCCTGTA GCCAGAGAAA GAGGGTAGCC ATTGCCACCC ATACCATCAG CCTGCAGGAA CAACTCTGGC AGAAGGACCT GCCCCAGCTA CGGGAACTCC TGCCCTTTTC CTTTAAGGCG GCCCTGGTTA AAGGACGGAG CAACTATATC TGCAGGCGGA AGTTGCGGGA CTACCTGGCT AACCCGCCGG CCGGGGAGGC AGAGCGTCTC TTCGCCATGC GGGTTTTACG CTGGCTGGAG GTAACAACCA GCGGCGACTG GAGCGAAATG AAACTTACCC CGGAAGAAGA AGGCTTCAAA TTTGCCCTGG CGGCCGATAC CGAGACCTGC ACCGGTAGCG CCTGCCCCTT CAACGATGAA TGTTTTGTCA ATGCCGCCCG CCGGGAGGCT GAGGCTGCCA ATATCCTGAT CTTGAACCAT TCCCTTTTAC TGAGCGATAT CCGCTTAAAC AACCAGGTCC TGCCTGACTA CCCCTACCTG ATTATCGATG AAGCCCATCA CCTGGAGGAG GCGGCTACCG AACACCTGGG GAGCAGTGTC AGCCAGGCCA GCTGTGAGCT TTTCTTTCGC CGCCTGGGCC GGGGTGAGCA GGCCTATAGC TTCCTGGGTC GGGTCCGCAA TCTCGCCCGC CGTTTACCTC CGGAAGGGAA CCTTGAATTG GCAGACTTCC TGGAGGATAT GGAACTAACA GTAACAGCAA CCCTGGCCGG TTGGCAGGAG TTCTGGGAGA GCTTGGGCAG GTTAAGTGAT GCCGCCCGCT GGGAAGAGGC GGGTTATACC CTGCGTTTCA CCAGCCGTTT AAAGGAAACC CCTGCTTGGG ACAACCTGCT GTCAGTCTTC GGGAGCCTGG AGGAAAACCT CAGCGGCCTG GCCAGTCGCC TGGAGCGCCT CTCGGAGTTA TTGAGCGCTG CCGGGGCCGG CGAGTTTGCT 
GCCGATGCCG GTAATTTCGC CGCCGTTGTT GCCCAGTACA GCTATGATCT GGGGCAGATC CTTGATGCCG ACCCGGCCAC CAGCGTCAGC TGGCTGGAAA AGAATAACCA TGGCCAGTAT ATCTTACGCT CCGCCCCCCT GGATATTGGT CCCCTGCTGG CAGAACTTCT CTTTTCCCGC AAGCAGGCTG TGATCCTGAC CTCGGCCACC CTGACAGTCA ACAATAGCTT TGATTATTAC CACCAGCAGA CCGGCCTCCA GGAACTGCCA GCCGACAGGG TCGTCAGCTG CCAGGTATCC TCACCCTTTG ACTACCGGTC GCAGGCCCTG GTTTGTTCTA TTAGAGGGCT GCCCAACCCG GGCCAGTTAA AGGATGCCGA CTATGCCCGG GCTATTACCC CGGTCCTGAC AGCCATCTGC CCGGCCGTCG GCGGCCGTTC CCTGGTTCTC TGTACCTCCC ACCGTTTTTT ACGGGAAGTC TACGAACTTT TAAGCGCCGA CCTCAACGGC AGCGGTTACC GGGTCCTGGC CCAGGGAATA GACGGCAGCC GTTCCCGCCT GCTGGAAGAA TTCATCCAGA CTCCCCGGGC TGTCCTTCTG GGGGCCAACA GCTACTGGGA AGGTATCGAC CTGCCCGGGG ATCTGCTTCG CTGCGTCATC ATCCCGCGTT TACCATTCCC CTCCCCGGGC ATACCGACCC TGGCGGCGCG GATGGAACAC CTGGCCGCCC GGGGACAGAA TGCCTTTGCC ACCCTGAGTC TACCCCAGGC GATTATTCGT TTCCGCCAGG GGTTTGGTCG CCTGATCCGG CGGGCCAGCG ACAGAGGAGT ACTGGTAATC CTTGACCAGC GCCTCCTCTC CCAGCGGTAC GGCCGCCTTT TTATCCAGTC CCTGCCGCCA GTAACCCTTG AGGAGGTTGA CCCCGCCGGG GCTCCCTCCC GGATAAAAAC ATGGTTTCAG GGTTGA
|
Protein sequence | MLPQTYVVLD VETTGLDPAR DRIIEIAAVR LEGGNITRQF QTLVNPGRPI PPAIERLTGI SDAMVREAPP LPEVLPGLLD LFRDAIPVGH NGTFDLAFLN QALGHGWHSP LLDTLALSRI LFPCLASHRL DYMSKYLTLE ATGHHRALDD VLTTARLLEN LWQATLELDK NLLTKLLNLA PVGLQSWFRA ALVQGTPASH FEVAATGLFA PTRVPSPQSG TLPAFNVDDL VAMLDHRGLL AEQIPGYEYR PQQVEMLRAV ASALAGNHYL TVEAGTGTGK SLAYLLPAIY WACSQRKRVA IATHTISLQE QLWQKDLPQL RELLPFSFKA ALVKGRSNYI CRRKLRDYLA NPPAGEAERL FAMRVLRWLE VTTSGDWSEM KLTPEEEGFK FALAADTETC TGSACPFNDE CFVNAARREA EAANILILNH SLLLSDIRLN NQVLPDYPYL IIDEAHHLEE AATEHLGSSV SQASCELFFR RLGRGEQAYS FLGRVRNLAR RLPPEGNLEL ADFLEDMELT VTATLAGWQE FWESLGRLSD AARWEEAGYT LRFTSRLKET PAWDNLLSVF GSLEENLSGL ASRLERLSEL LSAAGAGEFA ADAGNFAAVV AQYSYDLGQI LDADPATSVS WLEKNNHGQY ILRSAPLDIG PLLAELLFSR KQAVILTSAT LTVNNSFDYY HQQTGLQELP ADRVVSCQVS SPFDYRSQAL VCSIRGLPNP GQLKDADYAR AITPVLTAIC PAVGGRSLVL CTSHRFLREV YELLSADLNG SGYRVLAQGI DGSRSRLLEE FIQTPRAVLL GANSYWEGID LPGDLLRCVI IPRLPFPSPG IPTLAARMEH LAARGQNAFA TLSLPQAIIR FRQGFGRLIR RASDRGVLVI LDQRLLSQRY GRLFIQSLPP VTLEEVDPAG APSRIKTWFQ G
|
| |
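Note: the gene and protein sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting and translate the gene sequence so that it can be compared with the listed protein. It is an illustration under stated assumptions, not the tool that produced this record. The amino-acid assignments are those of the standard code, which translation table 11 shares for elongation; the alternative start codons of table 11 are not handled (this gene starts with ATG, so that does not matter here). The names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGCTCCCCC AGACTTATGT CGTCCTGGAT"))  # MLPQTYVVLD
```

The output matches the first block of the listed protein sequence. Because the gene lies on the minus strand, the sequence shown in the record is taken to be the coding strand already, so no reverse complement is applied here.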