Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3956 |
Symbol | |
ID | 7873602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4352685 |
End bp | 4354193 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700893 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_002890916 |
Protein GI | 237654602 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.323553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACGCCCA CCCTCCGCAG CTTTCCCGGC GGCCTCGCCT TTGTCGACAT CGAGACCACC GGCGGCCCCG CCCAACGCGA ATCCATCACC GAGATCGGCA TCGTCCAGGT GGATGAGGAC GGCGTGCGCG AGTGGTCGAC GCTGGTGCGT CCCGCGTCGC GCATCCCAGA GACCATCCAG CGCCTCACCG GCATCGACGA CGACATGGTC GCCGACGCAC CGCGCTTCGA GGACATCGCC GACGAGGTCT TCGACCGTCT CGACGGCCGC CTCTTCGTCG CCCACAACGC ACGCTTCGAC CACGGCCACC TGCGCGCCGC CTTCCGCCGC GCCGGGCTCG ACATGCGGCC GCAGGTGCTG TGCACCGTCA AGCTGTCGCG CCGGCTGTTC CCCGACCACC GCCGCCACGG TCTCGACCAC CTCATCGAGC GCCACGGCCT GGCGGTGGCC GACCGCCACC GTGCGCTCGG CGACGCCCGG CTGCTGTGGC AGTTCTGGCA GAAGATCCAC GAACGCTTTC CGCCCGGTCA CATCGATGCC GCGGTGCGCG AACTCATCGG CCACCCCAGC CTGCCCCCCC ACCTCGACCC CGAGCAAATC GCCGACCTGC CCGACACGCC GGGGGTGTAT CTGTTCTACG GCGAGCGCGG GGGCGAGAGC AGCCAGCTTG GGGGCAACGA TGAAGCCGAC GCCGAAGCCG AGGCCGATCC GCTCGGACCC GGCAGGGCGC GCACCGGCGC GCGCGACCGC AAGCGCCACG CGCCGCTGCA GGACTTGCCG CTCTACATCG GCAAGAGCAC GCGGCTGCGC AGCCGGGTGT TGTCGCACTT CGCCGCCGAC CACAGCAGCG ACCGCGAGCT CAGCCTCTCC CAGCAGGTGC GCCGCATCGA ATGGATCGCG ACCGCCGGCG AGATCGGCGC GCTGCTGAAG GAAGCCGAAC TGGTCAAGCG CCTGCAGCCC ACCCACAACC GCCAGCTGCG CCGCAACCGC GAGCTGTGCA CCTGGCGGCT CGCCACCGAC ATCGTCGGCG ACTGGCGGCT GGAGCTGGTG CATGCGGCCG ACCTCGACTT CGGCCGCCGC GACGACCTCT ACGGTTTCTT CCGCACCCGC CGCGAGGCCA CCAACCGGTT GCGCGCGCTC GCCCGCGACC ACGCCCTGTG CCCGCCGCTG CTCGGCCTGG AGAAACCCCC GCAAGGTGCG CGCTGCTTCG ACTTCCAGTT GAAGCGCTGC CGTGGCGCCT GCCACGGCGG CGAATCCCCC CAGGCCCACG CCCTGCGCCT GATCGAGGCC CTGCACGCGC TGAAGGTCGA GCACTGGACC TGGCCCGGCC CGGTCGGCCT GCGCGAGGGC GAGGCCATCC ACGTCGTCGA CGGCTGGCGC TGGCTCGGCA CCGCCACCGA CGAAGCCATG CTCGCCGACC TGCTGGAGGC CGGCCGCCCG GCCTTCGACC ACGACATCTA CAAGATCCTG GTCAAGGCGG TGAGGCGGCT GCCGGTGGTG CAGCTCTAA
|
Protein sequence | MTPTLRSFPG GLAFVDIETT GGPAQRESIT EIGIVQVDED GVREWSTLVR PASRIPETIQ RLTGIDDDMV ADAPRFEDIA DEVFDRLDGR LFVAHNARFD HGHLRAAFRR AGLDMRPQVL CTVKLSRRLF PDHRRHGLDH LIERHGLAVA DRHRALGDAR LLWQFWQKIH ERFPPGHIDA AVRELIGHPS LPPHLDPEQI ADLPDTPGVY LFYGERGGES SQLGGNDEAD AEAEADPLGP GRARTGARDR KRHAPLQDLP LYIGKSTRLR SRVLSHFAAD HSSDRELSLS QQVRRIEWIA TAGEIGALLK EAELVKRLQP THNRQLRRNR ELCTWRLATD IVGDWRLELV HAADLDFGRR DDLYGFFRTR REATNRLRAL ARDHALCPPL LGLEKPPQGA RCFDFQLKRC RGACHGGESP QAHALRLIEA LHALKVEHWT WPGPVGLREG EAIHVVDGWR WLGTATDEAM LADLLEAGRP AFDHDIYKIL VKAVRRLPVV QL
|
| |