Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1948 |
Symbol | |
ID | 7084416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2189236 |
End bp | 2192463 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643698973 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_002355595 |
Protein GI | 217970361 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCCCG CCTACGCCGA ACTCCACTGC CTGAGCAACT TCAGCTTCCA GCGCGGCGCC TCGCACCCCG AGGAGCTCGT CGCGCAGGCG GCAGCCTTCG GCTACAGCGC GATCGCCCTC ACCGACGAAT GCTCCTTCGC CGGCGTGGTG CGCGCCCACC GCGCGCTGCA GGCCCTGCCC GAAGCCGGCC GGCCGCGCCT GATCGTCGGC TGCGAGCTGC GCCTCGCCGA CGGGCCCTGC CTCGTCCTGC TCGCCGCCGA TCGCGCCGGC TACGGCCGGC TCTCGCGCCT GCTCACCCAG GCCAGGCGGA GCGCCGACAA GGGCCGCTAC CACCTCACGC GCGCGATGCT CGACGCCACC CGCGCAGACG AGGCGGCCGC CGAAGCCCTG CCCGGCTGCC TGGCGCTGCT GATCCCACCC GAGGAATTCC CGCCGCCGGG TGCCGCCGGC GACGCGGCCG TCGCCCTGCG CAGCGAAGCG CGCTGGCTGG CGAAGCGCTT CCCCGGACGC TCCTGGATCG CCCACACCCC GCGCCTGGAC GGACGCGATC CCGCCCGCCT GCACTGGATC CTGCGCATCG CCCAGGCCAC CGGCGTCCCT GCGGTCGCCG CCGGCGGCGC GCTCATGCAC GAGGCCGCGC GGCGCCCGCT CGCCGACGTG ATGAGCGCGC TGCGCCTGCA CTGCAGCGTG GCCGAGGCCG GCCTGGCGCT CGCGCCCAAT GCCGAGCTGC GCCTGCACGA GCGCGCGACC CTCGCCCGCC GCCATCCTCC CGAGCTGCTC GCCGAGACCG TGGCGGTGGC GGCGCGCTGC CGCTTCGACC TCGACGAGCT GCGCTACGAG TACCCCGCCG AGCTCGTCCC CGCGGACGAG ACCCCGACGC GCTGGCTGCG CCGGCTGGTC GAGGGCGGGC TGGCCTGGCG CTACGGCAAG GCCGGCAAGG AGCGCCGCGC AGCCGCGCCT GCCACGATCG CGGCCGCACC GCCACCGGCC GAAGACCCCG CCCCGCCCAC GGTGCGCGCG CAGATCGAGC ACGAGCTCGC GCTCATCGCC GAGCTCGGCT ACGAGCCCTA CTTCCTCACC GTGCACGACA TCGTGCGCTT CGCCCGCGAG CGCGGCATCC TGTGCCAGGG CCGCGGCTCG GCGGCCAACT CGGTGGTGTG CTGGGCACTG GGCATCACCG AGGTCGATCC GCAGCTGGGC ATCATGCTGG TCGAGCGCTT CATCTCGCGA GAGCGCGACG AGCCGCCCGA CATCGACGTC GACTTCGAGC ACGAGCGCCG CGAGGAGGTC ATCCAGTACC TCTACCGCAA GTACGGCCGC GAACGCGCCG CGCTCGCCGC CACCGTGATC CGCTACCGCG CACGCAGCGC GCTGCGCGAC GTCGGCCGCG CGCTCGGCCT CGACGAGACG CAGATCGAGC GCCTCACCCG CGAGCACCAC TGGTTCGACG GCCGCCACAT CCTGCCCGAG CGCCTCGCGG AGGCCGGGCT CGACCCCGCC AGCCCGGTCA CGCAAAGGCT GGTCGCACTC ACCGAGGAAC TCATCGGCTT TCCGCGCCAC CTCTCCCAGC ACGTCGGCGG CTTCGTCATC GCGCGCGGCC GGCTCGACGA GCTGGTGCCG GTGGAGAACG CCGCCATGTC CGATCGCACC GTGATCCAGT GGGACAAGGA CGACCTCGAC GCGGTCGGCC TGATGAAGAT CGACGTGCTC GCGCTCGGCA TGCTGTCGGC GCTGCGTCGC GGCCTCGCCC TGGTGTCGGC ATGGCGCGGC GAGGCGCTGA CGCTGGCGAC CATCCCGCGC GAACGAAAGG AGGTCTACGA AATGCTGTCG CGCGCCGATT CGGTCGGCGT GTTCCAGGTC GAGTCGCGCG CGCAGATGAC CATGCTGCCG CGCCTGAGGC CGCAACGCTT CTACGACCTG GTGGTGGAAG TCGCGATCGT GCGCCCCGGC CCGATCCAGG GCGGCATGGT CCATCCCTAC CTGCAGGCGC GCGAGCGTGC GGCGCGCGGC GAGGATCCGT TGGACGGCCT GCGCGAAGAG ATCCGCGGTG TGCTGGCGCG CACGCTGGGC GTGCCGATCT TCCAGGAGCA GGTGATGCAG CTCGCGGTGG TCGCGGCCGA CTTTACCGGC GGCGAGGCCG ACCAGCTGCG CCGCGCGATG GGTGCCTGGC GGCGCAAGGG CGAGCTCGAG CGCTACCGCC AGAAGCTGCT CGACGGGCTG GCGAACAACG GCTACGACCC CGACTTCGCC CAGCGCCTGT GCCAGCAGAT CGAGGGCTTC GGCAGCTACG GCTTTCCCGA ATCCCACGCC GCCAGCTTCG CGCTGCTGGT GTATGCCTCG GCCTGGCTCA AGTGCTTCGC GCCCGCGGCC TTCCTCGCCG CGCTGCTGAA CAGCCAGCCG ATGGGCTTCT ACGCGCCCGC GCAGCTGATC CAGGACGCCC GCCGCCATGG GGTGGAGGTG CGCGCGGCGG ACGTCGGCGC CAGCGCGTGG GACTGCATGC TGGAGACCGC CGACGGCGCG CAGCCCGCGG TGCGCCTCGG CCTGCGCATG ATCCGCGGCC TCGGCCGCGA GGCCGCCACG CGCATCGCCG CCGCGCGTGG CGAACACGCC TTCGCCGACA CCCGGGACCT CGCCGCGCGC GCGCGGCTCG ACACCGGCGA GCTGCGCACC CTCGCCGCCG GCGGTGCCCT CGCCAGCCTC GCCGGTCATC GCCGCCAGGC CCTGTGGCAG GCGAGCGGCG CCGCGCCCCT GCCCGGCCTG CTCGCCGAAG CCCCTGGCGG CGACGTCGCC GCCACGCTGG ACGCCCCCGC CGAGGCCGAA GACCTGCTCG CCGACTACGC CCGCCTCGGC TTCACCCTCG GTCGCCACCC GCTCGCCTTC GTGCGCGAGC AGCTCGCCCG CCTGCGCTTT CTCACCGCCG CCGACATCAC CGCCGCGCCC GACCGCATGC TCGCCCGCGG CGCCGGCCTG GTCACCTGCC GCCAGCGCCC CGGCACCGCG AAGGGCACGC TCTTCCTCAC CCTGGAAGAC GAGACCGGGC TCACCAACGT GATCGTCCGC CCCGAGCTCT TCGAACAGCA GCGCCGCATC CTGCTCGGGG CGCGCCTGAT GGGCGTGTTC GGCCAGATCC GCCGTCAGGG CCGGGTCGTG CATCTGGTGG CGAGCCGGGT GGTCGACCAC TCGCCCCTGC TCGGCAGCCT CGCCGCGCGC AGCCGGGATT TTCACTGA
|
Protein sequence | MLPAYAELHC LSNFSFQRGA SHPEELVAQA AAFGYSAIAL TDECSFAGVV RAHRALQALP EAGRPRLIVG CELRLADGPC LVLLAADRAG YGRLSRLLTQ ARRSADKGRY HLTRAMLDAT RADEAAAEAL PGCLALLIPP EEFPPPGAAG DAAVALRSEA RWLAKRFPGR SWIAHTPRLD GRDPARLHWI LRIAQATGVP AVAAGGALMH EAARRPLADV MSALRLHCSV AEAGLALAPN AELRLHERAT LARRHPPELL AETVAVAARC RFDLDELRYE YPAELVPADE TPTRWLRRLV EGGLAWRYGK AGKERRAAAP ATIAAAPPPA EDPAPPTVRA QIEHELALIA ELGYEPYFLT VHDIVRFARE RGILCQGRGS AANSVVCWAL GITEVDPQLG IMLVERFISR ERDEPPDIDV DFEHERREEV IQYLYRKYGR ERAALAATVI RYRARSALRD VGRALGLDET QIERLTREHH WFDGRHILPE RLAEAGLDPA SPVTQRLVAL TEELIGFPRH LSQHVGGFVI ARGRLDELVP VENAAMSDRT VIQWDKDDLD AVGLMKIDVL ALGMLSALRR GLALVSAWRG EALTLATIPR ERKEVYEMLS RADSVGVFQV ESRAQMTMLP RLRPQRFYDL VVEVAIVRPG PIQGGMVHPY LQARERAARG EDPLDGLREE IRGVLARTLG VPIFQEQVMQ LAVVAADFTG GEADQLRRAM GAWRRKGELE RYRQKLLDGL ANNGYDPDFA QRLCQQIEGF GSYGFPESHA ASFALLVYAS AWLKCFAPAA FLAALLNSQP MGFYAPAQLI QDARRHGVEV RAADVGASAW DCMLETADGA QPAVRLGLRM IRGLGREAAT RIAAARGEHA FADTRDLAAR ARLDTGELRT LAAGGALASL AGHRRQALWQ ASGAAPLPGL LAEAPGGDVA ATLDAPAEAE DLLADYARLG FTLGRHPLAF VREQLARLRF LTAADITAAP DRMLARGAGL VTCRQRPGTA KGTLFLTLED ETGLTNVIVR PELFEQQRRI LLGARLMGVF GQIRRQGRVV HLVASRVVDH SPLLGSLAAR SRDFH
|
| |