Gene Information |
Locus tag | Tmz1t_1583 |
Symbol | |
ID | 7084787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1762601 |
End bp | 1766125 |
Gene Length | 3525 bp |
Protein Length | 1174 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698600 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_002355237 |
Protein GI | 217970003 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
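The coordinate and length fields above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about how this record reports its values, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1762601, 1766125)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3525 1174
```

The results match the Gene Length (3525 bp) and Protein Length (1174 aa) fields.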
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.422448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCTG CTGCCCCCTC CACGGCCGAA GCGCCCATCG TGCCGAGCGC TCCGCGCTTC GTGCATCTCC GCCTGCACTC CGAATATTCG ATCACCGACG GCATCGTCCA GCTCGACCAG GCGATCGCCG CCGCCGCGGC CGACGGCATG CCGGCGCTCG GCGTCTCCGA TCTCGCCAAC CTGTTCGGCA TGGTCAAGTT CTACAAGGGC GCGCGCGGCA AGGGCATCAA GCCCATCGTC GGCGTCGACG CCTGGATCCG CAACGAGGCC GAGCGCGACA AGCCGCAGCG CGTGCTGCTC ATCTGCAAGA ACCGCGCGGG CTATGGTCAG CTGTGCGAGC TGCTGACGCG CGCCTACCTC GAGAACAAGC ACCGCGGCCG CGCCGAGATG CGCCGCGAGT GGTTCGAGAA TGGTGCCGCA AGCGCGCTGC TGTGCCTGTC GGGTGCGATG AGCGGCGACA TCGGCGCGGC GATCGCGGCC GGCAACCTCG CGCTCGCCGA ACAGCTCGCG GCCGACTGGG CGCGGCTGTT TCCGGACGCC TTCTACATCG AGATCCAGCG CGCCGGGCAT CCCGGCACCG AGTCCTACAT CCGCCACGCG GTGGAACTCG CCGGCCGGCT CGGCCTGCCG GTGGTGGCGA CCCACCCGGT GCAGTTCCTC AAGCGCGAGG ATTTCAAGGC ACACGAGGCG CGCGTCTGCA TCGCGCAGGG CTACGTGCTC GCCGACAAGC GCCGGCCGCG CGACTTCACC GAGGAGCAGT ACCTCAAGAG CCAGGCGGAG ATGTGCGAGC TCTTCGCCGA TCTGCCCGAG GCGCTCGAGA ACGCGGTCGA GATCGCGCGC CGCTGCTCGC TCACCGTGCA ACTCGGCAAG AACTTCCTGC CCTTGTTCCC GACGCCCGAG GGCATGACGC TCGACGACTT CCTCGTCGCG GAGGCGAAGA AGGGGCTGGA GGAGCGCCTC GCCCAGCTCT ACCCGCACCC GGAGGAGCGC GAGCGTCAGC GCCCGCGCTA CGAGCAGCGG CTGAAGTTCG AGACCGACAC CATCATCCAG ATGGGCTTCC CCGGCTACTT CCTGATCGTG GCCGACTTCA TCCAGTGGGG CAAGAACAAC GGCGTGCCGG TCGGACCGGG GCGGGGCTCC GGCGCGGGCT CGCTGGTGGC CTACTCGCTC AAGATCACCG ACATCGACCC CCTCGAGTAC GCGCTGCTGT TCGAGCGCTT CCTCAACCCC GAGCGGGTGT CGATGCCCGA CTTCGACATC GACTTCTGCC AGGACAACCG CTACCGCGTC ATCGAGTACG TGCGCGAGCG CTACGGCAAG GACGCGGTGA GCCAGATCGC CACCTTCGGC ACCATGGCCT CGAAGGCGGT GGTGCGCGAC GTCGGCCGCG TGCTGGATCT GCCTTACGGC CTGTGCGACC GCCTCTCCAA GCTGATCCCG ATCGAGGGCG CCAAGCCCGT CTCGCTGAAC AAGGCCTACG AGATGGAGCC GCAGATCGGC GAGATGATGG CCGACGGCAA CGACGGCGAG TCGGTGCGCG ACCTGTGGAG CCTGGCGCAG CCGCTGGAGG GCTTGAGCCG CAACGTCGGC ATGCACGCCG GCGGCGTGCT GATCGCGCCC GGCAAGCTCA CCGACTTCTG TCCGCTCTAC ATCGCCGACG GCGACGACGC CACGCCGGTG TCGCAGTTCG ACAAGGACGA CGTCGAAGCC GTGGGCCTGG TCAAGTTCGA CTTCCTCGGC CTGCGCAACC TGACCATCAT CGAGCTCGCG CTGGAGTACG TGGCGCGCCT GGAGGGCAGC 
CGTCCGGACC TGATGAGCCT GGGCTTCGAG GATCCCGCCG CCTACCAGAT CCTCAAGGAC GCCAACACCA CGGCGATCTT CCAGGTCGAA TCGGACGGCA TGAAGAAGCT GCTCAAGAAG CTCGCGCCCG ACCGCTTCGA GGACATCATC GCGGTGCTCG CGCTCTACCG TCCCGGCCCG CTCGGCTCGG GCATGGTGGA CGACTTCATC CTGCGCAAGA AGGGCCAGCA GGAGATCGAC TACTTCCACC CCGACCTCAA GGCCTGCCTG GAGCCGACCT ACGGCGTGAT CGTGTACCAG GAACAGGTGA TGCAGATCTC CCAGATCATC GGGGGCTACA CGCTTGGCGG TGCCGACATG CTGCGCCGCG CGATGGGCAA GAAGAAGGCC GACGAGATGG CCAAGCACCG CGCCACCATC GCCGAGGGTG CGAAGCAGAA GGGCTATGAC CCGGCGCTCG CCGAGCAGCT CTTCGACCTG ATGACCAAGT TCGCGGAGTA CGGCTTCAAC AAGTCGCACA CCGCGGCCTA CGCGGTGGTC ACCTACCACA CCGCCTGGCT CAAGGCGCAC CACTGCGCGG CCTTCATGGC GGCGACGATG TCGTCCGACC TGGACAACAC CGATACCGTC AAGATCTTCT ACGAAGACAC GCTCGCCAAC GGCATCAAGG TGCTGCCGCC GGATGTCAAC GCCTCCGACT ACCGCTTCGT GCCGGTGGAT CGCAAGATCA TCCGCTACGG CCTGGGCGCG GTGAAGGGCG TGGGCGAGCC TGCGGTGCGC GCGATCCTCG CCGCGCACGA GAAGGGCGGA GACTTCCGCG ACCTGTTCGA TTTCTGCGAG CGCGTCGATC GCCGCATGGT AAACCGCCGC GTCATCGAGG CGCTGATCCG CGCCGGCGCG ATGGACACCC TGCCGGGCCA CAAGGGCCTG GACCGCGCGC AGCTGATGGC GACCGTCGCG CTGGCGATGG AGGCGGCCGA ACAGGCGGCG GCCAACGCGA TGCAGGGTGG GCTCTTCGAC CTGATGCCGG AGGCCGCGGG CGCGGCGCCC GAGTTCGCCA AGGTGCGTGC GTGGACCGAG CGCGAGCGCC TCAAGGAAGA GAAGCTCGCG ATCGGCTTCT TCCTCTCCGG CCATCCCTTC AACGCCTTCA AGCCCGAGGT GCGCCGCTTC GTGCGCCGCA CGCTGGCGCA GCTCGAGCCC TCGCGCGACA TCACCATGCT GGCCGGCGTG GTGATGGAGC AGCGCACCAA GGTCGGCAAC CGCGGCAAGA TGGCCTTCGT GCTGCTCGAC GACGGTACCG AGCCGCGCGA GGTCACGGTG TATTCCGAGG TGCTGGAGGC GAGCCGCGGC AAGATCGTCA CCGACGAGGT GCTGGTGGTG GAGGCCAAGG TGAGCAACGA CGATTTCTCC GGTGGCCTGC GCATCAACGC CGACCGCCTG CTCACCCTCG GCGAGGCGCG CAGCCGTTTC GCACGCGCGC TGTCGCTACG CATCGACGGC GAGCTTGCCG CCAGCGGCGG TGCGGCGGGC GCGGCCGGCA AGCTGCAGAC CCTGCTCGAG CCCTTCCGCG AGGGCGGCTG CCCGATCCGC GTCCGCTATC GCAACGCCGC CGCCGAAGCC GAGCTCCCGC TCGGCGACGG CTGGCGCGTG CGCCTGGACG ACGCGCTGCT GGAGAGCCTG CGCGAATGGC TGCCGGGCGA GGCGGTCGAG GTGGTGTACC CCTGA
|
Protein sequence | MNPAAPSTAE APIVPSAPRF VHLRLHSEYS ITDGIVQLDQ AIAAAAADGM PALGVSDLAN LFGMVKFYKG ARGKGIKPIV GVDAWIRNEA ERDKPQRVLL ICKNRAGYGQ LCELLTRAYL ENKHRGRAEM RREWFENGAA SALLCLSGAM SGDIGAAIAA GNLALAEQLA ADWARLFPDA FYIEIQRAGH PGTESYIRHA VELAGRLGLP VVATHPVQFL KREDFKAHEA RVCIAQGYVL ADKRRPRDFT EEQYLKSQAE MCELFADLPE ALENAVEIAR RCSLTVQLGK NFLPLFPTPE GMTLDDFLVA EAKKGLEERL AQLYPHPEER ERQRPRYEQR LKFETDTIIQ MGFPGYFLIV ADFIQWGKNN GVPVGPGRGS GAGSLVAYSL KITDIDPLEY ALLFERFLNP ERVSMPDFDI DFCQDNRYRV IEYVRERYGK DAVSQIATFG TMASKAVVRD VGRVLDLPYG LCDRLSKLIP IEGAKPVSLN KAYEMEPQIG EMMADGNDGE SVRDLWSLAQ PLEGLSRNVG MHAGGVLIAP GKLTDFCPLY IADGDDATPV SQFDKDDVEA VGLVKFDFLG LRNLTIIELA LEYVARLEGS RPDLMSLGFE DPAAYQILKD ANTTAIFQVE SDGMKKLLKK LAPDRFEDII AVLALYRPGP LGSGMVDDFI LRKKGQQEID YFHPDLKACL EPTYGVIVYQ EQVMQISQII GGYTLGGADM LRRAMGKKKA DEMAKHRATI AEGAKQKGYD PALAEQLFDL MTKFAEYGFN KSHTAAYAVV TYHTAWLKAH HCAAFMAATM SSDLDNTDTV KIFYEDTLAN GIKVLPPDVN ASDYRFVPVD RKIIRYGLGA VKGVGEPAVR AILAAHEKGG DFRDLFDFCE RVDRRMVNRR VIEALIRAGA MDTLPGHKGL DRAQLMATVA LAMEAAEQAA ANAMQGGLFD LMPEAAGAAP EFAKVRAWTE RERLKEEKLA IGFFLSGHPF NAFKPEVRRF VRRTLAQLEP SRDITMLAGV VMEQRTKVGN RGKMAFVLLD DGTEPREVTV YSEVLEASRG KIVTDEVLVV EAKVSNDDFS GGLRINADRL LTLGEARSRF ARALSLRIDG ELAASGGAAG AAGKLQTLLE PFREGGCPIR VRYRNAAAEA ELPLGDGWRV RLDDALLESL REWLPGEAVE VVYP
|
| |
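The gene and protein sequences above can be spot-checked against each other by translating the first and last few codons. This is a minimal sketch, not a full translator: the codon table is deliberately partial and covers only the codons in the two excerpts, all of which have the same meaning in translation table 11 as in the standard code. The `translate` helper and `PARTIAL_CODON_TABLE` are hypothetical names introduced for illustration.

```python
# Partial codon table: only the codons present in the excerpts below.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "AAT": "N", "CCT": "P", "GCT": "A", "GCC": "A",
    "CCC": "P", "TCC": "S", "ACG": "T", "GAA": "E",
    "GTG": "V", "TAC": "Y", "TGA": "*",
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = PARTIAL_CODON_TABLE[dna[i:i + 3]]  # KeyError if codon not in table
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 nt and last 15 nt of the gene sequence, copied from the record.
# The tail is in frame because the 3525 bp gene length is a multiple of 3.
print(translate("ATGAATCCTG CTGCCCCCTC CACGGCCGAA"))  # MNPAAPSTAE
print(translate("GTGGTGTACC CCTGA"))                  # VVYP
```

Both results agree with the start (MNPAAPSTAE) and end (VVYP) of the listed protein sequence.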