Gene Information |
Locus tag | Tmz1t_2823 |
Symbol | |
ID | 7873231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3052158 |
End bp | 3055007 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643699744 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002889799 |
Protein GI | 237653485 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
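The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below is only an illustration and assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon; the variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive on the replicon.
start_bp = 3052158
end_bp = 3055007

# An inclusive interval contains (end - start + 1) positions.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the listed gene length of 2850 bp and protein length of 949 aa, with no leftover bases.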
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.156592 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGGC ACGATCGCGA GGACCGCAGC CCCACGCCCC AGCCCCCCGG CCTGGCGACC GCGCCGGATC GTGCGGCAGC ACACGACGCG CTCGACCAGC GTGCGGACGC CTTCGAGATC CTCCTCGACG GCGTCGCGGT GCAGGCTTTC CCCGGCGAGA CGCTGTGGAA GGTCGCACTG CGCGCCGGTG AACGCATCCC CCACCTGTGT TTCAAGGACG CGCCCGGCTA CCGCGCCGAC GGCAACTGCC GCGCCTGCAT GGTCGAGATC GAGGGCGAGC GCGTGCTCGC CGCCTCCTGC ATGCGCGCGG CGCAGCCGGG CATGGTGGTA CGCAGCGCCA GCTCGGCGCG GGCACGGCAG GCGCGCGAGG GTGTGCTCGA ACTGCTGCTC GAACAGCAGC CGGCACGCAC GCACAGCCCC GACCGCTCCA GCCATTTCTG GGAGCTGGTC GAGCACATGC GTTTGCCCGC CGCCGACCGC GGCCCCGCAG CGCAGCAGCC GGACCGCACC CACCCCGCGA TCCACGTGAA CCTGGACGCC TGCATCACCT GCGGCCTGTG CGTGCGCGCC TGCCGCGAGG TGCAGGTGAA CGAGGTCATC GGCCTCGCCC ACCGCGGTGC ACGCGCGAAG ATCGTCTTCG ACTTCGACGA TGCGCTTGGC GACAGCAGCT GCGTGGCCTG CGGCGAGTGC GTGCAGGCCT GTCCCACCGG CGCGCTGATG CCCGCCACCC TGGTCGATGC GCAGGGCCGC GGCGACTCGG CCTGCGCCGA GCGCAAGGTG GATTCGGTGT GCCCCTACTG CGGCGTGGGC TGCCAGCTCA CCTACCACGT CCGGGACGAG CGCATCCTCT TCGTCGAGGG CCGCAACGGC CCGGCCAACG AGAACCGCCT GTGCGTGAAG GGCCGCTTCG GCTTCGATTA CCCCAACCAC CGCGGCCGCC TCACGACGCC GCTGATCCGC CGCGCGGGCG TGCCCAAGGG CGTGGATCCC GCCTTCGATC CGGCGAATCC GCTCAGCCAC TTCCGCCCGG CGAGCTGGGA CGAGGCGCTC GACTTCGCCG CCGCCGGCCT GCGCCGCCTG CTCGACACCC ACGGCCCCGG CGTGCTCGCC GGCTTCGGCA GCGCCAAGTG CTCGAACGAG GAGGCCTGGC TGTTCCAGAA GCTGGTGCGC ACGGGCTTCG GCTCCAACAA CGTCGACCAC TGCACCCGGC TGTGCCACGC CAGCTCGGTC GCCGCGCTGA TGGAGTGCAT CGGCTCGGGC GCGGTCACGG CCTCCTTCAT GCAGGCCGCG CACGCGGACG TGGTCATCCT CACCGGCTGC AACCCGACGG TGAACCACCC GGTCGCCGCC ACGTACTTCA AGCAGGCCGC CAAGCGCGGC ACCAGGCTGA TCGTGCTCGA CCCGCGCGGC CTGGTGCTCG GCCACCACGC CCACCGCATG GTGCGCTTCA CGCCGGGCAG CGACGTCGCG CTCTTCAACG CCATGCTCAA CGTCATCGTC GGCGAGGGCC TGTGCGACCG CGACTTCATC GCCGCGCGCA CCGAGGGCTT CGAGGCGCTC GCCGCCCACG TCGCACCGCT CACGCCCGAG GCCATGGCGC CGCTGTGCGG GGTGGCGCCG GACGAGATTC GCGCCATCGC CCGCCTCTAC GCCACCGCCG AGCGCGCGAT GATCTTCTGG GGCATGGGCA TCTCGCAGCA TGTGCACGGC ACCGACAACG CGCGCTGCCT GATCGCGCTC GCGCTCGCCA CCGGCCATGT CGGCCGCCCC GGCACCGGCC TGCATCCGCT GCGCGGGCAG 
AACAACGTGC AGGGGGCGTC CGACGCCGGC CTCATCCCCA TGGTGCTGCC CGACTACCGC CCGGTGGGCG ACGCGCAGTA CCGCGCCGCC TTCGAGGAAC TGTGGGCGAC GCCGCTCCCC GCCGAGCCCG GCCTCACGGT CGTCGAGACC ATGGACGCGA TTGCCGCCGG CCGGGTGCGC GGCATGTACA TCCTCGGCGA GAACCCGGCG ATGTCCGACC CCGACCTGCA CCACACCCGC GCCGCGCTCG CCAGGCTCGA GCACCTCGTC GTGCAGGACC TCTTCGTCAC CGAGACCGCG CAGTTCGCCG ACGTGATCCT GCCCGCCTCG GCCTGGCCCG AGAAGGACGG CACCGTCACC AACACCAACC GCCAGATCCA GCTCGGCCGC GCCGCCGTGC CGCTGCCCGG CGAGGCGCGG CCCGACTGGT GGATCCTCCA GCAGCTCGCC CGCTGCCTCG GCCTCGACTG GCAGTACGCG CACCCGCGCG AAGTCTTCGC CGAGATGAAG CGGGCGATGC GCTCGCTCGA CCACATCGAC TGGGCGCGTC TGGAGCGCGA GGGAGCGGTC ACAACCCCTT GCCCGGCCGA GGACGCCCCG GGCAAGGACG TGGTCTTCGA CGACCGCTTC CCCACCGCGA GCGGACGCGC GCGCTTCCGC CCCACCGTGC CGCTGCCGCC CGACGAGCCG GTCGACGCCG CCTGGCCCAC GGTGCTGATC ACCGGCCGCC AGCTCGAACA CTGGCACACC GGCGCGATGA CGCGGCGCAG CGCGGTGCTC GACGCGCTCG AACCGGCCGC GGTGGCGACG CTGGCCCCGG CCGAACTCAC CCGACTCGGC CTCGCCCCCG GCGCCGCGCT GAGCATCGAG ACCCGCCGCG GCCGCATCAC GCTCGCCGCC CGCGCCGATC CGCTGATGCC CGCGGGCATG GTCTTCGTGC CCTTCTGCTA CGTCGAAGCC GCAGCCAACC TGCTCACCAA CCCGGCCCTC GATCCGTACG GAAAGATTCC GGAGTTCAAG TACGCTGCGT GCCGGCTCGC GCCCGCATGA
|
Protein sequence | MSRHDREDRS PTPQPPGLAT APDRAAAHDA LDQRADAFEI LLDGVAVQAF PGETLWKVAL RAGERIPHLC FKDAPGYRAD GNCRACMVEI EGERVLAASC MRAAQPGMVV RSASSARARQ AREGVLELLL EQQPARTHSP DRSSHFWELV EHMRLPAADR GPAAQQPDRT HPAIHVNLDA CITCGLCVRA CREVQVNEVI GLAHRGARAK IVFDFDDALG DSSCVACGEC VQACPTGALM PATLVDAQGR GDSACAERKV DSVCPYCGVG CQLTYHVRDE RILFVEGRNG PANENRLCVK GRFGFDYPNH RGRLTTPLIR RAGVPKGVDP AFDPANPLSH FRPASWDEAL DFAAAGLRRL LDTHGPGVLA GFGSAKCSNE EAWLFQKLVR TGFGSNNVDH CTRLCHASSV AALMECIGSG AVTASFMQAA HADVVILTGC NPTVNHPVAA TYFKQAAKRG TRLIVLDPRG LVLGHHAHRM VRFTPGSDVA LFNAMLNVIV GEGLCDRDFI AARTEGFEAL AAHVAPLTPE AMAPLCGVAP DEIRAIARLY ATAERAMIFW GMGISQHVHG TDNARCLIAL ALATGHVGRP GTGLHPLRGQ NNVQGASDAG LIPMVLPDYR PVGDAQYRAA FEELWATPLP AEPGLTVVET MDAIAAGRVR GMYILGENPA MSDPDLHHTR AALARLEHLV VQDLFVTETA QFADVILPAS AWPEKDGTVT NTNRQIQLGR AAVPLPGEAR PDWWILQQLA RCLGLDWQYA HPREVFAEMK RAMRSLDHID WARLEREGAV TTPCPAEDAP GKDVVFDDRF PTASGRARFR PTVPLPPDEP VDAAWPTVLI TGRQLEHWHT GAMTRRSAVL DALEPAAVAT LAPAELTRLG LAPGAALSIE TRRGRITLAA RADPLMPAGM VFVPFCYVEA AANLLTNPAL DPYGKIPEFK YAACRLAPA
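The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the whitespace might be stripped and a GC fraction computed; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first thirty bases of the gene are used as sample input. How the record's own GC content figure was derived is not stated, so this is not necessarily the same method.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three ten-base blocks of the gene sequence from the record.
fragment = clean_sequence("ATGAGCCGGC ACGATCGCGA GGACCGCAGC")
print(len(fragment), gc_fraction(fragment))
```

Passing the full gene sequence through the same two functions would give a value to compare against the GC content field listed in the record.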