Gene Tmz1t_2823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2823 
Symbol 
ID7873231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3052158 
End bp3055007 
Gene Length2850 bp 
Protein Length949 aa 
Translation table11 
GC content73% 
IMG OID643699744 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_002889799 
Protein GI237653485 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.156592 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGGC ACGATCGCGA GGACCGCAGC CCCACGCCCC AGCCCCCCGG CCTGGCGACC 
GCGCCGGATC GTGCGGCAGC ACACGACGCG CTCGACCAGC GTGCGGACGC CTTCGAGATC
CTCCTCGACG GCGTCGCGGT GCAGGCTTTC CCCGGCGAGA CGCTGTGGAA GGTCGCACTG
CGCGCCGGTG AACGCATCCC CCACCTGTGT TTCAAGGACG CGCCCGGCTA CCGCGCCGAC
GGCAACTGCC GCGCCTGCAT GGTCGAGATC GAGGGCGAGC GCGTGCTCGC CGCCTCCTGC
ATGCGCGCGG CGCAGCCGGG CATGGTGGTA CGCAGCGCCA GCTCGGCGCG GGCACGGCAG
GCGCGCGAGG GTGTGCTCGA ACTGCTGCTC GAACAGCAGC CGGCACGCAC GCACAGCCCC
GACCGCTCCA GCCATTTCTG GGAGCTGGTC GAGCACATGC GTTTGCCCGC CGCCGACCGC
GGCCCCGCAG CGCAGCAGCC GGACCGCACC CACCCCGCGA TCCACGTGAA CCTGGACGCC
TGCATCACCT GCGGCCTGTG CGTGCGCGCC TGCCGCGAGG TGCAGGTGAA CGAGGTCATC
GGCCTCGCCC ACCGCGGTGC ACGCGCGAAG ATCGTCTTCG ACTTCGACGA TGCGCTTGGC
GACAGCAGCT GCGTGGCCTG CGGCGAGTGC GTGCAGGCCT GTCCCACCGG CGCGCTGATG
CCCGCCACCC TGGTCGATGC GCAGGGCCGC GGCGACTCGG CCTGCGCCGA GCGCAAGGTG
GATTCGGTGT GCCCCTACTG CGGCGTGGGC TGCCAGCTCA CCTACCACGT CCGGGACGAG
CGCATCCTCT TCGTCGAGGG CCGCAACGGC CCGGCCAACG AGAACCGCCT GTGCGTGAAG
GGCCGCTTCG GCTTCGATTA CCCCAACCAC CGCGGCCGCC TCACGACGCC GCTGATCCGC
CGCGCGGGCG TGCCCAAGGG CGTGGATCCC GCCTTCGATC CGGCGAATCC GCTCAGCCAC
TTCCGCCCGG CGAGCTGGGA CGAGGCGCTC GACTTCGCCG CCGCCGGCCT GCGCCGCCTG
CTCGACACCC ACGGCCCCGG CGTGCTCGCC GGCTTCGGCA GCGCCAAGTG CTCGAACGAG
GAGGCCTGGC TGTTCCAGAA GCTGGTGCGC ACGGGCTTCG GCTCCAACAA CGTCGACCAC
TGCACCCGGC TGTGCCACGC CAGCTCGGTC GCCGCGCTGA TGGAGTGCAT CGGCTCGGGC
GCGGTCACGG CCTCCTTCAT GCAGGCCGCG CACGCGGACG TGGTCATCCT CACCGGCTGC
AACCCGACGG TGAACCACCC GGTCGCCGCC ACGTACTTCA AGCAGGCCGC CAAGCGCGGC
ACCAGGCTGA TCGTGCTCGA CCCGCGCGGC CTGGTGCTCG GCCACCACGC CCACCGCATG
GTGCGCTTCA CGCCGGGCAG CGACGTCGCG CTCTTCAACG CCATGCTCAA CGTCATCGTC
GGCGAGGGCC TGTGCGACCG CGACTTCATC GCCGCGCGCA CCGAGGGCTT CGAGGCGCTC
GCCGCCCACG TCGCACCGCT CACGCCCGAG GCCATGGCGC CGCTGTGCGG GGTGGCGCCG
GACGAGATTC GCGCCATCGC CCGCCTCTAC GCCACCGCCG AGCGCGCGAT GATCTTCTGG
GGCATGGGCA TCTCGCAGCA TGTGCACGGC ACCGACAACG CGCGCTGCCT GATCGCGCTC
GCGCTCGCCA CCGGCCATGT CGGCCGCCCC GGCACCGGCC TGCATCCGCT GCGCGGGCAG
AACAACGTGC AGGGGGCGTC CGACGCCGGC CTCATCCCCA TGGTGCTGCC CGACTACCGC
CCGGTGGGCG ACGCGCAGTA CCGCGCCGCC TTCGAGGAAC TGTGGGCGAC GCCGCTCCCC
GCCGAGCCCG GCCTCACGGT CGTCGAGACC ATGGACGCGA TTGCCGCCGG CCGGGTGCGC
GGCATGTACA TCCTCGGCGA GAACCCGGCG ATGTCCGACC CCGACCTGCA CCACACCCGC
GCCGCGCTCG CCAGGCTCGA GCACCTCGTC GTGCAGGACC TCTTCGTCAC CGAGACCGCG
CAGTTCGCCG ACGTGATCCT GCCCGCCTCG GCCTGGCCCG AGAAGGACGG CACCGTCACC
AACACCAACC GCCAGATCCA GCTCGGCCGC GCCGCCGTGC CGCTGCCCGG CGAGGCGCGG
CCCGACTGGT GGATCCTCCA GCAGCTCGCC CGCTGCCTCG GCCTCGACTG GCAGTACGCG
CACCCGCGCG AAGTCTTCGC CGAGATGAAG CGGGCGATGC GCTCGCTCGA CCACATCGAC
TGGGCGCGTC TGGAGCGCGA GGGAGCGGTC ACAACCCCTT GCCCGGCCGA GGACGCCCCG
GGCAAGGACG TGGTCTTCGA CGACCGCTTC CCCACCGCGA GCGGACGCGC GCGCTTCCGC
CCCACCGTGC CGCTGCCGCC CGACGAGCCG GTCGACGCCG CCTGGCCCAC GGTGCTGATC
ACCGGCCGCC AGCTCGAACA CTGGCACACC GGCGCGATGA CGCGGCGCAG CGCGGTGCTC
GACGCGCTCG AACCGGCCGC GGTGGCGACG CTGGCCCCGG CCGAACTCAC CCGACTCGGC
CTCGCCCCCG GCGCCGCGCT GAGCATCGAG ACCCGCCGCG GCCGCATCAC GCTCGCCGCC
CGCGCCGATC CGCTGATGCC CGCGGGCATG GTCTTCGTGC CCTTCTGCTA CGTCGAAGCC
GCAGCCAACC TGCTCACCAA CCCGGCCCTC GATCCGTACG GAAAGATTCC GGAGTTCAAG
TACGCTGCGT GCCGGCTCGC GCCCGCATGA
 
Protein sequence
MSRHDREDRS PTPQPPGLAT APDRAAAHDA LDQRADAFEI LLDGVAVQAF PGETLWKVAL 
RAGERIPHLC FKDAPGYRAD GNCRACMVEI EGERVLAASC MRAAQPGMVV RSASSARARQ
AREGVLELLL EQQPARTHSP DRSSHFWELV EHMRLPAADR GPAAQQPDRT HPAIHVNLDA
CITCGLCVRA CREVQVNEVI GLAHRGARAK IVFDFDDALG DSSCVACGEC VQACPTGALM
PATLVDAQGR GDSACAERKV DSVCPYCGVG CQLTYHVRDE RILFVEGRNG PANENRLCVK
GRFGFDYPNH RGRLTTPLIR RAGVPKGVDP AFDPANPLSH FRPASWDEAL DFAAAGLRRL
LDTHGPGVLA GFGSAKCSNE EAWLFQKLVR TGFGSNNVDH CTRLCHASSV AALMECIGSG
AVTASFMQAA HADVVILTGC NPTVNHPVAA TYFKQAAKRG TRLIVLDPRG LVLGHHAHRM
VRFTPGSDVA LFNAMLNVIV GEGLCDRDFI AARTEGFEAL AAHVAPLTPE AMAPLCGVAP
DEIRAIARLY ATAERAMIFW GMGISQHVHG TDNARCLIAL ALATGHVGRP GTGLHPLRGQ
NNVQGASDAG LIPMVLPDYR PVGDAQYRAA FEELWATPLP AEPGLTVVET MDAIAAGRVR
GMYILGENPA MSDPDLHHTR AALARLEHLV VQDLFVTETA QFADVILPAS AWPEKDGTVT
NTNRQIQLGR AAVPLPGEAR PDWWILQQLA RCLGLDWQYA HPREVFAEMK RAMRSLDHID
WARLEREGAV TTPCPAEDAP GKDVVFDDRF PTASGRARFR PTVPLPPDEP VDAAWPTVLI
TGRQLEHWHT GAMTRRSAVL DALEPAAVAT LAPAELTRLG LAPGAALSIE TRRGRITLAA
RADPLMPAGM VFVPFCYVEA AANLLTNPAL DPYGKIPEFK YAACRLAPA