Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2247 |
Symbol | |
ID | 7083679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2532318 |
End bp | 2534201 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699266 |
Product | Respiratory-chain NADH dehydrogenase domain 51 kDa subunit |
Protein accession | YP_002355882 |
Protein GI | 217970648 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.179824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCAG AACTCGAAAC CATCCTGGAG CGTCACCGGC GCGATCCGCT GCAGTTGCTG CAGATCCTGA TCGAGCTCCA GGCCCGTGAC GGCTGGCTGC CGCCGGCCAC GCTCTCCGCG CTGGCGGGCG CGCTCGGCAT CCCGCGCGCC CGCGTCGAGA GCACCGCGAG CTTCTACAGC TTCCTGCACA CCCGGCCGGC GGGCGAATAC CGCATCCTGT TCTCCGACAA CATCACCGAC CGCATGCTCG GCAACCAGGC GCTGATGCAG ACGCTGTGCG ACAAGCTCTG GCTGCAGCCG GGCAAGGTGT CGGAGGACGG CCTGGCGAGA GTGTCGACCA CCTCGTGCAC CGGCATGTGC GACCAGGGGC CGGCGCTGCT GGCCAACGGG CGCACGATCA CCCGCCTCAC GCTCGAGCGC ATCGACGAGA TGGCGCACCT GATCCGCCGC CGGGTGCCGG TGGGCGACTG GCCGCCGGAG TGGTTCCATG TCGAGGACAA GCTCCGCCGC CGCGACGTGC TGCTCGACCA TGGCCTGGCG CCGGGTGCGG CGCTCGCCGC GGCGCTCGAG CGCGGCCCCA CCGGGCTGCT GTGGGAGATC GAGCGCTCCG GACTGCGCGG CCGCGGCGGC GCGGGTTTCG GCAGCGACAT CAAGTGGCGC TCGTGCCGCG ACGCCTGGGG CGACGCGCAC TACGTGATCT GCAACGCCGA CGAGGGCGAG CCCGGCACCT TCAAGGACCG CGTGCTGCTG TCGAGCTACT TCGACCTGGT GGTCGACGGC ATGTGCATCG CCGGCATGGC GATCGGCGCC AACAAGGGCT TCATCTACCT GCGCGGCGAA TACCGCTACC TGCTCGACCG CCTGGAAACA CGCCTCGCCC AGCGCCGCGA ACAGAACCTG CTCGGGCGCA ACATCCTGGG CCGCGGCTTC TCCTTCGACA TCGAGATCCA CCTCGGCGCC GGCGCCTACA TCTGCGGCGA GGAATCCGCG CTGATCGAGT CGCTCGAAGG CAAGCCGGGC AAGCCGCGCA TCCGCCCGCC CTTCCCGGTC ACCAACGGCT ATCGCGGGCA GCCGACCACG GTGAACAACG TCGAGACGCT CGCCCTCGCC GCGCTGATCG CGGTGCGCGG CGGCGACTGG TTCCACGCCA TCGGCACCCC GGCCTCGACC GGCACCAAGC TGCTGTCGGT GTCGGGCGAC GTCGAGCGCC CGGGCATCTA CGAGTTTCCC TTCGGTGTCA CCGTGGCCGA GGTGCTCGAG GCCGCCGGCG CGCGCGACAC GCAGGCGGTG ACGAGCGCCG GCGCGGCCGG GCATTGCCTG GCTGCCGACG AGTTCGGTCG TCGCATCGCC TTCGAGGATG TCGCCACCGG CGGCTCGATC ATGGTGTTCG ACCGTTCCCG CGACATGTTC GAGGTGGCGC ACAACTTCGC CCACTTCTTC GCCCACGAGA GCTGCGGCTT CTGCACGCCC TGCCGCGTCG GCACCGCGGT CAACGCCCGC CTGCTGGACA AGCTCGCCGC CGACCACGGC TCGCCCTACG ACCTCGACGA GATCGCGAAG ATGCACCGCC TGATGCAGGG CGCCAGCCAC TGCGGCCTGG GCAACACGGC GACGATCGCG ATCGACGACA TGCTGGCCAA GTTCCGCCCC GCCTTCGAGC GCCGCCTGCA CTCGCCCGAC TACGAGCCCG CCTTCGACCT CGACGCGGCG CTGTCGCAGG CGCGCCGGAT GACCGGCCGC GACGACCCCG GCGCCCACCT CGGCGACAAC CACGAGGCCC TGGTCGAGGA GCGCCCCCAT GCGCAGGGCG CCACCACAGC GGGCGCCGCG CAGTCCGTCC CCTCCTCCGC CGAGGCCGCC TCGGCAAAGG ATCTGCAGCC ATGA
|
Protein sequence | MTAELETILE RHRRDPLQLL QILIELQARD GWLPPATLSA LAGALGIPRA RVESTASFYS FLHTRPAGEY RILFSDNITD RMLGNQALMQ TLCDKLWLQP GKVSEDGLAR VSTTSCTGMC DQGPALLANG RTITRLTLER IDEMAHLIRR RVPVGDWPPE WFHVEDKLRR RDVLLDHGLA PGAALAAALE RGPTGLLWEI ERSGLRGRGG AGFGSDIKWR SCRDAWGDAH YVICNADEGE PGTFKDRVLL SSYFDLVVDG MCIAGMAIGA NKGFIYLRGE YRYLLDRLET RLAQRREQNL LGRNILGRGF SFDIEIHLGA GAYICGEESA LIESLEGKPG KPRIRPPFPV TNGYRGQPTT VNNVETLALA ALIAVRGGDW FHAIGTPAST GTKLLSVSGD VERPGIYEFP FGVTVAEVLE AAGARDTQAV TSAGAAGHCL AADEFGRRIA FEDVATGGSI MVFDRSRDMF EVAHNFAHFF AHESCGFCTP CRVGTAVNAR LLDKLAADHG SPYDLDEIAK MHRLMQGASH CGLGNTATIA IDDMLAKFRP AFERRLHSPD YEPAFDLDAA LSQARRMTGR DDPGAHLGDN HEALVEERPH AQGATTAGAA QSVPSSAEAA SAKDLQP
|
| |