Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3709 |
Symbol | |
ID | 7873708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4074414 |
End bp | 4077326 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700655 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_002890679 |
Protein GI | 237654365 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGACGA AGAAATCCGC GACCGCCGCC ACCAGCGCGC GTCGCCTGCG CGGCGCCGCC GCGCGCCAGC TCGGCCAGAC CATGGACCGC CGCGCCTTCC TGAAGCGCTC CGGCCTGGGC GTGGGCGCCG GCGCGCTCGC CACCCAGCTG CCCTACAACT TCATCGGCGC CGCCGACGCC GCGGCGCCCG CGGGCGTGTC GCGCGCCGAG GGCAAGGCCG AGATCCGCCG CACCGTGTGC ACCCACTGCT CGGTGGGCTG TGCCGCCGAC GCGGTGGTGC GCAACGGCGT GTGGGTGCAT CAGGAGCCGG TGTTCGACTC GCCGATCAAC CTCGGGTCGC ACTGCGCCAA GGGCGCCGCG CTGCGCGAGC ATGGCCACGG TGAATACCGC CTGAAGTATC CGATGAAGCT GGTCGACGGC AAGTACCAGA AGATCTCGTG GGAGCAGGCG CTCAACGAGG TCGGCGACCG CCTGCTGAAG ATCCGCGAGG AGTCCGGCCC CGACGCGGTG TATTTCATCG GATCGTCGAA ACACAACAAC GAGCAGGCCT ACCTGCTGCG CAAGTTCGTC TCCTTCTGGG GCACCAACAA CACCGACCAC CAGGCGCGCA TCTGCCACTC CACCACCGTG GCGGGCGTGG CGAACACCTG GGGCTACGGC GCGATGACGA ACTCCTACAA CGACATGCAG AACGCCAAGG CGATGCTGTT CATCGGCTCG AACGCGGCCG AGGCGCACCC GGTGTCCTTG TTGCACATCC TGCACGCCAA GGAAAACGGC GCCAAGATGA TCGTCGTTGA CCCGCGCTTC ACCCGCACCG CGGCCAAGGC CCACCAGTAC ATCCGCATCC GCTCCGGCAC CGACGTTCCC TTCCTGTTCG GCCTGCTGTA CCACATCTTC CAGAACGGCT GGGAAGACAA GCAGTACATC GACGACCGCG TCTTCGGCAT GGACAAGATC CGCGACGAAG TGCTCGCCAA GTGGACGCCG GACAAGGTGG AAGAAGCCTG CGGTGTGCCC GCGGAGCAGA TGCTGCTCGC CGCCCGCACC ATGGCCGAGA ACCGCCCGAG CACCATCGTC TGGTGCATGG GCCAGACCCA GCACTCGATC GGCAACGCGA TGGTGCGCGC CTCCTGCATC CTGCAGCTCG CGCTCGGCAA CATCGGCAAG TCGGGCGGCG GCGCCAACAT CTTCCGCGGC CACGACAACG TGCAGGGCGC GACCGACGTC GGCCCCAACC CCGACTCCCT GCCCGGCTAC TACGGCCTCG CCACCGGCTC GTGGAAGCAC TGGGCGAACG TGTGGGGCGT GGACTACGAG TGGGTCAAGG GTCGCTACGT CTCGCAGGAG CTGATGGAGA AGTCCGGCAT CACCGTGTCG CGCTGGATCG ACGGCGTGCT CGAGAACAAG GACCTCATCG ACCAGCCCGC CGGCAACCTG CGCGCGGTGG TGTATTGGGG CCATGCGCCC AACTCGCAGA CGCGCGGCGC CGAGATGGTC GAGGCGATGA AGAAGCTCGA CACCCTGGTG GTGATCGATC CCTATCCGTC GGCCACCGCG GCGATGGCGG CGATGGTGCG GAAGGACGGC GTGTACCTGC TGCCGGCCTG CACCCAGTTC GAGACCCACG GCTCGGCGAC CGCGTCGAAC CGCTCGATCC AGTGGCGCGA GAAGGTCATC GAGCCCCTCT TCGAGTCCAA GCCCGATCAC ACCATCATGT ACGCCTTCGC GAAGAAGTTC GGCTGGGCCG AGCAGTTCGT GAAGAACATC AAGCTCGAGA AGGATGCCCA GGGCTGGGAC GAGCCGAGCA TCGAGGACAC GCTGCGCGAG ATCAACCGCG GCACCTGGAC GATCGGCTAC ACCGGCCAGT CGCCCGAGCG CCTCAAGCTG CACATGAAGA ACATGCACAC CTTCGACGTC AAGACGCTGA AGGCGGTGGG CGGCCCCTGC GACGGCGAGT ACTTCGGCCT GCCGTGGCCG TGCTACGGCA CGCCCGAGAT GAAGCACCCC GGCACGCCCA ACCTCTACGA CACCTCCAAG CACGTGATGG ACGGCGGTGG CAATTTCCGC GCCAACTTCG GCGTGGAGAA GGACGGTGTG TCGCTGCTCG CCGGCGCGGG CTCGGCCTCC AAGGGCGCCG ACCTGCAGAT GGGCTACCCC GAGTTCGACC ACGTGCTGTT GAAGAAACTG GGCTGGTGGG ACGAGCTCAC CGACGCCGAG AAGGCTGCGG CCGAGGGCAA GAACTGGAAG ACCGACCCCT CGGGCGGAAT CATCCGCGTG GCCATGAAGA ACCACGGCTG CCACCCCTTC GGCAACGCCA AGGCGCGCGC GGTGGTGTGG AACTTCCCCG ACCCCATCCC GCAGCACCGC GAGCCGATCT ACTCGCCGCG CCCGGACCTG GTCGCCAAGT ACCCGACCCA CGACGACAAG ATGAAGTTCT GGCGCCTGCC TACCCTGTAC AAGTCGATGC AGGAGAAGGT GAAGGACATC TCGAAGGACT ACCCGCTGGT GATGACCTCC GGACGCCTGG TCGAGTACGA GGGCGGCGGC GAGGAGACCC GCTCCAACCC CTGGCTGGCC GAGCTGCAGC AGGAGATGTT CGCCGAGGTG AATCCGAAGG ACGCCAACGA CGCGGGCTTC CGCAATGGCG AGTACATCTG GGTCGAGTCG CCCACCAAGG CGAGGCTCAA GGTGCGCGCG CAGGTCACCC AGCGCGTGGC GCCGGGCACG GTCTTCCTGC CTTTCCACTT CTCCGGCTGG TGGCAGGGCA AGGACATGCT CGAGTTCTAC CCCGCGGGTG CGGCCCCGGT CGTGCGCGGC GAGGCGGTCA ACACCGCCAC CACCTACGGC TACGACTCGG TGACCATGAT GCAGGAAACC AAGACCACCC TGTGCCGCGT GAGCAAGGCC TGA
|
Protein sequence | MLTKKSATAA TSARRLRGAA ARQLGQTMDR RAFLKRSGLG VGAGALATQL PYNFIGAADA AAPAGVSRAE GKAEIRRTVC THCSVGCAAD AVVRNGVWVH QEPVFDSPIN LGSHCAKGAA LREHGHGEYR LKYPMKLVDG KYQKISWEQA LNEVGDRLLK IREESGPDAV YFIGSSKHNN EQAYLLRKFV SFWGTNNTDH QARICHSTTV AGVANTWGYG AMTNSYNDMQ NAKAMLFIGS NAAEAHPVSL LHILHAKENG AKMIVVDPRF TRTAAKAHQY IRIRSGTDVP FLFGLLYHIF QNGWEDKQYI DDRVFGMDKI RDEVLAKWTP DKVEEACGVP AEQMLLAART MAENRPSTIV WCMGQTQHSI GNAMVRASCI LQLALGNIGK SGGGANIFRG HDNVQGATDV GPNPDSLPGY YGLATGSWKH WANVWGVDYE WVKGRYVSQE LMEKSGITVS RWIDGVLENK DLIDQPAGNL RAVVYWGHAP NSQTRGAEMV EAMKKLDTLV VIDPYPSATA AMAAMVRKDG VYLLPACTQF ETHGSATASN RSIQWREKVI EPLFESKPDH TIMYAFAKKF GWAEQFVKNI KLEKDAQGWD EPSIEDTLRE INRGTWTIGY TGQSPERLKL HMKNMHTFDV KTLKAVGGPC DGEYFGLPWP CYGTPEMKHP GTPNLYDTSK HVMDGGGNFR ANFGVEKDGV SLLAGAGSAS KGADLQMGYP EFDHVLLKKL GWWDELTDAE KAAAEGKNWK TDPSGGIIRV AMKNHGCHPF GNAKARAVVW NFPDPIPQHR EPIYSPRPDL VAKYPTHDDK MKFWRLPTLY KSMQEKVKDI SKDYPLVMTS GRLVEYEGGG EETRSNPWLA ELQQEMFAEV NPKDANDAGF RNGEYIWVES PTKARLKVRA QVTQRVAPGT VFLPFHFSGW WQGKDMLEFY PAGAAPVVRG EAVNTATTYG YDSVTMMQET KTTLCRVSKA
|
| |