Gene Information |
Locus tag | Tmz1t_2959 |
Symbol | |
ID | 7874349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3201259 |
End bp | 3204000 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643699880 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_002889935 |
Protein GI | 237653621 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
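The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3201259, 3204000)  # Start bp, End bp from the record
    print(length, "bp")
    print(protein_length(length), "aa")
```

With the Start bp and End bp listed for Tmz1t_2959, this reproduces the recorded Gene Length (2742 bp) and Protein Length (913 aa).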
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.791344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTCA GCAGCCCACA GGGCACAACC AAGGTCGCCA CCTACTGTTA CCAGTGCGTG GCGGGGCCGG ACCTGCTCAA GGTGAAGGTC GAGGACGGCG TGGCGACCGC CGTCGAGCCG AACTTCGACG CCGAGGACGT GCATCCGGCC GCGGGCCGGG TCTGCGTGAA GGCCTTCGGC CTCGTGCAGA AGACCTACAA CCCGAACCGC GTCCTGCAGC CGATGAAGCG CACCAACCCG AAGAAGGGGC GCCACGAGGA CCCGCGTTTC GTGCCCATCT CCTGGGATGA GGCGCTCGAC ACCATCGCCG CCAAGCTGCG CGGCATCCGC GAGACCGGCC TGCTCGACGC CTCGGGCTAC CCGCGCGTGG CGGCGAGCTT CGGCGGCGGC GGCACGCCCA CCGCCTACAT GGGCACCTTC CCCGCCCTGC TGGCGGCCTG GGGCCCGGTG GACATGAGCT TCGGCAGCGG CCAGGGCGTC AAGTGCTACC ACTCCGAGCA CCTCTACGGC GAGCTCTGGC ATCGCGCCTT CATCGTCTGT CCGGACACGC CGCGCAGCAA GTACATCCTG TCCTTCGGCA GCAACATCGA GGCCTCGGGC GGCGTATGCG GCGTTTGGCG CCACGCCGCC GCACGCGTCG AGCAGGGCGT CAAGCGCGTG CAGGTCGAGC CGCACCTGTC GGTCACCGGC GGTTGCTCCG CCGAGTGGCT GCCGATCAAG CCCAAGACCG ACGCCGCCTG CATGCACGCG ATGATCCACG TGATGCTGTT CGAGAACGCA CGCTCGCGGC TGGACATCGA TTTCCTCAAG CACATGACGG CGTCGCCCTA CCTGGTGGCG CCCAATGGCT TCTACCTGCG CGATCCCGAC ACCCGCAAGC CGCTGGTGTG GGACCTCAAG TCCGGCAAGG CCGTCGTCTT CGACACCCCC GGCATCGATC CCGCGCTCGA CGGCGACTTC CTCGCCTCCG GCCTCGAGGT GCTGCCCGAC CAGGAACTCG TCACCCATGA GGCCGCGCCG GTGCGCAGTG CCTTCGGCAA GCTGCTCGAC CACGAGCGCA CCTTCACCCC GGAGTGGGCG CAGGGCATCT GCGACGTACC GGCGGCGACC ATCCGCCGCA TCGCCAACGA ATACCTCGAC CATGCCGAGA TCGGCGCGAC GATCGAGATC GAGGGCCGCA CCCTGCCCTA TCGCCCGGTG GCGGTCACGC TCGGCAAGAC GGTGAATAAC GGCTGGGGCG GCTACGACTG CTGCTGGGCG CGCACGCTGA TGGCCTGCCT GGTCGGCGCG CTCGACGTGC CCGGCGGCAC CATCGGCACC GCGGTGCGCC TGAACCGCCC AGCCAACGAC CGCCAGAGCA GCGCCAAGCC GGGCGTCGAC GGGTTCATGG ACTATCCGTT CAACCCGACC GACAAGGAAA ACTGGATCTC GCGCCCGCAG ATCCGCAACG CCAACCGCAC CCTGGTGCCG CTGGTGGCCA ACTCGGCGTG GAGCGCCGCG CTCGGCCCCA CCCACCTGGC CTGGATGCAG CAGCGTCACG GCTTCGAGAG CTTCCCCGAG CCCACCCAGC CCGACGTCTG GTTCTTCTAC CGCACCAACC CGGTGATCTC GTTCTGGGAC ACGCCCCAGG TCGCCGAGGC GGTGGCGAAG TTCCCCTTCG TGGTCGCCTT CACCTACACC CGCGACGAGA CCAACCACTT CGCCGACATC CTGCTGCCCG ACTGCACCGA CCTCGAAGGC CTGCAGATGC TGCGCATCGG TGGCACCAAG TACGTCGAGC AGTTCTGGGA CCACCAGGGC 
TTCGCGCTGC GCCAACCGGT CGTGCCGACG CAGGGCGACA CCCGCGACTT CACCTGGATC AGCACCGAAC TGGCCCGCCG CGCGGGCATC CTCGAGCCGT ACAACAAGGC GATCAACCGC GGCGCCGCAG GCGTGCCGCT GAAGGGGCCG AACTACGATT TCTCGCTGCA GCCCGACCAC GCCTACGGGG TCGAGGAGAT CTGGGACGCA AGCTGCCGCG CGGCCAGCGC CGAGCTCACC GAAGGCGCCG AGGACCACGG GCTGGAGTGG TGGAAGGAGC ACGGCTTCCG CACCATCGAG TATCCGCGCC TGCAGTGGTA TCTCTATCCC CACCTGGTCG ACCAGGGGCT GCGCTTCGAG ATGCCCTACC AGGAGCGCAT CTACCGCATC GGCGAGGAGC TCGGCCGCCG CCTGCACGAA TCCGGCATCG ACTGGTGGGA CCGCCAGCTC ACCGAATACC GCCCGCTGCC CGAGTTCCAC GACTTCTCCC ACCTCATCAA GCACGCGGTG ATCTCCAACC TGGGCGGGCG CGACGAGGAC TTCCCGTTCT GGCTGCTAAC CGCGCGCAGC ATGCAGTACT CCTGGGGCGG CAACGTCAGC CTGCAGATGG TGCGCGAGGT GGCCGCGAAC GTGAAGGGCC ACCGCGGCGT GATCATGAAC CCGACGGCCG CACGCAAGCT CGGCATCGAG GACGGCGACC TCGTCGAGGT GCGCTCGCCG CTGCGCGAGA CCCGCGGTCG CGTCGTGCTG CGCCAGGGCA TCCGCCCCGA CACCCTGCTG ATGGTGGGTC AGTTCGACCA CTGGATCACG CCCTACGCGA AGGACTTCGA CGTCCCCAGC ATGAACGCCC TGGTGCCGAT GCTGATGGAC CTGACCGACG CCACCGGTTC GGCCGCCGAC ATCGTGCCGG TGTCGATCAA GCGCATCGGA GGTGCCCAAT GA
|
Protein sequence | MTVSSPQGTT KVATYCYQCV AGPDLLKVKV EDGVATAVEP NFDAEDVHPA AGRVCVKAFG LVQKTYNPNR VLQPMKRTNP KKGRHEDPRF VPISWDEALD TIAAKLRGIR ETGLLDASGY PRVAASFGGG GTPTAYMGTF PALLAAWGPV DMSFGSGQGV KCYHSEHLYG ELWHRAFIVC PDTPRSKYIL SFGSNIEASG GVCGVWRHAA ARVEQGVKRV QVEPHLSVTG GCSAEWLPIK PKTDAACMHA MIHVMLFENA RSRLDIDFLK HMTASPYLVA PNGFYLRDPD TRKPLVWDLK SGKAVVFDTP GIDPALDGDF LASGLEVLPD QELVTHEAAP VRSAFGKLLD HERTFTPEWA QGICDVPAAT IRRIANEYLD HAEIGATIEI EGRTLPYRPV AVTLGKTVNN GWGGYDCCWA RTLMACLVGA LDVPGGTIGT AVRLNRPAND RQSSAKPGVD GFMDYPFNPT DKENWISRPQ IRNANRTLVP LVANSAWSAA LGPTHLAWMQ QRHGFESFPE PTQPDVWFFY RTNPVISFWD TPQVAEAVAK FPFVVAFTYT RDETNHFADI LLPDCTDLEG LQMLRIGGTK YVEQFWDHQG FALRQPVVPT QGDTRDFTWI STELARRAGI LEPYNKAINR GAAGVPLKGP NYDFSLQPDH AYGVEEIWDA SCRAASAELT EGAEDHGLEW WKEHGFRTIE YPRLQWYLYP HLVDQGLRFE MPYQERIYRI GEELGRRLHE SGIDWWDRQL TEYRPLPEFH DFSHLIKHAV ISNLGGRDED FPFWLLTARS MQYSWGGNVS LQMVREVAAN VKGHRGVIMN PTAARKLGIE DGDLVEVRSP LRETRGRVVL RQGIRPDTLL MVGQFDHWIT PYAKDFDVPS MNALVPMLMD LTDATGSAAD IVPVSIKRIG GAQ
|
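The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions: it uses the standard bacterial codon assignments (translation table 11, whose sense-codon assignments match the standard code), strips the whitespace that groups the sequence into 10-base blocks, and stops at the first stop codon. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not functions from an existing library.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base grouping and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGACCGTCA GCAGCCCACA GGGCACAACC"))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MTVSSPQGTT`, which matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.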