Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0362 |
Symbol | |
ID | 7084868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 410291 |
End bp | 412864 |
Gene Length | 2574 bp |
Protein Length | 857 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643697394 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_002354042 |
Protein GI | 217968808 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR00509] molybdopterin guanine dinucleotide-containing S/N-oxide reductases [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR02166] anaerobic dimethyl sulfoxide reductase, A subunit, DmsA/YnfE family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.196674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGG ACACACTCGC CGCGCCGATC AACCCGGCGC GCCGCGACTT CCTCAAGACC ACCGCGGCGG GCGCCGCCGC GGCCGCCGCC AGCAGCTTCG TGGAACTGGC AAGCACGGGC AAGGAGGCTT CGGCCTTCGC TTACGAGCCC TACTACACCG ACGACCAGCT CACCACCATG GTGACCAGCT GCGCGCACAA CTGCGGCTCC CGCCACATGC TGGTCGCGCA CAAGAAAGGA GACGTGATCG TTCGCCTGTC GTCCGATGAC GGCCGCTACC AGGCAGGCGG CGCCTTTGGT TTCGAGACCG AGGCGGTGCC GCAGCTGCGC GCCTGCTTGC GCGGACGCTC TTACCGCGCG CGGCTCTACT CCGCCGAGCG CCTGCTCTAC CCGATGGTGC GCGTGGGCGA ACGCGGGGAG GGCCGCTTCA AGCGCGTGTC GTGGGACGAA GCGCTCGACC GCATCGCCAA GAAGATGATC GAGCTGAAGG ACACCTACGG ACCGACGGCC ATCCTCGACC AGTCCTATGC CGGCGCCTCC TACGGTGTCC TGCACAAATC GGATCAGATC GAGGGCTTGC TGGCACGTTT CCTCGGGATG TTCGGTTGCC GAACCAGCTC CTGGTCCGTG CCAAGCTACC AGGGCACAAC CTTCTCCAGC CGCACCACGT TCGGCACCAT CGAGGACGGC AACGAGGACG ACGCCTTCGC GCACTCGAAG CTCATCATCA TGTGGGGCTG GAACCCGGCC TACACCTTCC ACGGCGGCAA CACGTTCCAC TACATGCGGC TGGCCAAGCA GCGCGGCTGC AAGTTCGTCG TGGTCGACCC CCAGTACACG GATTCGGCGT CGGCCTATGA CGCCTGGTGG ATTCCCATCC GCCCCAATAC CGACGCGGCG ATGCTGGCCG CGATGGCGCA CTACATCTTT GTCAATGAGC TGCAGGACCA GGCCTTCATC AACCGTTTCT GCCTGGGCGT GGACGCGGGC ACGATGCCGG ACTGGGCGCG CGGGCACGAG AACTTCAAGG ACTACATCCT CGGCACCTAC GACGGGCAGC CGAAGACGCC GGAGTGGGCT GCCGAGATCT GCGGCGTGGC GGCAGACGAC ATCCGCAAGC TTGCCGACAT GTACGCCCGC ACCAAGCCGG CGGCGCTGAA GGCCTCGTGG TCACCCGGGC GCAACGCCTA CGGCGAGCAG TACAACCGCA TGGCGGCGGC CCTGCAGGCA ATGACCGGCA ACATCGGCAT CCTCGGCGGC TGTGCGGAAG GCGTCGGCAA GGGCTGGCAC GCGGAAGCGG TGGCCTACCC CTATGACGAG TACGCCAACG TGTGGTCGGC TTCCATCAAG TCCGACCGCT GGGCGCATTG CGTACTCAAC TACCCCGACG TGCGGCGCGA GGAGATCGGC CTGTGGCCGC GCGCTGACCA GTTCGACGGC GTCATCCCGA ACATCAAGGC GATCTGGTGG CATGGCTCGG ACTGGTTCAA TCAGCTGACC AACATCAACA AGGAAATCGA AGCCATCCGC AAGCTCGAGC TGGTGGTATG CGCCGACTCC ACGATCACGC CCTCGGGGCT GTGGGCCGAC ATCCTGCTGC CGGTGGCCAC GCACTTCGAA CGCCACGATG TCGCCTTGCC TTGGTACAAG GGGCACTACT ACATCCATCG GCCCAAGGTC ATCGCCCCGC TCGGCGAGTC CAAGAGCGAC TTCCAGATCT TCACCGAGCT CGCCTATCGC CTCGAGGCCC TCGACCCGGC CCGACTGTCA GGTTTCGGCA AGCGCTACAA CCCCAAGGCC GACCGCAGCT ACTTCGAGAA CGACGATCCG GTAGACGAGG CCTATCTGGT GGACTGGTGG CACCGGGTGC AATCGCACCA AGGGGTCACG ATGAGCTGGG AAGAATTCAA GCAGCGCGGG GTATACAAGT TCGAATTCGC GCAACCGCAC GTAGCGTTCC GCGACCAGAT CGAAAAAGGG GTGCCGTTCA ACACCCCGTC GGGGCGGATC GAGATCTTCT CCACCACGCT GGCCCAGGTC AGCGACTGGA CCAAGACCCA GTACGGTTAC CCGATCCCGG CGATCCCGAA GTGGATCGAG CCCTTCGAGT GGCTGGGCGA TACCGGCAAG GCCGCGAAGC ATCCCTTCCA CCTGATCTCT CCGCACCCGC GCTGGCGCAC CCACAGCATC TTCAACAACA TTCCCTGGCT GCGGGAGACC TACCAGCAAG AGGTCACCAT CGCAGCGCGC GATGCGGAGC GGCTCGGCAT CCGCACCGGT GACGTAGTGG AGGTGTGGAA CGAACGCGGC AAGGTGGTGG TGCCGGCCTA TGTGACCGAA CGCTGCATGC CGGGCGTCCT CGTGCTGCAT GAGGGCGCGT GGATGGATCT CGACGACAAG GGCGTCGATC GCGCGGGTAA CCCGGATTTT CTGACCGACG ACCAACCCTC GCCTGCCGGG GCATTCGCCT ACAACACCGT GCTGGCCGAC GTGAAGAAGA CGACGCTGGA GCATCGCCCA GGCTGGGATT CGCTCGCTAC CGCGCGCTCG ATCGTTTTCC GACGCGACTA TTGA
|
Protein sequence | MNKDTLAAPI NPARRDFLKT TAAGAAAAAA SSFVELASTG KEASAFAYEP YYTDDQLTTM VTSCAHNCGS RHMLVAHKKG DVIVRLSSDD GRYQAGGAFG FETEAVPQLR ACLRGRSYRA RLYSAERLLY PMVRVGERGE GRFKRVSWDE ALDRIAKKMI ELKDTYGPTA ILDQSYAGAS YGVLHKSDQI EGLLARFLGM FGCRTSSWSV PSYQGTTFSS RTTFGTIEDG NEDDAFAHSK LIIMWGWNPA YTFHGGNTFH YMRLAKQRGC KFVVVDPQYT DSASAYDAWW IPIRPNTDAA MLAAMAHYIF VNELQDQAFI NRFCLGVDAG TMPDWARGHE NFKDYILGTY DGQPKTPEWA AEICGVAADD IRKLADMYAR TKPAALKASW SPGRNAYGEQ YNRMAAALQA MTGNIGILGG CAEGVGKGWH AEAVAYPYDE YANVWSASIK SDRWAHCVLN YPDVRREEIG LWPRADQFDG VIPNIKAIWW HGSDWFNQLT NINKEIEAIR KLELVVCADS TITPSGLWAD ILLPVATHFE RHDVALPWYK GHYYIHRPKV IAPLGESKSD FQIFTELAYR LEALDPARLS GFGKRYNPKA DRSYFENDDP VDEAYLVDWW HRVQSHQGVT MSWEEFKQRG VYKFEFAQPH VAFRDQIEKG VPFNTPSGRI EIFSTTLAQV SDWTKTQYGY PIPAIPKWIE PFEWLGDTGK AAKHPFHLIS PHPRWRTHSI FNNIPWLRET YQQEVTIAAR DAERLGIRTG DVVEVWNERG KVVVPAYVTE RCMPGVLVLH EGAWMDLDDK GVDRAGNPDF LTDDQPSPAG AFAYNTVLAD VKKTTLEHRP GWDSLATARS IVFRRDY
|
| |