Gene Information |
Locus tag | Tmz1t_0199 |
Symbol | |
ID | 7084320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 232474 |
End bp | 235227 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643697241 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_002353890 |
Protein GI | 217968656 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
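The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon which is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(232474, 235227)
print(length, protein_length_aa(length))
```

Under these assumptions the result agrees with the Gene Length (2754 bp) and Protein Length (917 aa) fields.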
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.273799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCCG CCACCGCCCG CAACGAACGC AAGATTCCGA GCTACTGCTA CAACTGCGTC GCCGGCCCCG ACTTCATGAC CGTCAAGGTG ATCGACGGCG TGGCCACCGA GATCGAACCA AACTTCGCCG CCGCCGAGGT TCATCCCGCA CGCGGCCGCG TGTGCGTGAA GGCGCACGGC CTGGTGCAGA AGACCTACAA CCCGCACCGC ATCCTGCAGC CGATGAAGCG CACCAACCCG AAGAAGGGGC GCAACGAGGA TCCGGGCTTC GTGCCGATCT CCTGGGACGA GGCCCTCGAC ACCATCGCCG CCCGCCTCGC CGGGGTGCGC GAGAAGGGCC TCGTCGACGA CTCCGGCCTG CCGCGCGTGG CGGCCAGCTT CGGCCACGGC GGCACGCCGG CGATGTACAT GGGCACCCTG CCCGCCTTCC TCGCCGCCTG GGGGCCGATC GACTTCAGCT TCGGCTCCGG CCAGGGCGTC AAGTGCGTGC ACTCCGAGCA CCTCTACGGC GAGTTCTGGC ACCGCGCCTT CACCGTCGCC GCCGACACGC CCAACTGCCG CTACGTGATC TCGATCGGCA GCAACGTCGA CGCCTCCGGC GGCCCGTGCG CGGTCACCCG CCACGCCGAC GCGCGCGTGC GCGGCTACAA GCGCGTGCAG GTCGAGCCCC ACCTGTCGAT CACCGGCGCC TGTGCGTCCG AGTGGGTGCC GATCCGCCCC AAGACCGACC CCGCCTTCAT GTTCGCGCTG ATCCACGTGC TGGTGTGCGA GCACGGCCTC GGGCAGCTCG ACCTGCCCTT CCTGCGCGAC CGCACGTCCT CGCCCTACCT GGTCGGCCCC GACGGTCTCT ATCTGCGCGC CCCCGACAGC GCGAAGCCGC TGGTGTGGGA TCCCGCCGCC GGCCGCGCCG TGCCCTTCGA CACCCCGGGC GTCGAGCCCG CGCTCGAAGG CCGCTTCCGC GTCGCCGCGG CGGTGACCGT GGATGCCGAC GACGCCCGCC ATGCGCTCGT CGACGTGGAG GGCACGCCCG CCCACACGAT GCTGGTCGAG CACATGCGCA AATACACGCC GGAGTGGGCG GAGGGCATCT GCGACGTGCC CGCGACCACC ATCCGCCGCA TCGCGGGCGA GTACCTCGAG AACGCGCAGG TCGGCGCCAC CATCGAGGTG GACGGCGCCA CCCTGCCGCT GCGCCCGGTC GCGGTCACCC TCGGCAAGTC GGTGAATAAC GGCTGGGGCG CCTTCGAGTG CTGCTGGGCG CGCACGGTGC TCGCCACCCT GGTGGGCGCG CTCGAGGTGC CGGGCGGTAC CCTCGGCACC ACGGTGCGGC TGAACCGCCC GCACGACGAC CGCCACCTTA GCGTCGCCGC TGGCGAGGAC GGCTTCATGG CGCAGAAATT CAACCCCACC GACAAGGAGC ACTGGGTCGC CCGGCCCACC GGCCGCAACG CCCACCGCAC CCTGGTGCCG ATCGTCGGCA ACTCGGCATG GAGCCAGGCG CTCGGCCCCA CCCAGCTGGC GTGGATGTTC CAGCGCGAGG TGCCGCGCGA CTTCAACATG CCCAAGCCGA CCCTGCCCGA CATCTGGTTC ATCTACCGCT CCAACCCGGC GATCTCGTTC TGGGACACGC CCAGCCTGGT GGAGACGATC GCCACCTTCC CCTTCACCGT GTCCTTCGCC TACACGGTGG ACGAGACCAA CTTCTTCGCC GACCTGCTGC TGCCCGAGGC CACCGACCTC GAATCCCTGC AGATGATCAA GGTCGGCGGC ACCAAGTTCG TCGAGCAGTT CTGGACCGCG 
CGCGGCGTGG TGCTGCGCCA GCCGGCGGTC GAGCCGCAGG GCGAGGCGCG CGACTTCACC TGGATCAGCA CCGAGCTGGC GCGCCGCACC GGCCTGCTCG AGCCCTACAA CAAGGCGATC AACCGCGGCG CCGGCGGCGT CTCGCCGCTC GCCGGCGAGG GCTACGACTT CAGCCTCGAC CCCACGCGCA CGCACGACGT GGACACGATC TGGGACGCCA TCTGCCGCGC GGCGAGCACC GATCTCTCGC AGGGGCAGGA GACCCACGAC CTCGCCTGGT TCAAGGAGCA CGGCTTCTAC ACCGTGCCGA TGCCGCAGCG CTCGTGGTAC CTCACCCCCA CCCTGGCCGA GAAGGGCCTG CGCTACGAGC TGCCCTACCA GGAGCGCCTG CTGCGCATCG GCCGCGAGCT CGGCAACCGC CTGCACGAGC ACGACATGCA CTGGTGGGAC ACCCAGCTCT CCGAATACAC CGCCCTGCCC GAATGGCACG ACGTTCCCGG GCGCTGGGAG CAGGCCCTGG TCGACAGCGG CGCGAAGCCC GAGGACTACC CGCTGTGGCT GCTCGCCACC AAGAGCATGC AGTACCACAC CGGCGGCAAC GTCAGCATCG CGCTGATGCG CGAGGTGGCG CAGAACGTGC GGGGCCACGC CGGGGTGATC ATGAACGCCA ACACCGCCAG GCGCCTCGGC ATCGCCGACG GCGACCGCGT CGAGATCCGC TCGCACATCG GCGCCACCTA CGGCAAGGCG GTGCTCGCCC ACGGCATCCG CCCCGACACC CTGGTCATCC CGGGCCAGTT CGACCACTGG GCCACCCCGG TCGCCAAGGA TTTCGGCATG CCCAGCCTCA ACACCGTGGC GCCGATGTCG CTCGAACTCA CCGACGCCAC CGGCTCGGGC GCCGACATCG TGCGCGTGGC GATCCGCCGT CTCGAAGGGA GCACCGTGCA ATGA
|
Protein sequence | MTAATARNER KIPSYCYNCV AGPDFMTVKV IDGVATEIEP NFAAAEVHPA RGRVCVKAHG LVQKTYNPHR ILQPMKRTNP KKGRNEDPGF VPISWDEALD TIAARLAGVR EKGLVDDSGL PRVAASFGHG GTPAMYMGTL PAFLAAWGPI DFSFGSGQGV KCVHSEHLYG EFWHRAFTVA ADTPNCRYVI SIGSNVDASG GPCAVTRHAD ARVRGYKRVQ VEPHLSITGA CASEWVPIRP KTDPAFMFAL IHVLVCEHGL GQLDLPFLRD RTSSPYLVGP DGLYLRAPDS AKPLVWDPAA GRAVPFDTPG VEPALEGRFR VAAAVTVDAD DARHALVDVE GTPAHTMLVE HMRKYTPEWA EGICDVPATT IRRIAGEYLE NAQVGATIEV DGATLPLRPV AVTLGKSVNN GWGAFECCWA RTVLATLVGA LEVPGGTLGT TVRLNRPHDD RHLSVAAGED GFMAQKFNPT DKEHWVARPT GRNAHRTLVP IVGNSAWSQA LGPTQLAWMF QREVPRDFNM PKPTLPDIWF IYRSNPAISF WDTPSLVETI ATFPFTVSFA YTVDETNFFA DLLLPEATDL ESLQMIKVGG TKFVEQFWTA RGVVLRQPAV EPQGEARDFT WISTELARRT GLLEPYNKAI NRGAGGVSPL AGEGYDFSLD PTRTHDVDTI WDAICRAAST DLSQGQETHD LAWFKEHGFY TVPMPQRSWY LTPTLAEKGL RYELPYQERL LRIGRELGNR LHEHDMHWWD TQLSEYTALP EWHDVPGRWE QALVDSGAKP EDYPLWLLAT KSMQYHTGGN VSIALMREVA QNVRGHAGVI MNANTARRLG IADGDRVEIR SHIGATYGKA VLAHGIRPDT LVIPGQFDHW ATPVAKDFGM PSLNTVAPMS LELTDATGSG ADIVRVAIRR LEGSTVQ
|
| |
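The relationship between the two sequences can be illustrated by translating the start of the gene sequence. This is a minimal sketch, not the database's own method. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TGA, although the gene lies on the minus strand) and relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Compact codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACCGCCG CCACCGCCCG CAACGAACGC"))
```

The printed peptide can be compared with the first ten residues of the protein sequence, MTAATARNER.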