Gene Information |
Locus tag | Tmz1t_3537 |
Symbol | |
ID | 7873043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3875412 |
End bp | 3878489 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700478 |
Product | acriflavin resistance protein |
Protein accession | YP_002890508 |
Protein GI | 237654194 |
COG category | [V] Defense mechanisms |
COG ID | [COG0841] Cation/multidrug efflux pump |
TIGRFAM ID | |
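
The coordinates and lengths in the table above are mutually consistent, which can be checked with a little arithmetic. The following Python sketch assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information table for Tmz1t_3537.
length = gene_length_bp(3875412, 3878489)
print(length)                     # 3078
print(protein_length_aa(length))  # 1025
```

Both results agree with the Gene Length and Protein Length rows. The strand only affects the direction in which the interval is read, not its length.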
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0823297 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAACGGCT TGTACCGGCG GCTGATCGAC AATCACCCGC TCGCCAACAT CGCCTTCGTG ATGGTGCTGC TCGGCGGCCT GTTCAGTTAT CTCTCCATGC CGCGGGCGCA GGACCCCGAG ATCAACTTCA ACTGGGTCAG CATCGTCACC GCGCTGCCCG GCGCCTCGGC GGAGGACGTC GAGCGCGACC TCACCGGGCC GCTCGAGGAC GCACTCAAGC AGGTCAAGGA CATCCGCTTC AGCTCCTCGT CCAGCCGCGA GGGGGTGTCG TCCATCCTGG TGCGCTTCGA GGAGCTCCCC GAGCGGGTGT TCGACAAGCG CGTGAACGAC CTGCGCCGCG AGATCCAGAA CAAGGCCTCG GCAGAGCTGC CGCCGGACGC CAACGACCCC GTGGTGGTCG AGTTCACCAG CTCGAACGGC TTCCCGACCG CGATCCTGGT GCTCCATGGC GCCGGCGGCG AGACCCTGCG GCGCGCAGGC TTCGTGATGA AGAAGGAGCT CGAGGGCCTG GACGGCGTGG ACCAGGTGAT CGCCGCCGGC CTGGAAGATC CGCAGCTCCA CGTCGATTTT GATCCCGCCC GGGTAGCGGC GTCCGGGTTG TCGGCGCTGC AGGTCGCCGA CAGCGTGGCG GCCTGGTTCC GCAACACCCT CGCCGGCCGG GTGCGGGTGC AGGACCGCGA GTGGCTGGTG CGCATGGAGG GCAAGACGCC CGACCCCGAG GCGCTCGCCC AGGCTGCGGT GGTGACGCCG GCGGGCCGGG TGGCGCTCGA CGAGGTTGCC AGCGTGGCGC GCGGCCACGA GCGTCCGGGC CAGCTCGTGA GCTACAACGG CGCGCCGGCG CTGATGCTGT CGATCACCAA GGATGCCGAC GCCAACACGC TCGAGCTGGT CGAGCGCCTG CAGGACTTCG CTGCCCGGCG CAACCCGCTG CTCGCCGACC AGGGCGTGCA GCTGCGCCTG ATCGACGACC AGACCCACGC CACCCGCCAC GCGATCGGGG TGATGGAGTC GAACGCGGTG TTCGGCCTGT TCGCGGTGCT GGCGATGTGC TGGATCTTCC TCGGCTCGCG CCTGTCGGTG CTGGTCGGGC TGGGCGTGCC GTTCGCGCTT GCCGGCACCT TCATCGCGGT CGACCTGATC GGCTCCACGC TCAACCTCAC CGTGCTGCTC GGGGTCGTGA TCGCGCTCGG CATGCTGGTC GACGACGCGG TGGTGATCGT CGAGGCGATC TACTACCGCC TGCAGCGCGG CGATCCTCCG CACGAGGCGG TGGTGGCCGG GGTGGAGGAG GTCGGCCTGC CGGTGCTGTC CTCGGTGCTG ACCACCATCG CCGCCTTCCT GCCGCTGATG CTGCTGCCCG GCATCCTCGG CAAGTTCATG TTCGTGGTGC CCTTCGTGGT GACCGTCGGC TTGCTGGTCA GCCTGGTCGA GGCCTACTGG ATCCTGCCCA CCCACATCCT GGCGCTGCGC CCGCAGCCGG CGCAGCGGCC CTCGCGGTCG CAGCGGCTGC GCGAGCGCTT CACCCTGCGC ATCCGCATCG CCTATGCGCG TGCCCTGGTG GCGGTGATGC GCCGCCGCCG CCTCACCGTC GCGGTCCTGG TGCTGATGAT GGCGGGTGCC GGCGCGGCGG TGGCGACCGG CCTGGTGCGC GTGCAGTTCT TCGCCTTCGA CTCGATGCGG CTCTTCTATG TGAACGTCGA CATGCCTGCC GGTGTCGTGC CCCAGCGCAC GCTGGCCGAG GCCGAGCGCG CGGTGAAGGT GCTGGGCGAG ACGATCGGCG CCGACGAGCT GCGCGCGGTC 
GCGGTGTATG CGGGCCTGAA GTTCACCGAG ACCGAGCCGG TGTATGGCGA CCAGTACGCC CAGGTGCTGG TCTCGCTGCT GCCGCGCGCG GCCGGCGGGC GCGATACCGG CGAAATCATC GACGCGGTGC GTCCGGCGAT CGAGGCGCTC GCCGGCGAAG CGAAGCTCTC CTTCATGGAG ATGAAGGGCG GGCCGCCGAC GGCGAAGCCG ATCTCGGTGA AGATGAAGTC CGACGACTAC CCGCAGCTGC GCGCCGCGGC CGATGCGCTC CTCGCCGAGG CGGCAAAGCT GCCCGGGGTG CGCGATCTCA CCGACGACGA TGTGCCCGGG CGCGGCGAGA TGCGCCTCGC GCTCGATCGC GAGAAGCTGG CGCGTGCCGG GGTGCAGCCG GCCGAGGTGG CGCGCTTGCT GCGTTTGTAC GGCGAGGGCG AGATCGTTGC CACGACGCGC GACGGCGGCG AGAAGGTGGA GGTGGTGGTG CGCGCCCGCC AGGAGAGCCT GGATGACGTC ACCGGGCTGC TGCAGCGCCC GGTGCCGCTG GCCGACGGGC GCAGCGTGCA GCTGGGCGAG CTGGTGAGCC GGCAGCTGCT GCTGTCGAAG GGCTACATCC GCCACTACCA GCTCACGCGC GCGATCACCG TCGAGGCCGA GCTCGATCGC GAGCGCCTCG ACACGGTCGA AGCCAACGAC CGCCTGAAGG CCGCGTGGGC GCAGCTGCAG CCGCGCTTCC CGGGCGTGGA GCTGGATTTC TCCGGCGAAC TGGAGGACAT CCAGGAGAGT CTGGATGCGA TGGGCAAGCT GTTCGCGCTC GGCGTCGGGC TGATCTACCT GATCCTCGCC ACCCAGTTCC GCAGCTACTG GCAGCCGCTG ATCATCCTGC TGACCGTGCC GCTCGCCTTC ACCGGGGTGG TGCTCGGACT GCTGGTGTCG GGCAATCCGC TGTCGCTATA CACGCTGTAT GGGGTGATCG CGCTCACCGG CATCGCGGTG AACTCGGCGA TCGTGCTCAT CGACGCCGCC AACGAGCGCC GCGCCGTCGG CATGAGCGTG CAGCACGCCG CGATCTACGC CGCGCGCCGG CGCGTGGTGC CGATCCTGAT CACCTCCACC ACCACCATCG GCGGGCTGCT TTCGCTGGCG ATCGGCCTGG GCGGCAAGTC GCTGATGTGG GGACCGGTGG CGGCGAGCAT CGTGTGGGGG CTGGGTTTTT CCACGGTGCT GACCTTGTTC GCGGTGCCGC TGGTGTATCG GATGGCGATG CAGCGCGGTG GCGGCTAG
|
Protein sequence | MNGLYRRLID NHPLANIAFV MVLLGGLFSY LSMPRAQDPE INFNWVSIVT ALPGASAEDV ERDLTGPLED ALKQVKDIRF SSSSSREGVS SILVRFEELP ERVFDKRVND LRREIQNKAS AELPPDANDP VVVEFTSSNG FPTAILVLHG AGGETLRRAG FVMKKELEGL DGVDQVIAAG LEDPQLHVDF DPARVAASGL SALQVADSVA AWFRNTLAGR VRVQDREWLV RMEGKTPDPE ALAQAAVVTP AGRVALDEVA SVARGHERPG QLVSYNGAPA LMLSITKDAD ANTLELVERL QDFAARRNPL LADQGVQLRL IDDQTHATRH AIGVMESNAV FGLFAVLAMC WIFLGSRLSV LVGLGVPFAL AGTFIAVDLI GSTLNLTVLL GVVIALGMLV DDAVVIVEAI YYRLQRGDPP HEAVVAGVEE VGLPVLSSVL TTIAAFLPLM LLPGILGKFM FVVPFVVTVG LLVSLVEAYW ILPTHILALR PQPAQRPSRS QRLRERFTLR IRIAYARALV AVMRRRRLTV AVLVLMMAGA GAAVATGLVR VQFFAFDSMR LFYVNVDMPA GVVPQRTLAE AERAVKVLGE TIGADELRAV AVYAGLKFTE TEPVYGDQYA QVLVSLLPRA AGGRDTGEII DAVRPAIEAL AGEAKLSFME MKGGPPTAKP ISVKMKSDDY PQLRAAADAL LAEAAKLPGV RDLTDDDVPG RGEMRLALDR EKLARAGVQP AEVARLLRLY GEGEIVATTR DGGEKVEVVV RARQESLDDV TGLLQRPVPL ADGRSVQLGE LVSRQLLLSK GYIRHYQLTR AITVEAELDR ERLDTVEAND RLKAAWAQLQ PRFPGVELDF SGELEDIQES LDAMGKLFAL GVGLIYLILA TQFRSYWQPL IILLTVPLAF TGVVLGLLVS GNPLSLYTLY GVIALTGIAV NSAIVLIDAA NERRAVGMSV QHAAIYAARR RVVPILITST TTIGGLLSLA IGLGGKSLMW GPVAASIVWG LGFSTVLTLF AVPLVYRMAM QRGGG
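
As a spot check that the protein sequence follows from the gene sequence, the first ten codons can be translated in code. The sketch below is a minimal Python example, not the pipeline that produced this record. It assumes the usual TCAG-ordered codon table, whose amino acid assignments translation table 11 shares with the standard code, and it assumes that the initiator codon is read as M even when it is TTG, as it is here. The helper name translate_cds is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS (or its 5' end), ignoring the spaces used for display."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3          # drop any incomplete final codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if protein:
        protein[0] = "M"                      # assumption: initiator codon is read as Met
    return "".join(protein).rstrip("*")      # strip a terminal stop codon, if present


# First three 10-base groups of the Gene sequence row above.
print(translate_cds("TTGAACGGCT TGTACCGGCG GCTGATCGAC"))  # MNGLYRRLID
```

The output matches the first ten residues of the Protein sequence row. Note that the internal TTG at codon 4 is read as L, while the initial TTG is read as M.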