Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3472 |
Symbol | |
ID | 7872978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3803272 |
End bp | 3806574 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700412 |
Product | SNF2-related protein |
Protein accession | YP_002890443 |
Protein GI | 237654129 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCGC TCGACCGCTT CACCCAGGAT GACCTGATCG CCCAGCTCGG CATGGAAACC GTGGCCAAGG GGCTCGGCTA CCTGAGCCGG GTGAGTGCGC TGTCGGCCGA CGGCAGCGAG GTGTCGGCCC TGGTCAAGGG CCGGCAGCGC ACACCCTACG ACGTCGTCGC CGACGTCACC GAAGCGGACG GGCGGCCGAC CCTGGTCAGC AGATGCACCT GCCCGATGGG CTACGGCTGC AAGCATGTGG CGGCGATGAT GCTGGTGTGG CTGCACCAGC GCCGCCGCCC CGACCGCCCG CGCGAGCAGG TGCTGGCCTG GGTGAAGGGC TTTCGCGAGA CCGCGGACAG CCTCGCGAAG GATGGCACGC GCAAGCGCAG GCCGGCCTCG GTCAGCTATG CGCTGCGCTA TGTCGTCGAG CCCAGCCTGT ACGGCGGCGA CTTCAGCGTC AAGTGCCACA AGGTTCGCCT CGACCAGCAC GGCGAGATCC GCCAGCACGA GCCCTGGAGC AACATCGAGC GCGCGCTGCA GGCCCCGCCC GCCTTCGTGG ACGACACCGA CCTCGACCTG CTGCGCCTGC TGTGGGCGCA CCGCGCGCGC AGCCACTTCG ACGCCAGCGA CGGGCTGCCG CTCGAGGGCC GCCACAGCGA CGAGCTGATG CGCCGTCTGC TCGCCAGCGG CCGCGCGCAT TTCGGCAGCC TCGCCGGCCG CGTGCTGGAG TTGGGCGAGC CGCGTGCCGG CACGCCCGAC TGGACGATCA CCCCCGACGG GCTGCGCGCC CCGGTGCTGA GGGTGGAGCC CGCAGCCGAC CGCGTGCTGG CGCTGCTGCC GCCCTGGTAC ATCGACACCG CCCGGGCCGA GGGCGGTCCG ATCACGCTGC AGGTCTCCGC CGAGCTGGTG CAGCGCCTGC TCTGGCTGCC GGCGCTCACC CAGGCCGAAT CCGAGCTGGT CGCCGAGACC CTGGCCGAGG TCGCCCCCGC CCTGCCCGCG CCCACGCTGC GCGGCGAACC GCCGATCGAG CTCGGCGGCG CGCCGGTGCC GGTACTGCGC CTGGAAACCC GCCCAAGCTA CCACGTGCGC TTTCGCCAGT ACGCACCGCA CGGGACCTAT CTGTTCGACT TCGCCCAGCT CGCCTTCCGC TACGGTGCGG TCACCGTCGA GGCCGGCGAC GCCCGCCTCT TCCATCCCCT GCCCGACGGG CGCAGCGCGC GCCTGAGCCG CGACGAGGAG GCCGAGCGCG CCGCCGCGCA GCGCCTCGCC GCCGCCGGCG TCACCCGCAT CCCGCCGGGC GCGGTGCAGA CGAGCTTCGG CAGCCTGCCC GCCGATGCGC TCGGGCTGGC GAGCGAGGCG ACCTGGCCCG GCTTCATGGC CGAGACCGTG CCGGCCCTGC GCGCGGAAGG CTGGGAGGTG GCCTTCCCGA GCGACTTCCG CCACCACGCG ATCGACATCG ATCACCTGAT GCTCGACATC GACGAGCATC AGGACGGCTG GCTGGGCCTG TCGCCGGGCG TCGAGATCGA CGGCAAGGCC CTGCCGCTGG CGCCCCTGCT CTCCGGCCTG TTCGCGCACG ATCCGCGCTG GCTGTCCGGC CGCCTGGACG AGATCGACGA CCACGAGGCG GTGATCCTGG AGGACGAGAC GCTCGGCCGC CTGCGCATCG GCGCGGCGCG CATCAAGCCG CTGGTGCGCG CGCTGGTCGA CCTCTTCGAC CGCCCCGACC CCGACTGGCG CGTCTCGAAG CTCGACGCCA CCCGCCTGGC CGACCTCGAC CTGCCCGGCC GCGGCCGCGA TCAGCTCGCC GCGCTCGCAG CGCGCCTGCG CGACGCCGAG GGCATCCGCG CCGTCGCCGC GCCGGCGGGC TTCCGCGCCG AGCTGCGGCC CTACCAGCTC GAGGGCCTGG CCTGGCTGCA GCACCTGGTA CGGCACGACC TCGCCGGCAT CCTCGCCGAC GACATGGGCC TGGGCAAGAC CGCGCAGACG CTCGCCCACC TGCTCGTCGA GAAGCAGGCC GGCCGGCTCG ACCTCCCCGC CCTGGTGGTG CTGCCGACCT CGCTGGTGTT CAACTGGCAG GCCGAGGCCG AGCGCTTCGC ACCGGACCTG AAGGTGCTGA ACCTGCACGG CGCCGACCGC CACGGGCGCT TCGAGGAGCT GGAGGCCGGC GACGTCGGCG CCGGGAAGAT CGACATCGCC CTCACCACCT ACCCCCTGCT GTGGCGCGAC GCCGAGCTGC TGCAGGCGCG CGAGTGGAGC CTCTTGATCC TCGACGAGGC ACAGACGGTG AAGAACGCCG CCAGCAAGGG CGCGCAGGTG ATCCGCCAGT TGAAGGCGCG CCACCGCCTG GGCCTCACCG GCACGCCGCT GGAGAACCAC CTCGGCGAGC TGTGGGCGCA GTTCGACTTC CTGCTGCCGG GCTTTCTCGG CAGTCACAAG GACTTCACCG CGACCTGGCG CACGCCGATC GAGAAGCACG GCGACACGGT GCGCCGCGAC CTGCTCGCGG CGCGCCTGCG CCCCTTCATC CTGCGCCGGC GCAAGGAAGA CGTCGCCACC GAGCTGCCGC CCAAGACCAT CATCGTGCGC AGTGTCGCCC TGGAGGGCGG CCAGCGCGAC CTCTACGAGA CGGTGCGCGC GGCGATGGAC GAGAAGGTGC GCGCCGAGAT CGCCGGCAAG GGCTTCGCGC GCAGCCAGAT CGTGATCCTG GATGCACTGC TCAAGCTGCG CCAGGTGTGC TGCGATCCGC GCCTGTTGAA ATCGCCCGCC GCCTTGCGGG TGAAGGAGCG CGCCAAGCTC GACCTGCTCA TGGACATGCT GCCCGAACTG ATCGACGAGG GCCGGCGCAT CCTGGTGTTC TCGCAGTTCA CCACCATGCT CGGCCTGATC GCCGCCGAAC TGGACAAGGC AAAGATCGGC TGGGTGGCGC TGACCGGCGA CACCCGCGAC CGCCGCGTGC CGGTGGAGGA CTTCCAGAAG GGGCGCGTGC CGGTATTCCT GATCAGCCTA AAGGCGGGCG GCGTGGGCCT CAACCTCACC GCCGCCGACA CCGTGATCCA CTACGACCCG TGGTGGAACC CCGCGGCCGA GAACCAGGCC ACCGACCGCG CCCACCGCAT CGGCCAGGAC AAGCCGGTGT TCGTGTTCAA GCTGGTGTGC GCGGGCAGCA TCGAAGAGAA GATCCTCGCC CTGCAGGAGC GCAAGGCCGC CCTCGCCGAG AGCGTGCTGT CGGAGGACGC GGACGCGCTC GCCAAGTTCG GCGAGGCCGA CATCGCCGCC CTCCTCGCGC CGCTCCCCGC GGACGCGCGC TGA
|
Protein sequence | MTPLDRFTQD DLIAQLGMET VAKGLGYLSR VSALSADGSE VSALVKGRQR TPYDVVADVT EADGRPTLVS RCTCPMGYGC KHVAAMMLVW LHQRRRPDRP REQVLAWVKG FRETADSLAK DGTRKRRPAS VSYALRYVVE PSLYGGDFSV KCHKVRLDQH GEIRQHEPWS NIERALQAPP AFVDDTDLDL LRLLWAHRAR SHFDASDGLP LEGRHSDELM RRLLASGRAH FGSLAGRVLE LGEPRAGTPD WTITPDGLRA PVLRVEPAAD RVLALLPPWY IDTARAEGGP ITLQVSAELV QRLLWLPALT QAESELVAET LAEVAPALPA PTLRGEPPIE LGGAPVPVLR LETRPSYHVR FRQYAPHGTY LFDFAQLAFR YGAVTVEAGD ARLFHPLPDG RSARLSRDEE AERAAAQRLA AAGVTRIPPG AVQTSFGSLP ADALGLASEA TWPGFMAETV PALRAEGWEV AFPSDFRHHA IDIDHLMLDI DEHQDGWLGL SPGVEIDGKA LPLAPLLSGL FAHDPRWLSG RLDEIDDHEA VILEDETLGR LRIGAARIKP LVRALVDLFD RPDPDWRVSK LDATRLADLD LPGRGRDQLA ALAARLRDAE GIRAVAAPAG FRAELRPYQL EGLAWLQHLV RHDLAGILAD DMGLGKTAQT LAHLLVEKQA GRLDLPALVV LPTSLVFNWQ AEAERFAPDL KVLNLHGADR HGRFEELEAG DVGAGKIDIA LTTYPLLWRD AELLQAREWS LLILDEAQTV KNAASKGAQV IRQLKARHRL GLTGTPLENH LGELWAQFDF LLPGFLGSHK DFTATWRTPI EKHGDTVRRD LLAARLRPFI LRRRKEDVAT ELPPKTIIVR SVALEGGQRD LYETVRAAMD EKVRAEIAGK GFARSQIVIL DALLKLRQVC CDPRLLKSPA ALRVKERAKL DLLMDMLPEL IDEGRRILVF SQFTTMLGLI AAELDKAKIG WVALTGDTRD RRVPVEDFQK GRVPVFLISL KAGGVGLNLT AADTVIHYDP WWNPAAENQA TDRAHRIGQD KPVFVFKLVC AGSIEEKILA LQERKAALAE SVLSEDADAL AKFGEADIAA LLAPLPADAR
|
| |