Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0958 |
Symbol | |
ID | 7085061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1046690 |
End bp | 1049617 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643697980 |
Product | SNF2-related protein |
Protein accession | YP_002354620 |
Protein GI | 217969386 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.106346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATCTCCG CCTACCACGC CAAGTACTAC GCCCACGAGC TGACACGGCG ACACGCCGCC GATGGCGTAG ATCGCCTCTC CCAATCGCTG TTCGACGCCA GCGTCGATTT GAACCCGCAT CAGATCGAGG CGGCGCTGTT CGCCCTTCGG AACCCATTGC AGGAAGGTGT ACTGCTGGCT GACGAGGTGG GGCTGGGCAA GACCATCGAA GCCGCGCTGG TGGTTTGCCA ATACTGGGCT GAGCGCCGCC GTCGGCTCCT GGTCATCTGC CCGGCAAGCC TGCGCAAGCA ATGGGCGCAG GAGCTGCACG ACAAGTTCGC CGTGCCCACC ACCGTGGTGG ATGCCGTTTC TCTGCGCAAG CAGTCCGCTG GGGACATGCT CGCCACCCTG CAACGGCTGG TTGGCAAGGC GGTCGTGATC ATGTCCTACC AGTTCGCCGC CAAGCTGGAA GCCGAGCTGC GTGCCGTGCC CTGGGACGTG GTGGTCATCG ACGAAGCGCA CAAACTGCGC AACGCGCACC GCGCCAGCAA TCGCACCGGG CAGGCGCTCA AGCGCGCACT GCAAGGCCGC AAGAAGCTGC TGCTTACCGC CACGCCGCTG CAGAACTCGC TGATGGAGCT GTATGGCCTG TCCACGGTGA TCGACGAGCA CCTGTTCGGC GACGAGACGG CTTTCCGCAA GCAGTTCATG AACAGCGGGA CGGGTCTTGA TGAATTGCGC GAGCGTCTTG CCAGTTTCGC CAAACGCACA CTGCGCCGCG ACGTGCTGGA GTACATCAAG TACACCGAGC GCAAGGCGCT CACCCAGCCG TTCAATCCCA CGGACGACGA GCAGGCGCTG TATGAGCGCA TCTCGGCCTT TCTGCAAAAG GAAGACTCCT ACGCCCTGCC TAAGCAGCAA CGCCATCTCA CGGCACTGAT CTTGCGCAAG CTGCTGGCCT CCAGTTCGCA CGCCGTCGCT GCCACGTTGG TCACCATCCG CGAGCGCCTG CAGGGCTTGC TCACGGCAGA CAAAACGGAA GACGACGGCA GCCAACTGGT CGAACAGCTC ATTGCCGAGG ACGACCTGGA GCAGGACTAT CTGGAAGAGG AAGCCAGCGA GGCCGACGAA GACACCGAAG CTCCTACGCC CGCGCCAGCG GAAGACGACA AGACGGGCGC TGCAAAAGAT GCGCATGCCG TCCGCGCTGC CATCAGCGCC GAGATCGCGG AGCTGACCGC GTTCGTCGAT GCCGCGCAAG CGCTGCAAAC CGACACCAAG GCGCAGGCGC TACTGAAGGC GCTCGGCTTG GGCTTCAGCA AGATGGCCGA GCTGCGCGCG CCGCGCAAAG CCATCATCTT CACAGAGTCC AAGCGCACGC AGGAGTACCT GCACCGCTTT CTCTCCGCCA ACGGCCACGC GGGCAAGCTG GTGCTGTTCA GCGGCACCAA CAATCATGAA GACTCCACCG CCATCTACCA GCGCTGGCTG GAAGAGTACA AAGGCACGGA TCGCGTCACC GGCTCGCCGC AAGTAGATCG CCGCACCGCG CTGATCGACC ACTTCCGCAA GGACGACGGC ACCGGCGCGG AAATCATGAT CGCCACCGAA GCAGCGGCCG AAGGTGTCAA CCTGCAGTTC TGCGCGCTGA TCATCAACTA CGACCTGCCG TGGAACCCGC AGCGCGTCGA GCAGCGCATT GGCCGCTGCC ACCGCTACGG CCAGCGTTTC GACGTGGTGG TCATCAACTT CCTCAACACC CGCAACCAGG CCGACCAGCG CGTGCTGGAA CTGCTCACTG AGAAATTCAA CCTGTTCTCA GGGGTGTTCG GTGCGAGCGA TGAAGTGCTG GGCCGCATTG AAGGCGGCCT CGACTTCGAG AAACGCATCC TCCAGATCTA CGACACCTGT CGCCAGCCCG AGCAGATCGA AGCCGCCTTC AACGCCCTGC AAGCGGAGCT GGAAGAAGTC ATCGCCGACC GCATCAAAGA CACCCAGTCC CAACTTCTGG AGAACTTCGA TGAGGACGTT CACGACCGCC TCAAACTGCG TCTTCAAGAC GCCGAAGCGC GGCTCGACAA GCTGGGGCGC TGGTTCTGGG GTGTCACCTG CTTTGCATTG GACGGCCGCG CGCGCTTCGA CGAGCAGTCC TACGCGTTCT CTCTGAGCGC GCCGCCAACT GGAATCGCCA CAGGCCGCTA TCAGCTCATT CGTGGTGCGG CGCAGCCGGA CATGCTGGCG CACGCCTACC GCCTCAGCCA CCCGCTGGGG GAATGGAGCA TTGATGCCAG CCTGAACGCG GCCACGCCCG TCGCCACATT GAAACTCGAC TACGGCAAAC ACGGCGCACG CGTCTCCGTC ATCGAGAGAC TGCGCGGCAT GTCCGGCTGG TTGACGCTGG CCCGGCTGGA AGTCACCGCC TTCGAGACTA CCGAGGCACT GCTGTTCTCC GGCCTCACCG ACGATGGTCA GGTGCTGGAT CAGGAAGCCT GCGAAAAGCT GATGGCCATT CCAGCAGCAG GCAAGCCCAC TCCTTTTAAC AACCCTGTGC CCGAGGCCCT GCTGGCCAAC AGCCAGCGCG CGGTCGCAGC CACCGTTGCC CAGGTGCTGG AAGCCAACCA GCGCCTGTTC AATGAAGAAC GCGACAAGCT GGAACGCTGG GCCGACGACA AGCTGTTGGC CGCAGAGGAA GCCCTGAAGA ACACCAAGGC GCGCATCGCC CAGTTGAAGC GCGACGCCCG CAAGGCCGCC ACCTTGCAGG AGCAAGACGG CATCCAGCGC GAACTGTCTG AGTTGGAGCG CAAGCAGCGC CGCCTGCGGC AAGAGATTTT CGCCGTCGAG GACGAGATCA TCGCCAAGCG CGACGATCTG ATTGCCTCGC TCCAGCAGCG CCTGCAAGAA AAAACAAGCC ACGAGATCCT GTTCAGCGTG CGCTGGCAGG TCATCTGA
|
Protein sequence | MISAYHAKYY AHELTRRHAA DGVDRLSQSL FDASVDLNPH QIEAALFALR NPLQEGVLLA DEVGLGKTIE AALVVCQYWA ERRRRLLVIC PASLRKQWAQ ELHDKFAVPT TVVDAVSLRK QSAGDMLATL QRLVGKAVVI MSYQFAAKLE AELRAVPWDV VVIDEAHKLR NAHRASNRTG QALKRALQGR KKLLLTATPL QNSLMELYGL STVIDEHLFG DETAFRKQFM NSGTGLDELR ERLASFAKRT LRRDVLEYIK YTERKALTQP FNPTDDEQAL YERISAFLQK EDSYALPKQQ RHLTALILRK LLASSSHAVA ATLVTIRERL QGLLTADKTE DDGSQLVEQL IAEDDLEQDY LEEEASEADE DTEAPTPAPA EDDKTGAAKD AHAVRAAISA EIAELTAFVD AAQALQTDTK AQALLKALGL GFSKMAELRA PRKAIIFTES KRTQEYLHRF LSANGHAGKL VLFSGTNNHE DSTAIYQRWL EEYKGTDRVT GSPQVDRRTA LIDHFRKDDG TGAEIMIATE AAAEGVNLQF CALIINYDLP WNPQRVEQRI GRCHRYGQRF DVVVINFLNT RNQADQRVLE LLTEKFNLFS GVFGASDEVL GRIEGGLDFE KRILQIYDTC RQPEQIEAAF NALQAELEEV IADRIKDTQS QLLENFDEDV HDRLKLRLQD AEARLDKLGR WFWGVTCFAL DGRARFDEQS YAFSLSAPPT GIATGRYQLI RGAAQPDMLA HAYRLSHPLG EWSIDASLNA ATPVATLKLD YGKHGARVSV IERLRGMSGW LTLARLEVTA FETTEALLFS GLTDDGQVLD QEACEKLMAI PAAGKPTPFN NPVPEALLAN SQRAVAATVA QVLEANQRLF NEERDKLERW ADDKLLAAEE ALKNTKARIA QLKRDARKAA TLQEQDGIQR ELSELERKQR RLRQEIFAVE DEIIAKRDDL IASLQQRLQE KTSHEILFSV RWQVI
|
| |