Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0972 |
Symbol | |
ID | 7085075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1073023 |
End bp | 1075293 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643697994 |
Product | DEAD-like helicase |
Protein accession | YP_002354634 |
Protein GI | 217969400 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCTCG ATCTCGAAAC CGTTCCCGAA ACCGCTGTGC AGGGCGACCT GCTCGAAACG GCCGCTTCAC CCCTCACGCT CAGCCTGCAG GACTTCGTAT CGGAGTTCGG CGACGAGCTT CTCGACTCGC TCAACCGCGC CAATCCTCCG GTCTACACCG GCCAGGTGCG GGTGCATCGG CAACTGATCC TCGCCGCGCT CAAGCGCAAG CTGTTCCCGG CGCAAGCCGA TGTGGTCCAC GCCGTCACCG AGCTGTTGGT CGACCGTGGC GAACGCGCCG CGATCGTCAA TGGCGAGATG GGCTGCGGCA AGACGACGGT GGGTATTGCC ACCGCCGCCG TGCTCAACGC CGAAGGCTAC CGCCGCACCC TGGTGCTCTC GCCACCCCAC CTGGTCTACA AGTGGCGGCG CGAAATCCAG GAGACGGTGG CCGGCGCCAA GGTCTGGGTG CTCAATGGCC CGGACACGCT GGTCAAGCTG CTGAAACTGC GCGAGCAGTT GGGCGTGCCG GCCCAGGGCC AGGAGTTCTT CGTCCTGGGC CGTGTGCGGA TGCGGATGGG GTTCCACTGG AAGCCTATCT TCGTTCGGCG GCGCACGCCT CACGGCGACG TGGGGGCCTG CCCGGGTTGC GGGCATGTCA TCACCGACCT GGACGGCGAG CCTGTCAACA CCATCGATCT AGAAGCGGAA GAGTCCCGCC GCAAATGCAG CCACTGCCGC GCACCGCTGT GGTCGTTGAT CCGTCCCAGA GGCCTGTCCG CCAGCGACCA GTCCTCGAGC GTGCTCAAGG CACTGAAGCG TATTCCAACC ATCGGGGAAG TCACCGCGCA GAAGCTGATG CAGAAGTTCG GTGACGCCTT CCTCGCGTCG ATGCTCGGGG ACAACATCCA CGAGTTCATC AACCTGATGG ATGGCAACGG CGAGCTGGTC TTCTCGGACC GGCAAGCCCA TCGGATGGAA CGCGCGATGG CCAACATGGA GTTCGGCTTC GGCGAGGGCG GCTACCAGCC GTCCGAGTTC ATCAAGAGGC AGCTTCCCCA AGGTACGTTC GACCTGCTCA TCGCCGACGA GGCGCACGAG TACAAGAACG GCGGCTCCGC CCAGGGCCAG GCCATGGGGG TGTTGGCGGC CAAGGCGCGC AAGACGCTGC TGCTCACCGG CACACTGATG GGCGGCTACG GCGACGACCT GTTCCACCTG CTGTTCCGCG CCCTGCCGGG GCGGATGATC GAAGACGGCT ACCGGCCGAC GAAGAGCGGC AGCATGACCT CGGCCGCGAT GGCGTTCATG CGCGATCACG GTGTCTTGAA GGACATCTAC TCCGAGAGCA CCGGCACGGC GCACAAGACG GCCAAAGGCA CCAAGGTATC GGTGCGCACG GTCAAGGCGC CGGGTTTCGG TCCGAAGGGC GTGCTGCGTT GCGTTCTGCC GTTCACGGTC TTCCTCAAGT TGAAGGACAT CGGCGGCAAT GTGCTGCCGC CCTACGACGA GGAGTTCCGC GAAGTCGCGA TGGACACGGC GCAGGCCGCG GCCTACCGCG ATCTGGCGGG TCGCCTGACC CAGGAGCTGA AGCAGGCCCT GGCGAAGCGC GACACGACGC TGCTCGGTGT AGTCCTCAAC GTGCTGCTGG CCTGGCCGGA CTGCTGCTTC CGGTCGGAAA CGGTGGTGCA TCCGCGCACG CGCAACACCT TGGCGTTCGT TCCGGCTCAG TTCAACGGGC TGGAGGTGAT GCCCAAGGAA CGCGAGCTGA TCGAGATCTG CAAGCAGGAG AAGGCAGAAG GTCGCAAGAC CCTGGTCTAT TCGGTCTACA CCGGCACCCG CGACACCACG TCGCGTTTGA AGGTGCTGCT GGAGCAGGAA GGCTTCAAGG TGGCGGTGCT GCGCGCGAGC GTGGATGCCT CCCGCCGCGA GGACTGGATC GCCGAGCAGT TGGACCGTGG CATCGACGTG CTCATCACCA ATCCCGAGCT GGTGAAAACC GGCCTGGACT TGCTGGAGTT CCCGACCATC GTGTTCCTCC AGTCCGGCTA CAACGTGTAT TCGTTGCAGC AGGCCGCCCG GCGCTCATGG CGCATCGGCC AGAAGCAGCC GGTGCGCGTG ATCTACCTCG GCTACGCCAA CTCCTCGCAG ATGACCTGCC TGGGGTTGAT GGCCCGGAAG ATCATGGTGT CGCAATCCAC CTCGGGCGAC GTTCCTGAAT CCGGGCTCGA TGTCCTGAAC CAGGATGGCG ACTCGGTGGA GGTGGCACTG GCTCGGCAGT TGGTGCATTG A
|
Protein sequence | MSLDLETVPE TAVQGDLLET AASPLTLSLQ DFVSEFGDEL LDSLNRANPP VYTGQVRVHR QLILAALKRK LFPAQADVVH AVTELLVDRG ERAAIVNGEM GCGKTTVGIA TAAVLNAEGY RRTLVLSPPH LVYKWRREIQ ETVAGAKVWV LNGPDTLVKL LKLREQLGVP AQGQEFFVLG RVRMRMGFHW KPIFVRRRTP HGDVGACPGC GHVITDLDGE PVNTIDLEAE ESRRKCSHCR APLWSLIRPR GLSASDQSSS VLKALKRIPT IGEVTAQKLM QKFGDAFLAS MLGDNIHEFI NLMDGNGELV FSDRQAHRME RAMANMEFGF GEGGYQPSEF IKRQLPQGTF DLLIADEAHE YKNGGSAQGQ AMGVLAAKAR KTLLLTGTLM GGYGDDLFHL LFRALPGRMI EDGYRPTKSG SMTSAAMAFM RDHGVLKDIY SESTGTAHKT AKGTKVSVRT VKAPGFGPKG VLRCVLPFTV FLKLKDIGGN VLPPYDEEFR EVAMDTAQAA AYRDLAGRLT QELKQALAKR DTTLLGVVLN VLLAWPDCCF RSETVVHPRT RNTLAFVPAQ FNGLEVMPKE RELIEICKQE KAEGRKTLVY SVYTGTRDTT SRLKVLLEQE GFKVAVLRAS VDASRREDWI AEQLDRGIDV LITNPELVKT GLDLLEFPTI VFLQSGYNVY SLQQAARRSW RIGQKQPVRV IYLGYANSSQ MTCLGLMARK IMVSQSTSGD VPESGLDVLN QDGDSVEVAL ARQLVH
|
| |