Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0333 |
Symbol | |
ID | 7085634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 378644 |
End bp | 379969 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643697370 |
Product | transposase IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_002354018 |
Protein GI | 217968784 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTCTC CGATCGAAGC CCTCTTCACC ACCGCGCTCG GCCTGCAGCC GCCCTGGTAC GTCGCCAAGG TGGACCTCGA CACCGCGAAG CGGCGGATCG ACTTCGAAGT CGAGCACGCC GGCAAGCGCG TGCCCTGTCC GGCCTGTGGG GCGGCGCATC AGCCGGTCCA CGATCGAGTG CGGCGCAGCT GGCGTCACCT GGACTTCTTT CAGTTCGAGG CGTGGCTCCA TGCCGACATC CCGCGCGTGC AGTGCTCGGG CTGCGGCAAG ACCACGCAGC TGCCGGTGCC GTGGGCTCGC GAGGGCAGCG GTTTCACGCT GCTGTTCGAG GCGCTGGGCC TGTCCCTGTG CAGCGAGTTG CCCGTGCGCC AGGCCGCCGC CCAGATGCGC GTCGCGCCCA AGCGGCTGTG GGGGCGGATC CGCCATTACG TTCACGGTGC ACGCGCTCGG GATGACATGT CGGGCGTGCG CTACGTCGGC ATCGACGAGA CCAGCGTCAA GCGCGGGCAC GCGTACATCA CCGTGGTGCA TGACCTGGAG GCCAAGCGCC TGCTGTTCGC CACGCCCGGG CGAGACCACG CGACCCTGCA GGCCTTTGCC CAGGACCTGC GCGCGCACGG TGGCGAGCCG GAGCGGATCG AGCACGCCTG CATCGACATG AGCGCGGCCT ACGCCAAGGG GATTGCCCAG GCGCTGCCCA CGGCGCAGGT CAGCTACGAC CGTTTCCACG TCGTGGCCCT GGCCAATACG GCGATGGACG AGGTTCGCCG CGAGGAGATG CGCAGCGCCG CAGCCGCGGT CCGCGCGGCG GCCGGTACGG GAAACAAGAA GACGCTGCGC CAGCTGTTGT GGGCGATGCG CAAGAACCCG CCGCAATGGA CGCCGGCACA GTGCGACGCG ATGAACTGGC TGCAGCGCTC GGGCCTCAAG AGTGCGCGGG CGTGGCGGAT GAAGCAGGGC CTGCGGCTCG TCTACCGCGA GGCGGCGGCG AGCAACTGCG AAGAGGTCGC CCGCGGGGCC TTGATGAAGT GGATCAGTTG GGCCCGACGC TCTCGCCTGG AACCCTTCAA GCGGCTCGGC GCCACGGTCA AGGCGCATCT GGGCGGCGTG CTCCGCGGCA TGCTCGACGG GCGCAGCAAC GCCTACGTCG AGGCGATGAA CGGGCAGCTT CAGCAGACGA AGACCGCCGC CCGAGGCTTC CGCAACCTCG ACAATTTCAT CGCCGTCGCC TACCTGCGCA TGTCCAAGCT CGAGCATCTA CCGAAGAACC CCATGGTGCC GGCGATCCCC CGCGAATACG GGCGCTACCG TCATGTTTGT TGTTGA
|
Protein sequence | MSSPIEALFT TALGLQPPWY VAKVDLDTAK RRIDFEVEHA GKRVPCPACG AAHQPVHDRV RRSWRHLDFF QFEAWLHADI PRVQCSGCGK TTQLPVPWAR EGSGFTLLFE ALGLSLCSEL PVRQAAAQMR VAPKRLWGRI RHYVHGARAR DDMSGVRYVG IDETSVKRGH AYITVVHDLE AKRLLFATPG RDHATLQAFA QDLRAHGGEP ERIEHACIDM SAAYAKGIAQ ALPTAQVSYD RFHVVALANT AMDEVRREEM RSAAAAVRAA AGTGNKKTLR QLLWAMRKNP PQWTPAQCDA MNWLQRSGLK SARAWRMKQG LRLVYREAAA SNCEEVARGA LMKWISWARR SRLEPFKRLG ATVKAHLGGV LRGMLDGRSN AYVEAMNGQL QQTKTAARGF RNLDNFIAVA YLRMSKLEHL PKNPMVPAIP REYGRYRHVC C
|
| |