Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0368 |
Symbol | |
ID | 7084874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 416874 |
End bp | 418199 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697399 |
Product | transposase IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_002354047 |
Protein GI | 217968813 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACC AGATCGAAGC CCTCTTCACC ACCGCGCTTG GCCTGCAGCC GCCCTGGCAC GTTGCCAAGG TCGAGCTCAA CACGGGCAAG CGCCGGATCG ACTTCGAGGT CGAGCACACC GGCAAGCGCG CGGCGTGCCC GGCGTGCGGG ATGGAACATC AGCTGATCCA CGATCGGGTG CGGCGCAGCT GGCGGCACCT GGACTTCTTC CAGTTCGAAG CCTGGCTGCA TGCGGAGATC CCGCGCGTGC AGTGCACGGG CTGCGGCAAG ACCACGCAGT TGCCGGTGCC ATGGGCGCGC GAAGGCAGCG GCTTTACCTT GCTGTTCGAG GCGCTGGGCC TGTCGCTGTG CCGGGAGCTT CCGGTGCGCC AGGCGGCCAA CCAGATGCGG GTGGCGCCCA AGCGGCTGTG GCGGCGGGTG CGGCACTACG TCGAGGTGGC GCGCGCCCGG GACGACATGA GCGGCGTGCG CTACGTCGGC ATCGATGAGA CCAGCGTCAA GCGCGGACAC GAATACATCA CCGTCGTCCA TGACCTCGAG GCCAAGCGCC TGCTGTTCGC CACGCCGGGA CGCGATCACA CCACGCTGCA GGCCTTTGCG CAGGAGCTGC GTGCGCACGG CGGCGATCCG CAAGGGGTCG AGCATGCGTG CATCGACATG AGCGCCGCCT ACGCCAAGGG CATCGCCCAG GCGCTGCCCG GCGCCCAGAT CAGCTACGAC CGCTTCCACG TCGTGGCGCT GGCCAATGCG GCGATGGACG AGGTGCGTCG CGAGGAAATG CGCAGCTCGG CCGCCGCAGT CCGTGATGCG GCGGGCACGC ACAGCAAGAA GACCCTGCGC CAGCTGCTGT GGGGCATGCG CAAGAACCCG GTGAGCTGGA CCCGCGCGCA GTTCGAGGCG ATGCACTGGC TGCAGCGCTC GAACCTGAAG AGCGCACGGG CCTGGCGCCT GAAGCAGGCG CTGCGGCTGG TCTATCGCGA GGCGGGCGCG AGCAACAGCG AAGAGATTGC CCACGGGGCG CTGACCAAGT GGATGAGCTG GGCTCGGCGC TCTCGCCTGG AGCCGTTCAA GCGCTTGGCG GCGACGTTGA AGGCGCACCT CGCCGGGGTC GTGCGCGGCA TGCTCGACGG GCGCAGCAAC GCCTACGTCG AGGCGATGAA CGGTCTGCTT CAACAGACGA AGACCGCCGC CCGGGGCTTC CGCAAGGTCG AGAACTTCAT CGCTATCGCC TACCTGCGCA TGTCCAAGCT CGAGCATCTA CCGAAGAACC CTCTGCTGCC GGCCATCGCC CGCGACTACG GGCGCTACCG TCATGTCTGT TGTTGA
|
Protein sequence | MSNQIEALFT TALGLQPPWH VAKVELNTGK RRIDFEVEHT GKRAACPACG MEHQLIHDRV RRSWRHLDFF QFEAWLHAEI PRVQCTGCGK TTQLPVPWAR EGSGFTLLFE ALGLSLCREL PVRQAANQMR VAPKRLWRRV RHYVEVARAR DDMSGVRYVG IDETSVKRGH EYITVVHDLE AKRLLFATPG RDHTTLQAFA QELRAHGGDP QGVEHACIDM SAAYAKGIAQ ALPGAQISYD RFHVVALANA AMDEVRREEM RSSAAAVRDA AGTHSKKTLR QLLWGMRKNP VSWTRAQFEA MHWLQRSNLK SARAWRLKQA LRLVYREAGA SNSEEIAHGA LTKWMSWARR SRLEPFKRLA ATLKAHLAGV VRGMLDGRSN AYVEAMNGLL QQTKTAARGF RKVENFIAIA YLRMSKLEHL PKNPLLPAIA RDYGRYRHVC C
|
| |