Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2901 |
Symbol | |
ID | 7873803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3143815 |
End bp | 3144960 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643699822 |
Product | transposase IS4 family protein |
Protein accession | YP_002889877 |
Protein GI | 237653563 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5433] Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.180142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATAG GGAAGCTGGC CGACATGGTG GAGGTTTTCG AGGGCCTCGA GGACTGGCGC AATGCGCAAC AGACGCGGCA CCGCTTGAGC GAACTGCTCA CCGTGGCGGT GTGCGCGGTG CTCAGCGGCG CCGACGACTT CGAGGAGATC TCGCAGTGGG GGCGCGCCAA AGTGCCGTGG CTGAGGGGCT TTCTGCGGCT CGACTACGGC GTGGCCTCGC CCGACACCTT CGAGCGCGTG TTCGCGCTGC TCGATCCGAA ACAGTTCGAA CAGGCCTTTC GCACCTGGGT CGGCGGCATC ATTCCGGCAG TGGGCAAAGA CCAGGTCATC GCCATCGATG GCAAGTCGAG CCGACGTACC ACGAGCAAGG CGGCCGCTGC GCCGCTGCAT CTGGTCAGCG CGTTTGCGGC CAACGTGGGC GTGGTGCTGG GCCAGACGGC GACGGCGGAG AAGTCCAACG AGATCACGGC GATCCCCGAA CTGCTCAAGG TGCTCGACAT CGAGGGCTGC ATCGTCACCA TCGATGCGAT GGGCACGCAG ACCAAGATCG CACGCGCCAT CCGTGAGCGA GGTGCCCACT ACGTGCTGTG CGTGAAGGAC AACCACCCGA AGCTGCTCGA CTCGATCATG TTCGCCGACA TCGATCCGCG CGGTCCGCTG ACACCAAGTT CGACCCATGA AACCACGAGC ACCGGCCATG GACGGATCGA GGTGCGACGC TGTACGGCCT TTGATGCGAC CGATCGGCTC CACAAGGCCG AGGCCTGGAA GGACGTGGCC AGCTTTGCCG TCGTCGAGCG CGTGCGAACG GTGGGCGAGC GCACCAGCAC CGAGCGCGTC TACTACATCA GCAGCCTGCC GGCCGACGCC GAGCGCATTG CGGTGGCGAT CAGAAGTCAT TGGGAAGTGG AGAATCGGCT GCACTGGTGT CTGGATGTTC AGTTCGGTGA CGACTACGCA CGCGGACGCA TCGGTCACAT TGCCCACAAC CTGGCGCTGG TGCGCCACAT GGCGCTCAAT CTCATCCGGC TCGATAAGTC CATCAAGACG AGCATCAAAA CCAAGCGACT GCTGGCCGCG ACGTCCGATG AGTTTCGGGC TGCGCTGCTG GGCTTTGAGC CACCAGACGA GGATGACGAC GATTGA
|
Protein sequence | MDIGKLADMV EVFEGLEDWR NAQQTRHRLS ELLTVAVCAV LSGADDFEEI SQWGRAKVPW LRGFLRLDYG VASPDTFERV FALLDPKQFE QAFRTWVGGI IPAVGKDQVI AIDGKSSRRT TSKAAAAPLH LVSAFAANVG VVLGQTATAE KSNEITAIPE LLKVLDIEGC IVTIDAMGTQ TKIARAIRER GAHYVLCVKD NHPKLLDSIM FADIDPRGPL TPSSTHETTS TGHGRIEVRR CTAFDATDRL HKAEAWKDVA SFAVVERVRT VGERTSTERV YYISSLPADA ERIAVAIRSH WEVENRLHWC LDVQFGDDYA RGRIGHIAHN LALVRHMALN LIRLDKSIKT SIKTKRLLAA TSDEFRAALL GFEPPDEDDD D
|
| |