Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3863 |
Symbol | |
ID | 7874105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4258896 |
End bp | 4260218 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700806 |
Product | transposase IS4 family protein |
Protein accession | YP_002890829 |
Protein GI | 237654515 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAGCT ATCTGCCCTA TTGCCCGCAG CAGCAGATGC TGCTGCCCCA GGCGCTGCAG GAGTGGCTAC CCGAAGGCCA CTTGGCGTAC TTCATCAGCG ACGCGGTCGA CGGGTTGGAT CTGAGCGCGT TCCACGCCCG GTATGCCGGT GGCGGACCGC GCAACCAGCC GTTTCATCCG GCCATGATGG TCAAGGTGCT GCTGTATGCG TACGCCACGG GCGTGTTCAG TTCGCGCAAG ATCGCGCGCA AGCTGCACGA GGATGTGGCG TTCCGGGTCC TGGCGGCAGA CAACTTCCCG GCCCACCGCA CGCTGAGCGA CTTTCGCGCT GTCCATTTGA AGGAGCTGAG CGAGTTGTTC GTGCAGGTGG TACGACTGGC CCGCGAGATG GGGCTGGTCA AGCTCGGGAC GGTGGCCATC GACGGCACCA AGGTCAAGGC AAACGCCAGC CGCCACAAGG CGATGAGTTA CGGCCATATG GTGAAGGCGG AGGCCGAGTT GAAACGGCAG ATCGAGGCGC TGCTCAATCG GGCCAAGGCC GCCGACGACG CCGAGCGGAA CGAGCCCGAG CTGGATGTGC CGGCCGAGAT CGCGCGGCGC GAGGCGCGGC TGACGGCGAT TGCTGAAGCC CGGGCGCGGC TCGAGCAGCG CCAGCGCGAG GCCGATCAGG CGCGCGGGCG CAGCGACGAT GACGACCGTC GCCCGCGCGG CGGTGACGGC AAACCGAAGG GCGGGCGCTA CAAGCGCGAC TTTGGAGTGC CCGAGGACAA GGCGCAGGAG AACTTCACGG ATCCGGACAG CCGCATCATG AAGCGCGCCG GCGGCGGGTT CGATCCGAGC TACAACGCCC AGACGGCGGT CGACGAGACC GCCCACATCA TCGTGGCGGC CGAACTGACC AACAACGCCA GCGACGCCGG GCAACTGGCG GGGGTACTGC AGGCCGTGCG CGACAACGTC GAACACCGAC CGCGCCAGGC GCTGGCCGAC ACCGGCTACC GCTCGGAGCA AACGTTCCGG GAACTCGACG GGTGCGGGAC CGAACTGGTG GTGGCGCTGG GCCGGGAAGG TAAGCGCCGA CTCGGCTTCG ATCGCGAACG CAATCCGCAC ACCGCGCAGA TGGCCGACAA GCTCGAGAGC GAGGCGGGCA AGAGCGCCTA CCGAAAACGG AAATGGATCG CCGAACCGCC CAACGGCTGG ATCAAGAACG TGTTGGGATT CCGGCAGTTC AGCCTGCGGG GCCTGGAGCG CGTCAAAGCG GAGTGGAAGC TCGTCTGTAT GGCGCTGAAC CTGCGCAGGA TGAGCACATT GAGGACGGCA TGA
|
Protein sequence | MTSYLPYCPQ QQMLLPQALQ EWLPEGHLAY FISDAVDGLD LSAFHARYAG GGPRNQPFHP AMMVKVLLYA YATGVFSSRK IARKLHEDVA FRVLAADNFP AHRTLSDFRA VHLKELSELF VQVVRLAREM GLVKLGTVAI DGTKVKANAS RHKAMSYGHM VKAEAELKRQ IEALLNRAKA ADDAERNEPE LDVPAEIARR EARLTAIAEA RARLEQRQRE ADQARGRSDD DDRRPRGGDG KPKGGRYKRD FGVPEDKAQE NFTDPDSRIM KRAGGGFDPS YNAQTAVDET AHIIVAAELT NNASDAGQLA GVLQAVRDNV EHRPRQALAD TGYRSEQTFR ELDGCGTELV VALGREGKRR LGFDRERNPH TAQMADKLES EAGKSAYRKR KWIAEPPNGW IKNVLGFRQF SLRGLERVKA EWKLVCMALN LRRMSTLRTA
|
| |