Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2394 |
Symbol | |
ID | 7094316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011667 |
Strand | - |
Start bp | 55594 |
End bp | 58617 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643701081 |
Product | transposase Tn3 family protein |
Protein accession | YP_002364222 |
Protein GI | 217980172 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCAC TGGAGCGCAC TGCATACCCG CAGTTTCTGA GCTACTACTC ACCGAGTGAC CTGCGACAGT TCTTTACTCT GCAGGATGAA GAAATCAACT GGCTGAAGTC GGCCGGCCGT TCTGCCGGGA CCCGACTCGG ACTGGCCGTC CTGCTCAAAG TTTTCCAGCA CTTGCGCTAC TTTCCTAGCC TCGACAAGAT ACCTTCCGAG GTGATCACCC ACGTCCGGAA TGGCCTCGGT TTCGGTGACG CGATACGGAT CGAATACCCC GTCGAACGCA CCCTCTTTCG GCACAAAGCC ATGGTGCTGG GCTTGCTTAA CGTCAAACCC TTCCATGGGC ACGATGCGAT GCGCCAAGCA GAACGCTTTG CCTGCGATGC GGCTGAGCTC ATGGATCAGC GTGCCGACAT CATCAATGGG ATAATCGAGG GGGTGCTCCA GGCGCGCTAC GAACTCCCCG CCTTCTCGAC GCTCGACGAT TTGGCAGAGA AGGCCCACGC TGCCGTGCAG AACCGGGTGT TCGGTAAGGT ATTTCACCGT ATCACCCCCC AACAACTCGA AGCCCTACAG GCGCTGCTGG TCACGGACAA ACTGGAACGC CGCCAAAGCG AATACAACGA GCTCAAGAAA TCCGCGAAAC GCCCGACCCG CAAGCATCTC GATATGCTGG TGGAGCATCT GGAATGGCTG GATTCGATCA CGGCGGGGGA TGACGTCCTT GCAGGCCTTC CCGACACCAA GATTCGCCAC TTCGCCGCCC AGGCCATGGC CTATGACGTG TCCGAGTTGC GCGAATGCGC CGAGACCAAG CGTTACACCC TGCTGGTGGC CCTGATCCGG CGCATGCAGG TGCGCGCGCG GGATCAGTTG GCCGAGATGT TCCTGCGCCG CGTGGCCACG ATTCACAAAC GCGCCAAGGA AGAGCTCGAC CAGATCCAGT TCGGGCAGCG CGGCCAGATC GAGCGCCTGA TCGGCACGCT GGATGGCGTG CTCGCCATCC TCGACGGCGA ACCGGACAAT GCGGCGGCCG GCGCTCAGAT ACGCGAATAC CTTGCACCCG CAGGTGGAGT CCACGGGGTC CGCGAGACCT GTGCCCAGGT GCAGGCGACC AGCGGCAACA ACTACCTGCC GCTCGTGTGG AAACACTTCA AGAGCCATCG CTCCATCCTG TTCCGGCTCG TGCATTTGCT CGACATCCGC GCCACCACCC AGGATCGGAC GCTGATCGAC GCTTTGAACC TCATCAAGAC CTACCAGGAC AAGCATCGCA TCGAGTGGAT TACGGAGAGC ATTGACCTTT CCTTCGCCTC GGACCGCTGG CGCAAGCTCC TGCGCGAGGG CGACAGCGGC GATGGTTTCG GTCTGGGCTT AAGCCGGCGC AACCTCGAAA TCTGTGTCTT CTCCTACCTC GCCGAGGAGT TGCGCTCTGG CGATCTCAGT ATCGTCGGCT CGGAGGAGTT CGCCGATTAC CGCGATCAGT TGCTTTCGTG GGAGGAATGC CAGGAGTTGT TACCCGCCTA CTGCGACAAG ATTGGCCTGC CGCAGGACGC AACGACCTTT GCCAGCAGTT TGCGCGACTG GCTCACCGAC ACCGCGCAGC ACCTGGATGA CACGTTTCCT GAATGCCGCG GCGATGTGGC CCTGACTGCC AGCGGAGAAC CCGTGTTGCG CAAGCCGATT GCGCGCGAAA TCCCGCCTTC GGTTATGTCG CTGCAGAACG CGCTGACCCA GCGCATGCCG GCGCGGCACA TTCTGGACGT GCTCGCCAAC ATCGAACACT GGATGGGCTT CACGCGGCAC TTCGGGCCAC TATCGGGCAA CATGGCCAAG CTCAAGCAGC CGGCCGAGCG CTACCTGCTG ACGATCTTCG CCATGGGCTG CAACCTCGGG CCCACACAGG CGGCGCGCCA TCTGGGCAAC AGCGGCGTCA CGCCGCACAT GCTGTCCTTC GTCAATCGGC GCCACCTCTC ACTGGAGAGC CTGGAAGCGG CCCAACGCGA GCTCAACGAG GTATATCTGC GCCTGGATCT CCCCAAGATA TGGGGTGATG GCAAGACGGT CGCCGCCGAC GGTACGCAGT ACGATTTCTA CGACGAAAAC CTGCTCGCGG GCTACCACTT CCGTTACCGC AAGATGGGCG CCGTCGCTTA CCGCCACGTG GCCAACAACT ACATCGCGGT GTTCCGTCAC TTCATCCCGC CCGGCGTGTG GGAAGCGATC TACGTGATCG AGGGGCTACT CAAGGCGGGC TTGTCGGTCG AGGCGGATAC GGTGCATGCC GACACCCAGG GCCAGTCGGC CACCGTGTTT GCCTTCACCC ACTTGCTGGG CATCAAGCTG ATGCCGCGGA TCCGGAACTG GAAGAATTTG ACGCTGTTCC GGCCGGACAA GACGGTGAAG TACAAGTACA TCAATCGGCT GTTCGGCGAC AGCGTGGATT GGAATCTCAT CGAGCGGCAC TGGCAGGATC TGATGCAGGT GGCCTTGTCG ATCTACGCCG GCAAGATTTC TTCGGCCACG CTGCTGCGCA AGCTGGGCAG CTACAGCCGC AAGAACCGTC TGTATTTCGC GGCGCAGGAA CTGGGCAACG TGATCCGCAC GGGCTTCCTG CTGGAGTGGA TCGGCAGTCG TGAGCTGCGC CAGGAGATCA CGGCGAACAC CAACAAGATC GAGTCCTACA ACGGCTTTGC CAAGTGGCTG TCCTTCGGTG GCGACGTGAT CGCCGTGAAT GAGCCGGACG AACAGCAGAA GCGCCTGCGC TACAACGATC TCGTGGCCTC CGCCCTGATC CTGCAGAACA CCGTCGACAT GATGCGCACG TTGGGCAACC TCCGGCATGA AGGCTGGCAG ATCACGGAAA ACGATGTCAG CTTTCTGAGC CCCTACCAGG TCGCGCACGT CAAGCGCTTC GGCGAGTACA GCCTGAAACT CAAGCGCAAA CCCGAAGCCT GGATCGCGGA CGACACCTTT CAACAAGTGG CCGCGTCGGT GCAGGACATG CGGCGCGCGA CGCGCAGAGC CTGA
|
Protein sequence | MASLERTAYP QFLSYYSPSD LRQFFTLQDE EINWLKSAGR SAGTRLGLAV LLKVFQHLRY FPSLDKIPSE VITHVRNGLG FGDAIRIEYP VERTLFRHKA MVLGLLNVKP FHGHDAMRQA ERFACDAAEL MDQRADIING IIEGVLQARY ELPAFSTLDD LAEKAHAAVQ NRVFGKVFHR ITPQQLEALQ ALLVTDKLER RQSEYNELKK SAKRPTRKHL DMLVEHLEWL DSITAGDDVL AGLPDTKIRH FAAQAMAYDV SELRECAETK RYTLLVALIR RMQVRARDQL AEMFLRRVAT IHKRAKEELD QIQFGQRGQI ERLIGTLDGV LAILDGEPDN AAAGAQIREY LAPAGGVHGV RETCAQVQAT SGNNYLPLVW KHFKSHRSIL FRLVHLLDIR ATTQDRTLID ALNLIKTYQD KHRIEWITES IDLSFASDRW RKLLREGDSG DGFGLGLSRR NLEICVFSYL AEELRSGDLS IVGSEEFADY RDQLLSWEEC QELLPAYCDK IGLPQDATTF ASSLRDWLTD TAQHLDDTFP ECRGDVALTA SGEPVLRKPI AREIPPSVMS LQNALTQRMP ARHILDVLAN IEHWMGFTRH FGPLSGNMAK LKQPAERYLL TIFAMGCNLG PTQAARHLGN SGVTPHMLSF VNRRHLSLES LEAAQRELNE VYLRLDLPKI WGDGKTVAAD GTQYDFYDEN LLAGYHFRYR KMGAVAYRHV ANNYIAVFRH FIPPGVWEAI YVIEGLLKAG LSVEADTVHA DTQGQSATVF AFTHLLGIKL MPRIRNWKNL TLFRPDKTVK YKYINRLFGD SVDWNLIERH WQDLMQVALS IYAGKISSAT LLRKLGSYSR KNRLYFAAQE LGNVIRTGFL LEWIGSRELR QEITANTNKI ESYNGFAKWL SFGGDVIAVN EPDEQQKRLR YNDLVASALI LQNTVDMMRT LGNLRHEGWQ ITENDVSFLS PYQVAHVKRF GEYSLKLKRK PEAWIADDTF QQVAASVQDM RRATRRA
|
| |