Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2112 |
Symbol | |
ID | 7085382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2386931 |
End bp | 2389897 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643699131 |
Product | transposase Tn3 family protein |
Protein accession | YP_002355748 |
Protein GI | 217970514 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGACG GCGGCGGCAG TAAGGAAGAT CAGTGGGTAT TGGCGCCAGC CGAACGCGAA CTGGTGATGA CCAAAAACCG GGCGAACCGG TTGGGCTTTG CCATCCTGCT GACCTTCTTC CGCGATCGCG GCCGTTTTCC GCGCGACGAA ACCGAAGTCG AGGTACAGGG CATAGCCGCG CTCGCCAAAC AACTCGACGC ACCCGCGCCC ATCGATGGCG AAGCCTTCCT CACGGGCCGC ACTGCCGAGC GGCTGCGCGG CGAAATCCGC GTGCGTTTCG GCTTCCGCGA AGCGACGGTA GCCGATGCCG AGATGCTGAC GGAGTGGCTG CGTGATCATG TTGCCGGAGA AGTTGGCGGT GACATTGAGC CGATGATCGA GCGGCTGGAA GGACGTTGCC GCGAACTCGC CATCGAGCCG CCGAAACCAG ACCGGATGGA GCGCATCGCG CGCAGTGCGT TGCGCTCCCA CGAAGACCGC TTCCATAGCT GCGTGTATGG GCGGCTGCCG CCCGCGACTC GCGAACGCCT GGATGCCTTG CTGCGCCCAG AAGAATCGGG CCACGGGGAG AGCGCCGTTG AAGATGCTCA AGGCGAAGCC GCAGGCAACG CGCCGGCCGT CTTGCTGAAA CTGCGCGGCA GTCCCGGCCG CCCAAGCCTT GCCAGCATGC AGGATGAGTT GGCGAAGCTC GAACTGATCC GGGGGATCGA GCTGTCTGCC GATCTGTTCG ACCGGACTTC GCCGCGTGAC CTGGAGCGCT GCCGCCAGCG TGTGTCGGTC GAGGTTCCCC GCGACCTGCG CCGACATCCC GATGCAGCGC GCCTCACCTG GCTGGCCGCT TTCGTCCACC TGCGCGCCCG CAGCCTGACC GACGACCTGG TGGACTTGCT GATCGAGACC ATCCACCAGA TCGGCGCGCG TGCCGAACGC AAGGTCGAAC GCGAACTGCT GGAGGACCTC AAGCGCGTGT CCGGCAAGCA GAACCTGCTG TTCAATCTGG CCGACGCCAC CTTGGCCCAG CCGGACGGCG TGGTGCGCGA CGTGGTGTTT CCAGTGGTCG GCGAGCAGAC GCTGCGCGAT CTGGTCAAGG AGTGGAAGGC CACCGGCCCG ACCTACCGCA TCACGCTGCG CACTGTGATC CGCAATTCGT ACCAGGGCCA CTACCGGCGC ATAGTACCGA CCTTGCTGGC CGCGCTGGTA TTCCGCTCCA ACAACGACCG CCACCGCCCG GTGATGGACG CGCTCGACCT GGTGAAGCGC TTTGCCGACA CCAAGGTGCA TACCTTCCCA GCCGACATCG AGGTGCCGCT CGATGGCGTG GTACGTGGCC TGTGGCGAGA AGCCGTCATG GAGACGGACG CCGCCGGCCG GGATCGGGTC AACCGCGTCA CCTATGAAAT CGCCGTGCTG GAAGCCCTGC GCGAGCGGCT GCGCTGCAAG GAAATCTGGG TGGTCGGCGC GAACCGCTAC CGCAACCCCG ACGACGATCT GCCGGCTGAC TTCGAGCAAA ACCGCGAGGA CTACTACCGG GCGCTGAACC TGCCTCTCGA TGTGGAGCGC TTCATCGCCG ACTTGCAGGC CGAAATGCGC GCGGCGCTGT CCACCTTCGA CGCTGGCTTG AAGAAGAATC CATCCGTCCG GCTGAGCAGC AAGGGCGGTG GCTGGATCAC GCTGACGCCG CTCGATGCGC AACCCGATCC CCCCAATCTG ACCGCGCTAA AGGCCGAACT CAATGTCCTC TGGCCGATGA CCAGCCTGCT CGATATGGTC AAGGAAACCG ATCTGCGGTT GAGCTTTACC GATGCCCTGA AAAGCCCGAC CTCCTACGAG TCGATGGATC GCTCGGTGTT GCAGCCGCGC CTGCTCCTGT GTCTGCACGG CCTGGGCACC AATGCTGGCT TGCAGCGCAT GGCCGGGCTG GATTCCGGCA CCACGGCGCG CGACCTGGCC TATGTGCGCC GCCGTTACAT CAGCGTGGAC GCGATGCGCC GCGCGATTGC CATCGTCGCA GACGGCACGC TGCAAGCCCG CAACCCGGCG ATCTGGGGTA GCGGCACCAC CGCTTGCGCG TCGGACTCGA AACACTTCGG CGCGTGGGAT CAGAACCTCA CCACGCAATG GCACGTCCGC TACGGCGGGC GCGGCGTGAT GATCTACTGG CATGTCGAGC GCAGCTCGCT GTGCATCCAT TCGCAGCTCA AGTCGCCGTC GTCGTCGGAG GTGGCGTCGA TGATCGAGGG CGTGATCCAC CATTGCACCG AGATGGAGGT GGATCGGCAG TATGTCGATT CGCACGGCCA GAGCACGGTG GCGTTCGCCT TCTGCCGCCT GCTGGGCTTC CAGTTGCTGC CACGGCTGAA GGCCATCCAC TCACAGAAGC TGTACCGGCC AGAGACCGGC AAGGCCGACG CCTACGCGAA CCTGCAACAG ATTCTGACCA AGCCCATCGA CTGGGACTCG GTGCGGCAAC AGTACGACCA GATGGTCAAG TACGCTACCG CGCTGCGCCT GGGGACAGCG GACACCGAAG CCATCCTGCG CCGCTTCACC AAGAAGAACG TGCAGCACCC GACCTACAAG GCATTCGCTG AGTTGGGCAA GGCGATCAAG ACTATCTTCC TGTGCCGCTA CCTGCACGAC GAGGCGTTGC GCCGGGAAAT CAACGAGGGG CTGAACGTAG TCGAGCAGTG GAACGGCGCG ACCGACTTCG TGTTCTTCGC CCGCCGGGGC GAGATGGCGA GCAACCGCCG CGAGGATCAC GAGGTCAGCA TGCTCGCGCT GCACTTGATC CAGAACTGTA TGGTCTACAT CAACACGCTG ATGATCCAGA AGGTCTTGGC CCTGCCGCAT TGGCAGGGCA GGTTCACACC ACGCGACTAC GCCGCCCTGA CGCCGCTGAT CTGGGAACAC GTCAACCCGT ATGGTCGGTT CGATCTCGAT ATGAACACCC GGCTCGACCT ACCGTGA
|
Protein sequence | MADGGGSKED QWVLAPAERE LVMTKNRANR LGFAILLTFF RDRGRFPRDE TEVEVQGIAA LAKQLDAPAP IDGEAFLTGR TAERLRGEIR VRFGFREATV ADAEMLTEWL RDHVAGEVGG DIEPMIERLE GRCRELAIEP PKPDRMERIA RSALRSHEDR FHSCVYGRLP PATRERLDAL LRPEESGHGE SAVEDAQGEA AGNAPAVLLK LRGSPGRPSL ASMQDELAKL ELIRGIELSA DLFDRTSPRD LERCRQRVSV EVPRDLRRHP DAARLTWLAA FVHLRARSLT DDLVDLLIET IHQIGARAER KVERELLEDL KRVSGKQNLL FNLADATLAQ PDGVVRDVVF PVVGEQTLRD LVKEWKATGP TYRITLRTVI RNSYQGHYRR IVPTLLAALV FRSNNDRHRP VMDALDLVKR FADTKVHTFP ADIEVPLDGV VRGLWREAVM ETDAAGRDRV NRVTYEIAVL EALRERLRCK EIWVVGANRY RNPDDDLPAD FEQNREDYYR ALNLPLDVER FIADLQAEMR AALSTFDAGL KKNPSVRLSS KGGGWITLTP LDAQPDPPNL TALKAELNVL WPMTSLLDMV KETDLRLSFT DALKSPTSYE SMDRSVLQPR LLLCLHGLGT NAGLQRMAGL DSGTTARDLA YVRRRYISVD AMRRAIAIVA DGTLQARNPA IWGSGTTACA SDSKHFGAWD QNLTTQWHVR YGGRGVMIYW HVERSSLCIH SQLKSPSSSE VASMIEGVIH HCTEMEVDRQ YVDSHGQSTV AFAFCRLLGF QLLPRLKAIH SQKLYRPETG KADAYANLQQ ILTKPIDWDS VRQQYDQMVK YATALRLGTA DTEAILRRFT KKNVQHPTYK AFAELGKAIK TIFLCRYLHD EALRREINEG LNVVEQWNGA TDFVFFARRG EMASNRREDH EVSMLALHLI QNCMVYINTL MIQKVLALPH WQGRFTPRDY AALTPLIWEH VNPYGRFDLD MNTRLDLP
|
| |