Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1003 |
Symbol | |
ID | 7083987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1103029 |
End bp | 1104774 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698025 |
Product | transposase IS66 |
Protein accession | YP_002354665 |
Protein GI | 217969431 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCAAC GTACCGCCGC CCCTCGCTCG AACCCTTCCG TTGGCGTGTG CGCATCCGTC CTGGCCGAGC TGTTGCCCGA CGATCCAGCA ACGCTCAAAG CCTTGTTGCT GGCACAGCAG CGTGCCTTCG AGACGCGTGA AGCCGAACGG CAGGCAGCCT TCGAGGCCCG TGAGGCCGCA CTCCAGAAAG TCTTCGATGC GCGGGAGGCT GAACGCCAAA GAGCCTTCGA TGCGCGGGAG GCTGAACTGC AAAAGGCCTT CGAGGCACGC ATCCTCGAGC TCTACGAGCA GCTTCGCCTG GCGCGTCGGC GCATGTTCGG GCCCAGCAGC GAATCGCACG CGGGCCAGGC CTGGCTCTTC GACGAGGCCG AGGCGCTGGC CGAGTCCGCA CCCGAGGCGC TCGACACCGC AACCTTGCCG CCGCCGGCCA CCGAGACGAC GGGTGAGGCG TCCGCCGACA CCGGCAAGAA GAAGGCGCGT GGTAAGCGCA AGCCCTTGCC CATCGAGCTG CCGCGCATCG ACGTCGTCCA TGACGTCCCC GAGGCCGAGC GCACCTGCGC CTGCGGCACG CCCATGGTCG AGATCGGCCA GGACGTCAGC GAACAGCTCG ACATCGTCCC GATGCAGGTG CGTGTGCTGC GCCATATCCG CAAGCGCTAC GGCTGCCCCG AGGGCGACCA GGCGCCGGTC ACCGCCCGCG CCCCGGCGCA GGTGCTGCCC AAGAGCAACG CCAGCAACGA CCTGCTGGCC TTGCTGATCG TCATCAAGTA CGTCGATGGG CTGCCGCTGG CGCGTTTCGA GTACGTGCTC GCTCGCGCAG GCGTGCTTGT GCCGCGCCAG ACCCTGGCGC GCTGGGTGAT CGGTACCGCC CAGGCGCTGC AGCCGCTCGC CAACCTGATG CGCGACGTGC TGCTCGGGCA CGACGTCATC CACATGGACG AAACCCCGGT GCAGGTGCTC AAGGAGCCTG GCCGGGCAGC CACGAGCAAG AGCCAGATGT GGGTGCAGCG CGGCGGACCG CCGGGCAAGC CGGTGGTCCT CTTCGAGTAC GATCCGAGCC GCGCGCAGGC GGTGCCCTTA CGCCTGCTCG AAGGCTGGAA GGGGCATCTG ATGGCCGACG GGCTCGAGAG CTACGGCGCA ATTGCCTTCA CCGAAGGGGT GACCCGGCTC GGTTGCTGGG TGCACGCGCG ACGTCGTTTC GTCGATGCCA GCAAGGTGCT GCCTGCCGGC AAGCGCGGCC GCGCCCACGA AGCGCTGGCC CTGATCGGCA AGCTCTACGC CATCGAGAAG GACGCGCGCG AACTGAACGA CGCCCAGCGC CTGGCGCTAC GCCAGAGCAG AAGCCGCGCC GTCATCGACG AACTGCGCCG TTGGCTCGAC CAAGTGCTCC CCACCGTGCC GCCCACCTCG GTGCTCGGGG GTGCCCTGGG CTACCTGCAT CGGCAGTGGC CGCGTCTGAC GCGCTACCTC GAGCGCGGCG ATCTGCCGAT CGACAACAAC CCCGCCGAAA ACGCCATCCG TCCCTTCGTG GTCGGGAGAA AGGCATGGCT CTTCTCGGAC ACTCAGGCCG GTGCGCGTGC CAGCGCACTC CTCTACTCGC TGGTCGAAAC CGCCAAGGCC AACGGCTTCG AGCCGTATCT GTGGCTGCGC CACGTGCTGC GCGCCTTGCC CACCGCGACC AACGTCGAAC ATTTCGAGGC CCTCCTGCCC TGGAATCTCA AGGCTGAGCA GTTGATCACG GCGTAA
|
Protein sequence | MPQRTAAPRS NPSVGVCASV LAELLPDDPA TLKALLLAQQ RAFETREAER QAAFEAREAA LQKVFDAREA ERQRAFDARE AELQKAFEAR ILELYEQLRL ARRRMFGPSS ESHAGQAWLF DEAEALAESA PEALDTATLP PPATETTGEA SADTGKKKAR GKRKPLPIEL PRIDVVHDVP EAERTCACGT PMVEIGQDVS EQLDIVPMQV RVLRHIRKRY GCPEGDQAPV TARAPAQVLP KSNASNDLLA LLIVIKYVDG LPLARFEYVL ARAGVLVPRQ TLARWVIGTA QALQPLANLM RDVLLGHDVI HMDETPVQVL KEPGRAATSK SQMWVQRGGP PGKPVVLFEY DPSRAQAVPL RLLEGWKGHL MADGLESYGA IAFTEGVTRL GCWVHARRRF VDASKVLPAG KRGRAHEALA LIGKLYAIEK DARELNDAQR LALRQSRSRA VIDELRRWLD QVLPTVPPTS VLGGALGYLH RQWPRLTRYL ERGDLPIDNN PAENAIRPFV VGRKAWLFSD TQAGARASAL LYSLVETAKA NGFEPYLWLR HVLRALPTAT NVEHFEALLP WNLKAEQLIT A
|
| |