Gene Information |
Locus tag | Tmz1t_0010 |
Symbol | |
ID | 7085108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 14957 |
End bp | 16447 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643697060 |
Product | transposase, IS21 family |
Protein accession | YP_002353709 |
Protein GI | 217968475 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
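The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon; both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 14957, 16447
n_bp = gene_length(start_bp, end_bp)
n_aa = protein_length(n_bp)
print(n_bp, n_aa)  # 1491 496
```

Under these assumptions the computed values agree with the Gene Length (1491 bp) and Protein Length (496 aa) fields.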
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCTGT CCCCTGAACA CGAGGCGCAG ATCCTGCGCT ACCACCATGC CGAGCGCTGG CGCATCGGCA CCATCGCCGC GCAGCTCGGG CTGCACCGCG ACACCGTCGC GCGCGTGCTC ACCCAGGCCG GGCTGCCCCG GCACGGCGCG GTGCAGCGCG CCTCGGCGAT CGACCCCTAT CTGCCCTTCC TCCATGAGAC GCTCGCCCAG TTTCCGCGCC TCACCGCCGC ACGGCTCTAC GACATGGTGC GCGCGCGCGG TTACCCGGGG CGCCCCGATC ACTTCCGCCA CCTGATCGCC CGCCACCGCC CGCGCCCGAG CGCCGAGGCC TACCTGCGGC TACGCACCCT GCCCGGCGAG CAGGCCCAGG TCGACTGGGG GCACTTCGGG CACCTGACGA TCGGGCGCGC GCGCCGTCCG CTGATGGCCT TCGTGATGGT GCTGTCGTGG TCGCGCCAGC TCTACCTGCA CTTCTTCCTC GATGCGCGCA TGGAGAACTT CCTGCGCGGT CACGTCGGCG CCTTCGCGCG CTGGGGCGCC GTGCCGCGCA TCGCCCTGTA CGACAATCTG AAGAGCGCCG TCCTCGAGCG CTGCGGCAAC GCGATCCGCT TCCATCCGAG CTTGCTCGCG CTCGCCGGCC ACTACCGCTT CGAGCCGCGC CCGGTGGCGG TGGCGCGCGG CAACGAGAAG GGACGCGTCG AGCGCGCGAT CCGCTACGTG CGCGAGGCCT TCTTCGCCGG GCGCCCCTTC GTCGATCTGG ACGACCTGAA CGCGCAGGCC CAGGCCTGGT GCGAGGGCGC CGCCGGCGAG CGGCGTTGCC CCGACGACCC ATCGATCACG GTGACCGAGG CCTTCGGGAT GGAGCGCGAG CGCCTGCTCG CGCTGCCCGA GGCGCCGTTT CCGACCGACG AGGTGCGCGC GGTGTCGGCG GGCAAGACCC CTTACGTGCG CTTCGATCTG AACGACTACT CGATCCCCCA CACCCATGTG CGGCGCACCC TCACCGTGTG CGCCGACCCG CTCCGGGTGC GCATCCTCGA CGGCCAGGAC GTCATCGCCA CCCATGCGCG CAGCTACGAC CGCCGCCAGC AGATCGAGTG TGCCGCGCAC CTCGAGGCGC TCGTCGCGCA CAAGCACGCG GCCAGCGCCC ACCGCGCCAC CGACCGCCTG ACGGCGGCCG TGCCCGCCTG TCAGGCGCTG CTCGCCCAGG CCGCCGAGCG CGGTGAGCCG CTCGCGCGCA CCACGCGCGC GCTCACCGAT CTGCTCGATC GCTACGGCGC GAGCGAACTG GCCGCCGCCG TCGACGAGGC GCTCGCACGC GGCGTGCCGC ACCCCAACGC GGTGCGCATG GCGCTCGAGC GCCGGCGCGA GGCGCCCCCG CCGCTGGGTG TGCCGCTGCC CGCGCATCTG AAGACGCGCG ACGTCACCGT ACGCGCCCAT TCCTTGGCGG GCTACGACCG CCTGCTGGAG GACGATCATG ACGACGCCTG A
|
Protein sequence | MALSPEHEAQ ILRYHHAERW RIGTIAAQLG LHRDTVARVL TQAGLPRHGA VQRASAIDPY LPFLHETLAQ FPRLTAARLY DMVRARGYPG RPDHFRHLIA RHRPRPSAEA YLRLRTLPGE QAQVDWGHFG HLTIGRARRP LMAFVMVLSW SRQLYLHFFL DARMENFLRG HVGAFARWGA VPRIALYDNL KSAVLERCGN AIRFHPSLLA LAGHYRFEPR PVAVARGNEK GRVERAIRYV REAFFAGRPF VDLDDLNAQA QAWCEGAAGE RRCPDDPSIT VTEAFGMERE RLLALPEAPF PTDEVRAVSA GKTPYVRFDL NDYSIPHTHV RRTLTVCADP LRVRILDGQD VIATHARSYD RRQQIECAAH LEALVAHKHA ASAHRATDRL TAAVPACQAL LAQAAERGEP LARTTRALTD LLDRYGASEL AAAVDEALAR GVPHPNAVRM ALERRREAPP PLGVPLPAHL KTRDVTVRAH SLAGYDRLLE DDHDDA
|
| |
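The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that display formatting, compute a GC fraction, and take a reverse complement using only the Python standard library. The helper names are hypothetical, and the reverse-complement step assumes that a "-" strand gene is stored as the reverse complement of the replicon sequence, which this record does not state explicitly. The record does not say how the GC content field was computed or rounded, so treat any comparison with it as approximate.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces/newlines used for display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Translation table for complementing unambiguous DNA bases only.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence (A, C, G, T)."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two display blocks of the gene sequence above.
head = clean_sequence("ATGGCCCTGT CCCCTGAACA")
print(len(head), gc_fraction(head))  # 20 0.6
print(reverse_complement(head))
```

Passing the full Gene sequence field through clean_sequence and gc_fraction gives a value that can be compared against the reported GC content.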