Gene Information |
Locus tag | Tmz1t_2345 |
Symbol | |
ID | 7094267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011667 |
Strand | - |
Start bp | 7246 |
End bp | 10212 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643701035 |
Product | transposase Tn3 family protein |
Protein accession | YP_002364176 |
Protein GI | 217980126 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
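The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a gene length that includes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(7246, 10212)
print(length_bp, protein_length(length_bp))  # 2967 988
```

Both results match the Gene Length (2967 bp) and Protein Length (988 aa) fields.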
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCCGCC GTTCGATCCT GTCTGCCGCC GAGAGCGAAA TCCTGTTGGC GTTACCGGAA ACGAAGGAGG ACTTGATCCG CCACTACACG TTCAACGAAA CCGACCTCTC CATCATCCGG CAGCGGCGCG GCCCAGCCAA TCGTCTTGGC TTCGCGGTAC AGCTCTGCTA CATGCGCTTT CCCGGCGTGA TGCTCGCGGT AGATGTGGAG CCGTTTCCGC CGTTGCTGCG TGTGGTGGCT GCGCAGTTGA AGGTGCCACC AGAGGCGTGG GCGGACTATG GCCAGCGCGC CGAGACACGC CGCGAGCACC TGCTGGAACT GCAATCGGTC TTTGGTTTCC AGACCTTTGC AACGCGGCAC TATCGCCCCA GCGTGCACGC CCTGGATGAA CTGGCTTGGC AGACAGACAA AGGTTTCGTG TTGGCGACGG AGTTGGTCGA AGGGCTGCGA CGAAAGAGCG TGTTGCTCCC GTCACCAGGC GTCATCGAGC GCATCTGCGC CGAGGCGATT ACCCGTGCCA ACCGGCGCAT CTACGACACG CTGTCCGCGC CGCTGACAGA CACCCACCGG CATCGCCTCG ACGAGCTACT GAAACGCCGA TACGACGGCA AGACGACCTG GCTGGCCTGG CTGCGCCAGT CACCCGCCAA GCCGAATTCG CGGCACATGC TCGAACACAT CGAACGCCTC AAGGCGTGGC AGGCACTCGA CTTGCCTTCC GGCATCGAGC GGCTGGTCCA CCAGAACCGC CTGCTCAAGA TTGCCCGCGA AGGCGGCCAG ATGACGCCTG CCGACCTGGC GAAGTTCGAG CCGCAGCGTC GTTACGCCAC CCTGGTGGCG CTCGCCATCG AGGGCATGGC CACCGTCACC GACGAAATCA TCGACCTGCA CGATCGCATC CTGGGCAAGC TGTTCAATGC CGCCAGGAAC AAGCATCAAC AGCAGTTCCA GGCAGCCGGC AAGGCGATCA ACGCCAAGGT GCGCCTGTTC GGCCGCATCG GCCGGGCGCT GATCGAAGCC AAGCAGGCGG GCAGCGACCC GTTCGCCGCC ATCGAGGCGG TCATGTCGTG GGACGCCTTT ACCGAGAGCG TCACCGAGGC GCAGCGGCTC GCGCAGCCCG AGGACTTCGA TTTCCTGCAT CGCATCGGCG AGCACTACGC AACGTTGCGC CGCTACGCGC CGGAATTCCT CGCCGTGCTC AAGCTGCGGG CCGCGCCTGC CGCCAAGGAC GTGCTCGACG CCATCGAGGT GCTACGCGGC ATGAACAGCG ACAACGCCCG CAAGGTGCCC GCCGATGCGC CGACCGACTT CATCAAGCCG CGCTGGCAGA AGCTGGTGAT GACCGACACC GGCATCGACC GGCGCTACTA CGAACTGTGC GCACTGTCAG AACTGAAGAA CGCGCTGCGC TCGGGCGACA TCTGGGTGCA AGGCTCGCGC CAGTTCAAGG ACTTCGAGGA CTACCTGGTC CCGCCCACGA AGTTCGCCAG CCTCAAGCAG GCCCGCGAAT TGCCGCTGGC CGTGGCCACC GACTGCGACC AATACCTGCA CGACCGGCTG ACGCTGCTGG AAACCCAGCT CGCCACCGTC AACCGGATGG CGCTGGCCAA CGAACTGCCG GACGCCATCA TCACGGGGTC GGGCCTGAAG ATCACGCCTC TGGATGCGGC GGTGCCCGAC ACCGCGCAGG CGCTGATCGA CCAGACGGCG ATGATCCTCC CGCACGTCAA GATCACCGAA CTGCTGCTGG AAGTGGACCA ATGGACGGGC TTCACCCGGC ACTTCACGCA CCTCAAGTCG 
GGCGACCTGG CCAAAGACAG GAACCTGCTG CTGACCACGA TCCTGGCCGA CGCAATCAAC CTCGGCCTGA CCAAGATGGC GGAATCCTGC CCGGGCACGA CCTACGCCAA GCTGGCCTGG CTGCAGGCCT GGCACATCCG CGATGAAACC TACTCGACGG CGCTGGCTGA ACTGGTCAAT GCGCAGTTCC GGCATCCTTT CGCCGAGCAC TGGGGTGACG GCACCACGTC CTCGTCGGAC GGCCAGAACT TCCGCACCGG CAGCAAGGCC GAAAGTACCG GCCACATCAA CCCGAAATAC GGCAGCAGTC CCGGGCGGAC GTTCTACACC CACATCTCCG ACCAGTACAC GCCGTTCAAC ACCAAGGTCG TCAACGTCGG CGTGCGCGAC TCGACCTACG TGCTCGACGG CCTGCTGTAC CACGAGTCCG ACCTTCGCAT CGAAGAGCAC TACACCGATA CGGCGGGCTT CACCGATCAC GTGTTTGCGC TGATGCACCT GCTGGGCTTT CGCTTCGCGC CGCGCATCCG CGATCTGGGC GACACCAAGC TCTACATCCC GAAGGGCGAT GCTGGCTATG AGGCGCTGAA GGCCATGATC GGCGGCACGG TCAACATCAA GCATATCCGT GCCCATTGGG ACGAAATCCT ACGGCTGGCC ACCTCGATCA AGCAGGGTAC GGTGACGGCC TCGCTGATGC TCAGGAAGCT TGGGAGCTAC CCACGCCAGA ACGGCCTTGC CGTCGCCCTG CGCGAGCTCG GCCGCATCGA GCGCACGCTG TTCATTCTGG ACTGGCTGCA AAGCGTTGAG CTGCGCCGCC GCGTGCATGC CGGGCTGAAC AAGGGCGAGG CCCGAAACGC GCTGGCCCGC GCCGTGTTCT TCAACCGGCT GGGCGAAATC CGCGACCGCA GTTTCGAGCA GCAACGCTAC CGAGCCAGCG GCCTCAACCT GGTGACGGCG GCCATCGTGT TGTGGAATAC GGTCTATCTG GAGCGGGCGG CGAACGCCCT GCGTGGCCAT GGCCACGCTG TCGATGACGC CCTGTTGCAG TACCTGTCGC CGCTGGGCTG GGAGCACATC AACCTGACCG GGGATTACCT CTGGCGTAGC AGCGTCAAGA TCGGCGCGGG CAAGTTCAGG CCGCTGCGGC CGCTGCAACC GGCTTAG
|
Protein sequence | MPRRSILSAA ESEILLALPE TKEDLIRHYT FNETDLSIIR QRRGPANRLG FAVQLCYMRF PGVMLAVDVE PFPPLLRVVA AQLKVPPEAW ADYGQRAETR REHLLELQSV FGFQTFATRH YRPSVHALDE LAWQTDKGFV LATELVEGLR RKSVLLPSPG VIERICAEAI TRANRRIYDT LSAPLTDTHR HRLDELLKRR YDGKTTWLAW LRQSPAKPNS RHMLEHIERL KAWQALDLPS GIERLVHQNR LLKIAREGGQ MTPADLAKFE PQRRYATLVA LAIEGMATVT DEIIDLHDRI LGKLFNAARN KHQQQFQAAG KAINAKVRLF GRIGRALIEA KQAGSDPFAA IEAVMSWDAF TESVTEAQRL AQPEDFDFLH RIGEHYATLR RYAPEFLAVL KLRAAPAAKD VLDAIEVLRG MNSDNARKVP ADAPTDFIKP RWQKLVMTDT GIDRRYYELC ALSELKNALR SGDIWVQGSR QFKDFEDYLV PPTKFASLKQ ARELPLAVAT DCDQYLHDRL TLLETQLATV NRMALANELP DAIITGSGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDQWTG FTRHFTHLKS GDLAKDRNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHIRDET YSTALAELVN AQFRHPFAEH WGDGTTSSSD GQNFRTGSKA ESTGHINPKY GSSPGRTFYT HISDQYTPFN TKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG DTKLYIPKGD AGYEALKAMI GGTVNIKHIR AHWDEILRLA TSIKQGTVTA SLMLRKLGSY PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNALAR AVFFNRLGEI RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERAANALRGH GHAVDDALLQ YLSPLGWEHI NLTGDYLWRS SVKIGAGKFR PLRPLQPA
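As a rough illustration of how the gene and protein sequences relate, the sketch below translates a nucleotide string and computes its GC fraction. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 18 bases of the gene sequence above, including the grouping space.
print(translate("ATGCCCCGCC GTTCGATC"))  # MPRRSI
print(gc_content("ATGCCCCGCC"))          # 0.8 (this 10-base prefix only)
```

The translated prefix agrees with the first six residues of the protein sequence (MPRRSI). Applying `gc_content` to the full gene sequence should give a value close to the reported 65%, though that figure is not recomputed here.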