Gene Information |
Locus tag | Tmz1t_2145 |
Symbol | |
ID | 7085498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2421363 |
End bp | 2424329 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643699165 |
Product | transposase Tn3 family protein |
Protein accession | YP_002355781 |
Protein GI | 217970547 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
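The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon is a
    stop codon, which does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Values taken from the record above (Tmz1t_2145).
gene_len, protein_len = cds_lengths(2421363, 2424329)
print(gene_len, protein_len)
```

With the values from this record the check gives 2967 bp and 988 aa, matching the Gene Length and Protein Length fields.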
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCGCC GTTCGATCCT CTCCGCCGCC GAGCGCGAAA GCCTGCTGGC GTTGCCGGAC ACCAAGGATG AGTTGATCCG TCACTACACG TTCAGCGAAA GCGACCTCTC CATCATCCGG CAGCGGCGCG GCCCGGCCAA TCGGCTGGGC TTCGCGGTGC AGCTCTGCTA CCTGCGCTTT CCCGGCGTCA TCCTTGGCGC TGATGAGCCA CCGTTCCCGC CATTGCTGAG ACTGGTCGCC AACCAGCTCA AGGTCGGCAT CGAAAGCTGG GACGAGTACG GGCAGCGTGA GCAGACCCGA CGCGAGCACC TGGTCGAGCT GCAAACGGTG TTCGGCTTCC AGCCGTTCAC GATTGGCCAC TACCGGCAGG CTGTCCAGTT GCTGACCGAG CTGGCCATGC AAACCGACAA GGGCATCGTG CTGGCCAGAG CCTTGATCGA GCACCTGCGG CGGCAGTCGG TCATTGTGCC CGCCCTCAAC GCCGTCGAGC GGGCGAGCGC CGAAGCGATT ACCCGCGCCA ACCGGCGTCT CTACGACGCC TTGGCTGAGC CGCTGACGGA CGTGCATCGC CGTCGCCTCG ACGATCTGCT CAAGCGCCGC GACAACGGCA AGACGACGTG GCTGGCCTGG CTGCGGCAAT CCCCGGTCAA ACCGAACTCG CGGCACATGC TGGAACACAT CGAACGCCTC AAGGCGTGGC AGGCGCTCGA CCTGCCCTCC GGCATCGAGC GGCTGGTTCA CCAGAACCGG CTGCTCAAGA TCGCCCGCGA GGGCGGCCAG ATGACGCCCG CCGACCTGGC GAAGTTCGAG CCGCAGCGGC GTTACGCGAC CCTGGTGGCG CTCGCCATCG AGGGCATGGC CACCGTCACC GACGAAATCA TCGACCTGCA TGACCGCATC CTGGGCAAGC TGTTCAATGC CGCCAAGAAC AAGCATCAGC AGCAGTTCCA GGCATCCGGC AAGGCGATCA ATGCCAAGGT GCGGCTGTTC GGGCGCATCG GCCAGGCGCT GATCGAGGCC AAGCAAGCGG GCCGCGATCC GTTCGCCGCC ATCGAGGCCG TCATGTCCTG GGATGCTTTC GCCGAGAGCG TCACCGAAGC GCAGCGGCTC GCGCAACCCG AGGACTTCGA TTTCCTGCAC CGCATCGGCG AGAGCTACGC CACGCTGCGC CGCTACGCGC CGGAATTTCT CGACGTGCTC AAGTTGCGGG CCGCGCCCGC CGCCAAGGAC GTACTCGACG CCATCGAGGT GCTGCGCAGC ATGAACAGCG ACAACGCCCG CAAGGTGCCC ACCGACGCGC CGACCGAGTT CATCAAGCCG CGCTGGCAGA AGCTGGTGAT GACCGACACC GGCATCGACC GGCGCTACTA CGAACTGTGC GCGCTGTCGG AGCTGAAGAA CGCGCTGCGC TCCGGCGACA TCTGGGTGCA AGGCTCGCGC CAGTTCAAGG ACTTCGAGGA CTACCTGGTG CCGCCCGCGA AATTCGCCAG CCTCAAGCAG GCCAGCGAAT TGCCGCTGGC CGTGGCCACC GATTGCGACC AGTACCTGCA TGACCGGCTG ACGCTGCTGG AAACGCAGCT CGCCACCGTC AACCGCATGG CGCTGGCCAA CGAGCTGCCG GACGCCATCA TCACGGAGTC GGGCCTGAAG ATCACGCCGC TCGATGCGGC GGTGCCCGAT ACCGCACAGG CCCTGATCGA CCAGACGGCG ATGATCCTAC CGCACGTCAA GATCACCGAA TTGCTGCTGG AGGTAGACGA ATGGACGGGC TTCACCCGGC ACTTCGCCCA CCTGAAGTCA 
GGCGACCTGG CCAAGGACAA AAACCTGTTG CTGACCACGA TCCTCGCCGA CGCGATCAAC CTGGGCCTGA CCAAGATGGC GGAATCGTGC CCCGGCACGA CCTACGCCAA GCTGGCCTGG CTGCAAGCCT GGCACATCCG CGACGAAACC TACGGGGCGG CACTGGCCGA GCTGGTCAAC GCGCAGTTCC GACATCCCTT CGCCGAGCAT TGGGGCGACG GCACCACGTC ATCGTCGGAC GGCCAGAACT TCCGCACCGG CAGCAAGGCC GAGAGCACCG GCCACATCAA TCCGAAATAC GGCAGCAGCC CAGGGCGGAC GTTCTACACC CATATCTCCG ACCAGTACGC GCCGTTCCAC ACCAAGGTCG TGAACGTCGG CGTGCGCGAC TCGACCTACG TGCTCGACGG CCTGCTGTAT CACGAATCCG ACCTGCGGAT CGAGGAGCAC TACACCGACA CGGCAGGGTT CACGGACCAC GTCTTCGCGT TGATGCACCT GCTGGGCTTC CGCTTCGCCC CGCGCATTCG TGACCTGGGC GACACCAAGC TCTACATCCC GAAGGGCGAT GCCACCTACG AGGCGTTGAA ACCGATGATC GGCGGCACGC TCAACATCAA GCACGTCCGC GCCCATTGGG ATGAAATCCT GCGGATGGCC ACCTCGATCA AGCAGGGCAC GGCGACGGCC TCGCTGATGC TCAGGAAGCT TGGCAGCTAC CCGCGCCAGA ACGGCCTGGC CGTCGCCCTG CGTGAGCTGG GACGCATCGA GCGCACACTG TTCATCCTGG ACTGGCTGCA AAGCGTCGAG CTGCGCCGCC GCGTGCATGC CGGGCTGAAC AAAGGCGAGG CGCGCAACGC GCTGGCCCGC GCCGTGTTCT TCAACCGCCT GGGGGAAATC CGCGACCGCA GCTTCGAGCA GCAGCGCTAC CGGGCCAGCG GCCTCAACCT GGTGACGGCG GCCATCGTGC TATGGAACAC GGTCTATCTG GAGCGGGCCG CGAACGCCTT GCGTGGCCAC GGTCAAGCCG TCGATGACGG CCTGTTGCAG TACCTGTCGC CGCTCGGCTG GGAGCACATC AACCTGACCG GCGATTACCT CTGGCGCAGC AGCGCCAAGA TCGGCGCGGG CAAGTTCAGG CCGCTACGGC CGCTGCAACC GGCTTAG
|
Protein sequence | MPRRSILSAA ERESLLALPD TKDELIRHYT FSESDLSIIR QRRGPANRLG FAVQLCYLRF PGVILGADEP PFPPLLRLVA NQLKVGIESW DEYGQREQTR REHLVELQTV FGFQPFTIGH YRQAVQLLTE LAMQTDKGIV LARALIEHLR RQSVIVPALN AVERASAEAI TRANRRLYDA LAEPLTDVHR RRLDDLLKRR DNGKTTWLAW LRQSPVKPNS RHMLEHIERL KAWQALDLPS GIERLVHQNR LLKIAREGGQ MTPADLAKFE PQRRYATLVA LAIEGMATVT DEIIDLHDRI LGKLFNAAKN KHQQQFQASG KAINAKVRLF GRIGQALIEA KQAGRDPFAA IEAVMSWDAF AESVTEAQRL AQPEDFDFLH RIGESYATLR RYAPEFLDVL KLRAAPAAKD VLDAIEVLRS MNSDNARKVP TDAPTEFIKP RWQKLVMTDT GIDRRYYELC ALSELKNALR SGDIWVQGSR QFKDFEDYLV PPAKFASLKQ ASELPLAVAT DCDQYLHDRL TLLETQLATV NRMALANELP DAIITESGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDEWTG FTRHFAHLKS GDLAKDKNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHIRDET YGAALAELVN AQFRHPFAEH WGDGTTSSSD GQNFRTGSKA ESTGHINPKY GSSPGRTFYT HISDQYAPFH TKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG DTKLYIPKGD ATYEALKPMI GGTLNIKHVR AHWDEILRMA TSIKQGTATA SLMLRKLGSY PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNALAR AVFFNRLGEI RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERAANALRGH GQAVDDGLLQ YLSPLGWEHI NLTGDYLWRS SAKIGAGKFR PLRPLQPA
|
| |
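The gene sequence as displayed begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch of how the listed protein sequence and GC content could be re-derived from it. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons); the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-orientation sequence up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 9 nucleotides of the gene sequence in this record.
first = translate("ATGCCCCGCC GTTCGATCCT CTCCGCCGCC")
last = translate("CCGGCTTAG")
print(first, last)
```

The two fragments translate to MPRRSILSAA and PA, matching the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` would allow the GC content field to be checked in the same way.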