Gene Information |
Locus tag | Tmz1t_0273 |
Symbol | |
ID | 7085574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 310121 |
End bp | 311716 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643697314 |
Product | transposase IS66 |
Protein accession | YP_002353962 |
Protein GI | 217968728 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
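The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Expected amino-acid count for an unspliced CDS.

    Assumes the CDS length is a multiple of three and that the final
    codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values copied from the record for Tmz1t_0273.
length_bp = gene_length(310121, 311716)
print(length_bp)                           # 1596
print(expected_protein_length(length_bp))  # 531
```

Both results match the Gene Length (1596 bp) and Protein Length (531 aa) fields of the record.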
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCGC CGTCTTCCGC CCCTGTTCCG ACCCTTGCCG AGGCCGCCCA GTGGGCGCCC GAGCGCGTCG TCGAGCTGGC CCAGGCGCAT GAGGATCTGC AGCGCCAAGT GCAGACCATC CAACATCAGC TCGAGTGGTT CCGCCGCCAG CTCTTCGGCC AGAAGAGCGA GAAGCGCCTG GTCTCACCGG ACCCGGCGCA GATGCATCTG GGCGAGTTGC CGATCCCCGA CACACAGCCC GAACTCGCCG GCAAGACCGT GGCGGGACAC ACCCGCCGTG CGCCGCGCAC CGACTACGCA CAGGACAAGG ACGAATCGGC GCTGTTCTTC GACGAGACGC GCGTCCCGAT CGAGACCATC ACGCTCGCCA ACCCCGAGAC CGAGGGGCTT GCCGCCGACC AGTTCGAGGT GATCGGCGAG AAGGTCAGCC ACCGCTTGGC GCAGCGCCCG GGCAGCTACG TGATCCTCAA GTACGTGCGC CCGGTCATCA AGCGTCGCGA CACGCAGACG ATCCACTGCC CGGCGGCGCC CGCCGGGGTG CTCGAGGGCA GCCGCGCCGA CGTGAGTTTC CTGGTCGGGC TGCTGCTCGA CAAGTTCGCC TGGCACCTCC CGCTGTACCG CCAGCATCAG CGCCTGGCGG ACGCGGGCAT CACGGTCAGC CGCGCCTGGC TCACGCAACT GGCTGCGCAG GCGGCCGCAC TGCTCGTGCC GATCTACGAG GCGCAGCTCG CATCGATCCG CGCCAGCCGC GTCAAGGCCA TGGACGAGAC CCCGATCAAG GCCGGGCGGG CCGGCCCGGG CAAGATGAAG GCCTGCTACT TCTGGCCAGT CTATGGCGAG TTGCACGAGA TCTGTTTCCC GTTCTTCGAC AGCCGCGCGC ACAGCAACGT CGAGAAGGTG CTTGGGCTCA AGCCCACCGA GGGGGGTGTG CTGCTCTCGG ACGGCTATGG CGCCTACGAG ACCTACGCGA GCAAGACCGG GCTCACGCAC GCGCAGTGCT GGACGCACTG CCGACGCGAA TTCATCAACG CCGAAGCGGC CGAGCCCGAG CTGGCGGCCA AGGCGCTCGA GTTCATCGGC GCGCTCTACA CGGTCGAAGC GAAGATCCGC GACGACAAGC TCAAGGGCGA GGCCAAGCGC GAGTACCGGC TCGATCACGC GCGGCCGATC GTGGCGGCCT TCTTCACGTG GGTACAGGAA CGCCTGGACG CGCAAGGGCT GCTGCCGAGC AGCCCGCTGA CCAAGGCGCT GATCTACGCG CACAAGCGAC GGGCGGCACT GGAAGTGTTC CTGGCCGACC CGGATGTGCC GATCGATACC AACCACCTTG AACGGGCGCT ACGGCCAATT CCGTTGGGTC GAAAAAATTG GATGTTCAGC TGGACCGAGC TCGGCGCCCA GCATGTCGGC GTCGTCCAGA GCCTGATTGC GACCTGTCGG CTGCATGAAC TCGATCCGTA CGACTATCTC GTCGACGTGC TCCAGCGTGT CGACCAACAT CCGGCCGCAG ACGTCGCACA GCTGACGCCA AGGCTGTGGA AGCAGCACTT CGGCAAAGCG CCGTTGCGCT CGGATATCTC AAGGCGCGCT GCGTAG
|
Protein sequence | MPAPSSAPVP TLAEAAQWAP ERVVELAQAH EDLQRQVQTI QHQLEWFRRQ LFGQKSEKRL VSPDPAQMHL GELPIPDTQP ELAGKTVAGH TRRAPRTDYA QDKDESALFF DETRVPIETI TLANPETEGL AADQFEVIGE KVSHRLAQRP GSYVILKYVR PVIKRRDTQT IHCPAAPAGV LEGSRADVSF LVGLLLDKFA WHLPLYRQHQ RLADAGITVS RAWLTQLAAQ AAALLVPIYE AQLASIRASR VKAMDETPIK AGRAGPGKMK ACYFWPVYGE LHEICFPFFD SRAHSNVEKV LGLKPTEGGV LLSDGYGAYE TYASKTGLTH AQCWTHCRRE FINAEAAEPE LAAKALEFIG ALYTVEAKIR DDKLKGEAKR EYRLDHARPI VAAFFTWVQE RLDAQGLLPS SPLTKALIYA HKRRAALEVF LADPDVPIDT NHLERALRPI PLGRKNWMFS WTELGAQHVG VVQSLIATCR LHELDPYDYL VDVLQRVDQH PAADVAQLTP RLWKQHFGKA PLRSDISRRA A
|
| |
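The sequences above are displayed in blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of that step plus a few basic CDS sanity checks. The helper names are hypothetical, the stop codons are those of translation table 11 (TAA, TAG, TGA), and the ATG check reflects this record only, since table 11 also permits alternative start codons. The example input is a shortened toy string, not the record's full sequence.

```python
# Stop codons in NCBI translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ungroup(seq: str) -> str:
    """Remove the display whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def check_cds(seq: str) -> dict:
    """Return simple sanity checks for a coding sequence given as grouped text."""
    dna = ungroup(seq)
    return {
        "length": len(dna),
        "multiple_of_three": len(dna) % 3 == 0,
        "starts_with_atg": dna.startswith("ATG"),
        "ends_with_stop": dna[-3:] in STOP_CODONS,
    }


# Toy input: a made-up 15-base string in the same grouped layout.
print(check_cds("ATGCCCGCGC CGTAG"))
```

Pasting the full Gene sequence field into `check_cds` in place of the toy string would apply the same checks to the record itself.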