Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3989 |
Symbol | |
ID | 7873635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4386283 |
End bp | 4387353 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643700926 |
Product | N-6 DNA methylase |
Protein accession | YP_002890949 |
Protein GI | 237654635 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCAGC AAGCCCTCTC CGCCCTCATC TGGTCGGTCG CCGACCTCTT GCGCGGCGAC TTCAAGCAGT CCGAATACGG CCGCGTGATC CTGCCCTTCA CCGTGCTGCG CCGGCTCGAC TGCGTGCTGG CGCCGACCAA GGCCGCGGTG CTCGTCGAAC ACCGCGACAA GGAGCAGGCC GGGCTGCTCT ACCTGGTGGT GGAAAAGTTC GCCCACATCG AGCCCCACCC CAGGCGCGTC GACAACGTGC ACATGGGCCT GGTCTTCGAG GAGCTGATCC GCAAGTTCGC CGAGATCTCC AACGAGACCG CCGGCGAGCA CTTCACCCCG CGCGAGCTCA TCCGCCTGAT GGTGAGCCCG CTCTTCATCG AGGACGACGA GGCGCTGTCC AAGCCCGGCA TCGTGCGCAC CATCTACGAC CCCACCGCCG GCACCGGCAC CGGCCGCATG CTGTCGGTGG CGGGCGAGCA CCTGCACGAG ATCAAGCCCG GCGCGCGCCT CACCATGTTC GGCCAGGAGC TCAACCCCGA GTCCTACGCC ATCTGCAAGG CCGACATGCT GATCAAGGGC CAGGACGTGC GCAGCATCGT GCTCGGCAAC ACGCTGTCCG AGACCCACAT CGGCGAGATC ACCCGCCTGC TCGGCGAATT CCTCGAAGCC GAGCAGGCGG TGGTGAGCGA CGCCCAGGGC AAGGAGCTCG CGCGCGTGAC CCTCTTCCCC GAGGTGCGCT GCCCGGCCGC GCCCGCGGGC GGCAAGGTCA AGCGTGTGCC CATCGCCCGC GTCTTCCGCA ACCAGGACTT CGGCTACCGC ACGATCACCA TCGAGCGCCC GCTGCGCGAC GCCGAGAACG TGCCGCTGTT CGAGGACGTG CAGGCCTGGT TCGAGCGCGA GGTGCTGTCC CACGCCCCCG ACGCCTGGAT CGACCACGAC AAGACCCGGA TCGGCTATGA GATCCCCTTG AACCGCCACT TCTACGTTTT CGAGCCGCCG CGGCCGCTGG CGGAGATCGA CGCCGACCTG AAGCGCTCGA TGGACCGGAT CAAGCAGATG ATCGAGGGGC TGGCGGGATG A
|
Protein sequence | MNQQALSALI WSVADLLRGD FKQSEYGRVI LPFTVLRRLD CVLAPTKAAV LVEHRDKEQA GLLYLVVEKF AHIEPHPRRV DNVHMGLVFE ELIRKFAEIS NETAGEHFTP RELIRLMVSP LFIEDDEALS KPGIVRTIYD PTAGTGTGRM LSVAGEHLHE IKPGARLTMF GQELNPESYA ICKADMLIKG QDVRSIVLGN TLSETHIGEI TRLLGEFLEA EQAVVSDAQG KELARVTLFP EVRCPAAPAG GKVKRVPIAR VFRNQDFGYR TITIERPLRD AENVPLFEDV QAWFEREVLS HAPDAWIDHD KTRIGYEIPL NRHFYVFEPP RPLAEIDADL KRSMDRIKQM IEGLAG
|
| |