Gene Information |
Locus tag | Tmz1t_0959 |
Symbol | |
ID | 7085062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1049665 |
End bp | 1052784 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643697981 |
Product | DNA methylase N-4/N-6 domain protein |
Protein accession | YP_002354621 |
Protein GI | 217969387 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2189] Adenine specific DNA methylase Mod |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
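The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 1049665
end_bp = 1052784
gene_length_bp = 3120
protein_length_aa = 1039


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the derived values with the values reported in the record.
print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

Under these assumptions, the reported gene length and protein length are both consistent with the start and end coordinates.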
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.524193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA AACAGAATCC CAAATTCCAG GAGCTCGTCG TCAAGCTGCG CGAAATCTTC CAGATTGACC GCCCGGAACT GGACTTCGGC ATCTACCGCA TCCTCAACGC CCGTGCAGGT GAGATCAACG ACTACCTGCA AAACCGGCTT GTCGAGAAGG TGCAGGCAGC GCTCAGCAGC GGCAATGAAT CGCAACGCGA GCAAGTGGCG CGCGAACTGA AGGAAAAAGA AGCCCAGTAC CAAGCCGATG GGATCAACCC GGAGACGGTG CCCAAGGTTC AGGAACTTCG GCAGAAGCTG GCGCAGTACA GCACCGGCGC CAGCGAACAC GAGAACGCTG TGTTCTCGCA CCTGCTCACC TTCTTTTCCC GCTATTACCA GGACGGTGAC TTCATCAGCC AGCGCCGCTA CAAGGGTGAC ACCTACGCCA TCCCCTATGC GGGCGAAGAA GTGATGCTGC ACTGGGCTAA CAAGGATCAG TACTACACCA AGAGCGGCGA GAACTTCAGC AACTACAGCT TCAAGCTCGA AGACGGTCGC ACGGTGCATT TCCGCCTCGC CGCCGCCGAC ACTGCCAAGG ACAACCGCAA GGACAACGAC AAGGAGCGTC GCTTTGCCCT GGTGGCGGCG AAAACCGTCA CTCGCCTGGA TGAAAATGGC GACGAATACG AAGAGGAACT GGTGCCCGTG GAAGAGGCAT TGGGCAGTGA CGGCAATAAG GAGCTGATCA TTCGCTTCGA GTACGCCGCC CAGACCAAAG GCACCAAGCA GGAGGCGCTG GTTACCAAGG CCGTAGAGGC AGTGCTGGCG GATGCTTCCG TCAAGGCCCG CTGGCTGGCC CTGGGCAACC GCGCGCCCAC CGAGAAGAAC CCGCAGCGCA CCCTGCTGGA AAAGCACCTG AGCGACTACA CCACCAAGAA CACGGCGGAC TACTTCATCC ACAAAGATCT GGGTGGTTTC CTCCGGCGCG AGCTGGACTT CTACATCAAG AACGAGGTCA TGCACCTGGA TGATGTGCAG AACGCGGGCG CGTTCGCGGA CATCGAGAAG AACCTTCGGA TGATCCAGTG CCTGCGCAGC ATCGCGCTGG AGCTAATCAC TTTCTTGGCC CAGTTGGAAG ACTTCCAGAA GAAGCTGTGG CTGAAAAAGA AGTTCGTTGT CTCCAGTCAC TACTGCATCA CGCTGGATCG GGTGCCGGAA GCGCTGTGGC CGGAAGTGGT AGCGAACGCG CAGCAATGGG CGCGGTGGAA ACAGCTGGGC GTTTGGGATG GCGACGCACC GGGGACGGTG GAGGACTTGA AGGCTGCGCA GTATCGGATG GTCGATACGG CTTTATTCAA TGACGACTTT AAGCAGCGCC TTTTGGCAAA AATCGAAGAT ATCGAAGCAA GCCTCGGCGG CATCGTCATC AACGGGGATA ACTTCCAGGC GCTCAATTTG GCGAAGTATC GTTACCGAGC GAGCATAGAT TTCACCTATA TTGATCCGCC CTACAACACT GTCCATTCAA AGATCGCATA CAAGAACCAG TTCGAGCACT CAAGCTGGTT GGCTTTGATT TCCAACACGC TGCCATTTAC TCGCGATCTA TTCGGGGAAA TTTATTCATT TGGATTCGCC ATTGACGATT ACGAATATAA CAATGCCTTT CACTGCTTGA GGGGGCATTT CACTGAATGC GATGTCTCGA CCATCGTGAT CAATCACCAT CCACAAGGAT CGGGCGGAAG GCTGTCACGG ACGCACGAGT ACTACATCGT CGCCTCTCCC AAAGATGCGC CGCAATACCT TGGTTTTCCG 
AAAGAGGACG AGACCGAGGA CAGGCAGTTC ATGCGAAGTG GAACGGCTGA CAATAACTAC CGCGCGCCGC GTGCTGGGGG AGTTGGTCGT TGGCGTAGCT TCTACGCTCT TCTCGTCGAC CCATCTACCA AGAAAGTTGT AGGAGCAGAG CCGCCGCCGC CACTTGGAAC TGATTATCCA ACCGGGCCAA CGGCGGAAGG ATTACAAAGA ATCTACCCAA TCAATACCAC TGGTGAGGAG CGCGTTTGGC GGTCATCATA CGAGACGGGC AAAGTACGTG CAGCAAATGG CGAGCTGATC GTTACTGACC GTGGTGCCGT GAAGCAGCTT ATCGATCATC AGGACAAGCG GGAAACGCTC TTCAGTAATT GGATCGGCGC AGACTTCAAT GCCGGAACCA ATGGTACCAA CGTCTTGGAT AATCTCGGGC TCGGTGGAAT TTTTGATTAC CCGAAGTCAG TGAAAACCCT CGAACAATCC TTCTGGATGC AGTCATTCGG GAAGACAAAC TTTACCGTTC TAGATTACTT TGCAGGCTCA GGAACAACTG CGCATGCAAC AATTTCCCTA AATCGACAGG ACAATGCATC GCGCAAGTAC GTTCTAGTCG AGCAAGGTGA GTATTTCGAG ACCGTTCTCA AGCCACGAAT TCAGAAAGTC GTCTTTTCGG CTGATTGGGT TGGCGGCAAG CCGACGTCTT CAGAGACAGG CATTTCGCAT TGCTTTAAGG CAATCAAACT CGAAAGCTAC GAAGACACAC TGAACAACCT GCAACTGAGC CGTACGTCCG CGCAGGGCGA TCTGCTGAAC ACCCTGCCGC AGCCGGCCAA GGAGGACTAC CTGCTCAACT ACGTGCTGGA CGTGGAAAGC CGGGGCTCGT TGCTGTCGGT GGAGGACTTC AGGAAGCCCT TCGACTACAC CCTCAACGTG GCGGTGGACT CGGCGGGCGC GTTCGAGCCG CGCAAGATCG ATCTGGTCGA AACTTTCAAT TTCCTGATCG GCCTGCGCGT CAAGCACATC GATGCCCAGC CGCAGCGCGG CTTCGTCACG GTCACCGGAA CCCTGCCCAG CAATGAGACC TGCCTCGTGC TGTGGCGCGA TTGCGATGTG CTGGACTACG AAGGCATCAG CAAGCTCTGC GACAAGCTGG CCATCAACCC GGCGGACAAT GAGTTTGACG TGGTCTACAT CAACGGCGAC CACAACATTC CCACCGTGCT GACGCAGACG GCCGAGGAAG GCGGTGCCAC CCGCGTGCTC AAGCTGCGCC AGATCGAGCC GGAGTTTCTG GAGCGCATGT TCTCCGTGGA GGACATCTGA
|
Protein sequence | MTTKQNPKFQ ELVVKLREIF QIDRPELDFG IYRILNARAG EINDYLQNRL VEKVQAALSS GNESQREQVA RELKEKEAQY QADGINPETV PKVQELRQKL AQYSTGASEH ENAVFSHLLT FFSRYYQDGD FISQRRYKGD TYAIPYAGEE VMLHWANKDQ YYTKSGENFS NYSFKLEDGR TVHFRLAAAD TAKDNRKDND KERRFALVAA KTVTRLDENG DEYEEELVPV EEALGSDGNK ELIIRFEYAA QTKGTKQEAL VTKAVEAVLA DASVKARWLA LGNRAPTEKN PQRTLLEKHL SDYTTKNTAD YFIHKDLGGF LRRELDFYIK NEVMHLDDVQ NAGAFADIEK NLRMIQCLRS IALELITFLA QLEDFQKKLW LKKKFVVSSH YCITLDRVPE ALWPEVVANA QQWARWKQLG VWDGDAPGTV EDLKAAQYRM VDTALFNDDF KQRLLAKIED IEASLGGIVI NGDNFQALNL AKYRYRASID FTYIDPPYNT VHSKIAYKNQ FEHSSWLALI SNTLPFTRDL FGEIYSFGFA IDDYEYNNAF HCLRGHFTEC DVSTIVINHH PQGSGGRLSR THEYYIVASP KDAPQYLGFP KEDETEDRQF MRSGTADNNY RAPRAGGVGR WRSFYALLVD PSTKKVVGAE PPPPLGTDYP TGPTAEGLQR IYPINTTGEE RVWRSSYETG KVRAANGELI VTDRGAVKQL IDHQDKRETL FSNWIGADFN AGTNGTNVLD NLGLGGIFDY PKSVKTLEQS FWMQSFGKTN FTVLDYFAGS GTTAHATISL NRQDNASRKY VLVEQGEYFE TVLKPRIQKV VFSADWVGGK PTSSETGISH CFKAIKLESY EDTLNNLQLS RTSAQGDLLN TLPQPAKEDY LLNYVLDVES RGSLLSVEDF RKPFDYTLNV AVDSAGAFEP RKIDLVETFN FLIGLRVKHI DAQPQRGFVT VTGTLPSNET CLVLWRDCDV LDYEGISKLC DKLAINPADN EFDVVYINGD HNIPTVLTQT AEEGGATRVL KLRQIEPEFL ERMFSVEDI
|
| |