Gene Information |
Locus tag | Tmz1t_2403 |
Symbol | |
ID | 7094325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011667 |
Strand | - |
Start bp | 65778 |
End bp | 67793 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643701089 |
Product | site-specific DNA-methyltransferase, cytosine-specific |
Protein accession | YP_002364230 |
Protein GI | 217980180 |
COG category | [V] Defense mechanisms |
COG ID | [COG1401] GTPase subunit of restriction endonuclease |
TIGRFAM ID | |
|
|
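The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 65778
end_bp = 67793
gene_length_bp = 2016
protein_length_aa = 671

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so the protein has one residue
# fewer than the number of codons in the gene.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

consistent = (
    span == gene_length_bp
    and remainder == 0
    and expected_protein_length == protein_length_aa
)
print(span, codons, expected_protein_length, consistent)
```

Under these assumptions the span, the gene length, and the protein length agree with one another.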
Plasmid Coverage information |
Num covering plasmid clones | 61 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 99 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGAT ACCTGACGGC GGCTCGTATT AAAGCAAGCA TCACCGCGTT GGCTGACACG CGGGCGAAAG CCGCATTGAT GGACTTTCTC ATCTTGAAGC GAACGCTCTC TGTCGGTGGG CAGACGCACG TAGCCATAAC CCAAAGTCAA CCAGCCTATC TGCAGGCAAC CAAGGAACTA GCTGGAGTCA AGTTAGACAA CTCAATTCTG ATCGGTGAAG AGAAGCAAAT TTTCAACGTC TTCGTGTCGC AGGAGGCGAG CCGAGGCGGC TTCCGCGGCG GCAAATACAT CTCCAACGGG ACTGGTACCA CTATCGCCGG CAACTCTTGG CAGAGGGTCG TCGAACTAAC CAGCGACGAC CCTCGAAAGG CGGGGCTGCG GGCGGGGCAT GAAGCCTACT TAGAAGCGCT CTTATTGAAA GCAGCCAAAG GCGCCAAGCC AAGCCTGGGA GAGACCGCGG TCTGGAACTA CCGAAAAGTC GATATTGAGC CGATAGTCGG GGGTTTCGCT GCGCCGGCTG ATCGCTTCAA TGCGCTGCGG GACCGCTTCG TCGCCGACTA CAGCCTTACC GCAGCCGAGC GGGATGCCTT GTTTTCAGAT CCAGCTGGCC AGATCACTGA CGCCGATCTG GACGACGCCC CGGCCACGCC GGAGGACTAC CTGAATGGTC TCGTGGCTGC GTCTGTACCT GCCGCGGCAG CTGCCGCCAC AGGGGGTACG TGCTCGCTCG ACCTAGTCGC GGCACTAGCA GCCAAGCCTT TTGTGATCCT TACAGGCGCA TCGGGCACTG GGAAGTCGCG CTCGACGCTG CGACTTGCGG AGCAATTGCA AGAGCATTAC GACGCGCAAG TCAAAGGCCA GATTTTCCAG TTGGTTCCGA TCGGCCCCGA CTGGACCTCC CCGAAGAAGC TCCTCGGCTT CCGCACTCCT TTTGGGCAGC TTCGCAAGAG GGCAGACGGG ACTGAGACTA ACGAAAGCTA CGAGATCACC GAAACGCTTC GCATCATTCT GCGGGCGTGT AATCCGAGTT CGACGAAGAT CCCGCACTTC CTGGTATTCG ACGAGATGAA TCTCTCGCAC GTCGAGCGCT ACTTCGCGCC GTTTCTGTCG CTTATGGAGG CATCGTCGAT CCTGGAAGAT GGCGAGAACG CCCCCATCGT GGATAAGCAC TCCATGTCGG TGATATCGGA GCTGCTGAAC GCGGAGGACC CGGCTTCAGC AGAGGCTGAG TCGGCCGCGT TGCTTGTAAA AAACGATCAG CCTTTGACGC TGCCGCCGAA CCTCTTCTAT GTCGGGACGG TGAACATCGA TGAGACCACC TACATGTTCT CGCCCAAAGT GCTCGACCGG GCCCACGTTC TGGAAGCGCG AGCTCTCAGG CCCTCCGAAT ACCTCGCGGG AGCGAAGCCG GAAGAGACGT TGGACTTGGC CATGGGGAAT CAGCTCCTGC AGGAGGCGAT CGACGACCGA GAAGCGGGTG AAGGCCGTGC AGCAGACCCG TCGCAGGTCC TCGTCGCTTT GGTGGACAAA TATGGAGTCA ACGCGATTGA GTTCGAAAGC CAGCGGACAT TCACAGTGCA GGTACTTGAA GGTTGCTTCA AGCTGCTTGC CCCTGTGGGG TTCGAGTTCG CGTTTCGGGT GAACAAGGAG ATCTACGCCT ACATGCTGGT GTGGATCAAG GCGCAGATCA TCAATGGCGT CGCTCCGGCC GACGCCATGA CTCATTGGGT AGATGGGCTC GACCGTGCCC TGTTCCAGAA GGTTCTCCCC AAAATTCATG GGAGTCGTTC CGCCTTGGGT 
GACAGCCTGA AGGCAATCCA TGCGTTCCTG GGCGGCTCTC ATGCCGACAG GGACCCGGCC GCCAAATACA CGCTGGGCGC CGAGGCTTCA ACTCGTATCG AACCGGGTGA GGCCATCAAC CTGCCGCCAG GTAAGGAGTT TGCTCGGTGC AGGGCTAAGC TCCTCGAGAT GCACGGTCGA CTGCTCTCGC GCAACTACGT CTCCTTCGTG AAGTGA
|
Protein sequence | MARYLTAARI KASITALADT RAKAALMDFL ILKRTLSVGG QTHVAITQSQ PAYLQATKEL AGVKLDNSIL IGEEKQIFNV FVSQEASRGG FRGGKYISNG TGTTIAGNSW QRVVELTSDD PRKAGLRAGH EAYLEALLLK AAKGAKPSLG ETAVWNYRKV DIEPIVGGFA APADRFNALR DRFVADYSLT AAERDALFSD PAGQITDADL DDAPATPEDY LNGLVAASVP AAAAAATGGT CSLDLVAALA AKPFVILTGA SGTGKSRSTL RLAEQLQEHY DAQVKGQIFQ LVPIGPDWTS PKKLLGFRTP FGQLRKRADG TETNESYEIT ETLRIILRAC NPSSTKIPHF LVFDEMNLSH VERYFAPFLS LMEASSILED GENAPIVDKH SMSVISELLN AEDPASAEAE SAALLVKNDQ PLTLPPNLFY VGTVNIDETT YMFSPKVLDR AHVLEARALR PSEYLAGAKP EETLDLAMGN QLLQEAIDDR EAGEGRAADP SQVLVALVDK YGVNAIEFES QRTFTVQVLE GCFKLLAPVG FEFAFRVNKE IYAYMLVWIK AQIINGVAPA DAMTHWVDGL DRALFQKVLP KIHGSRSALG DSLKAIHAFL GGSHADRDPA AKYTLGAEAS TRIEPGEAIN LPPGKEFARC RAKLLEMHGR LLSRNYVSFV K
|
| |
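The sequence fields are printed in space-separated blocks of ten, so they need light cleaning before any computation. The following is a minimal sketch of how the gene sequence could be joined, its GC fraction computed, and its translation compared with the protein sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA), and it relies on translation table 11 assigning the same amino acids to codons as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (first, second, third base each cycle T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments as the
# standard code; the tables differ only in the permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks between ten-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Usage with the first three blocks of the gene sequence above.
first_blocks = "ATGGCGCGAT ACCTGACGGC GGCTCGTATT"
print(translate(first_blocks))  # MARYLTAARI
print(gc_fraction(first_blocks))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would give a quick consistency check between the two fields.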