Gene Tmz1t_2403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2403 
Symbol 
ID7094325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011667 
Strand
Start bp65778 
End bp67793 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content59% 
IMG OID643701089 
Productsite-specific DNA-methyltransferase, cytosine-specific 
Protein accessionYP_002364230 
Protein GI217980180 
COG category[V] Defense mechanisms 
COG ID[COG1401] GTPase subunit of restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones99 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGAT ACCTGACGGC GGCTCGTATT AAAGCAAGCA TCACCGCGTT GGCTGACACG 
CGGGCGAAAG CCGCATTGAT GGACTTTCTC ATCTTGAAGC GAACGCTCTC TGTCGGTGGG
CAGACGCACG TAGCCATAAC CCAAAGTCAA CCAGCCTATC TGCAGGCAAC CAAGGAACTA
GCTGGAGTCA AGTTAGACAA CTCAATTCTG ATCGGTGAAG AGAAGCAAAT TTTCAACGTC
TTCGTGTCGC AGGAGGCGAG CCGAGGCGGC TTCCGCGGCG GCAAATACAT CTCCAACGGG
ACTGGTACCA CTATCGCCGG CAACTCTTGG CAGAGGGTCG TCGAACTAAC CAGCGACGAC
CCTCGAAAGG CGGGGCTGCG GGCGGGGCAT GAAGCCTACT TAGAAGCGCT CTTATTGAAA
GCAGCCAAAG GCGCCAAGCC AAGCCTGGGA GAGACCGCGG TCTGGAACTA CCGAAAAGTC
GATATTGAGC CGATAGTCGG GGGTTTCGCT GCGCCGGCTG ATCGCTTCAA TGCGCTGCGG
GACCGCTTCG TCGCCGACTA CAGCCTTACC GCAGCCGAGC GGGATGCCTT GTTTTCAGAT
CCAGCTGGCC AGATCACTGA CGCCGATCTG GACGACGCCC CGGCCACGCC GGAGGACTAC
CTGAATGGTC TCGTGGCTGC GTCTGTACCT GCCGCGGCAG CTGCCGCCAC AGGGGGTACG
TGCTCGCTCG ACCTAGTCGC GGCACTAGCA GCCAAGCCTT TTGTGATCCT TACAGGCGCA
TCGGGCACTG GGAAGTCGCG CTCGACGCTG CGACTTGCGG AGCAATTGCA AGAGCATTAC
GACGCGCAAG TCAAAGGCCA GATTTTCCAG TTGGTTCCGA TCGGCCCCGA CTGGACCTCC
CCGAAGAAGC TCCTCGGCTT CCGCACTCCT TTTGGGCAGC TTCGCAAGAG GGCAGACGGG
ACTGAGACTA ACGAAAGCTA CGAGATCACC GAAACGCTTC GCATCATTCT GCGGGCGTGT
AATCCGAGTT CGACGAAGAT CCCGCACTTC CTGGTATTCG ACGAGATGAA TCTCTCGCAC
GTCGAGCGCT ACTTCGCGCC GTTTCTGTCG CTTATGGAGG CATCGTCGAT CCTGGAAGAT
GGCGAGAACG CCCCCATCGT GGATAAGCAC TCCATGTCGG TGATATCGGA GCTGCTGAAC
GCGGAGGACC CGGCTTCAGC AGAGGCTGAG TCGGCCGCGT TGCTTGTAAA AAACGATCAG
CCTTTGACGC TGCCGCCGAA CCTCTTCTAT GTCGGGACGG TGAACATCGA TGAGACCACC
TACATGTTCT CGCCCAAAGT GCTCGACCGG GCCCACGTTC TGGAAGCGCG AGCTCTCAGG
CCCTCCGAAT ACCTCGCGGG AGCGAAGCCG GAAGAGACGT TGGACTTGGC CATGGGGAAT
CAGCTCCTGC AGGAGGCGAT CGACGACCGA GAAGCGGGTG AAGGCCGTGC AGCAGACCCG
TCGCAGGTCC TCGTCGCTTT GGTGGACAAA TATGGAGTCA ACGCGATTGA GTTCGAAAGC
CAGCGGACAT TCACAGTGCA GGTACTTGAA GGTTGCTTCA AGCTGCTTGC CCCTGTGGGG
TTCGAGTTCG CGTTTCGGGT GAACAAGGAG ATCTACGCCT ACATGCTGGT GTGGATCAAG
GCGCAGATCA TCAATGGCGT CGCTCCGGCC GACGCCATGA CTCATTGGGT AGATGGGCTC
GACCGTGCCC TGTTCCAGAA GGTTCTCCCC AAAATTCATG GGAGTCGTTC CGCCTTGGGT
GACAGCCTGA AGGCAATCCA TGCGTTCCTG GGCGGCTCTC ATGCCGACAG GGACCCGGCC
GCCAAATACA CGCTGGGCGC CGAGGCTTCA ACTCGTATCG AACCGGGTGA GGCCATCAAC
CTGCCGCCAG GTAAGGAGTT TGCTCGGTGC AGGGCTAAGC TCCTCGAGAT GCACGGTCGA
CTGCTCTCGC GCAACTACGT CTCCTTCGTG AAGTGA
 
Protein sequence
MARYLTAARI KASITALADT RAKAALMDFL ILKRTLSVGG QTHVAITQSQ PAYLQATKEL 
AGVKLDNSIL IGEEKQIFNV FVSQEASRGG FRGGKYISNG TGTTIAGNSW QRVVELTSDD
PRKAGLRAGH EAYLEALLLK AAKGAKPSLG ETAVWNYRKV DIEPIVGGFA APADRFNALR
DRFVADYSLT AAERDALFSD PAGQITDADL DDAPATPEDY LNGLVAASVP AAAAAATGGT
CSLDLVAALA AKPFVILTGA SGTGKSRSTL RLAEQLQEHY DAQVKGQIFQ LVPIGPDWTS
PKKLLGFRTP FGQLRKRADG TETNESYEIT ETLRIILRAC NPSSTKIPHF LVFDEMNLSH
VERYFAPFLS LMEASSILED GENAPIVDKH SMSVISELLN AEDPASAEAE SAALLVKNDQ
PLTLPPNLFY VGTVNIDETT YMFSPKVLDR AHVLEARALR PSEYLAGAKP EETLDLAMGN
QLLQEAIDDR EAGEGRAADP SQVLVALVDK YGVNAIEFES QRTFTVQVLE GCFKLLAPVG
FEFAFRVNKE IYAYMLVWIK AQIINGVAPA DAMTHWVDGL DRALFQKVLP KIHGSRSALG
DSLKAIHAFL GGSHADRDPA AKYTLGAEAS TRIEPGEAIN LPPGKEFARC RAKLLEMHGR
LLSRNYVSFV K