Gene Tmz1t_0959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0959 
Symbol 
ID7085062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1049665 
End bp1052784 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content56% 
IMG OID643697981 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_002354621 
Protein GI217969387 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.524193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCA AACAGAATCC CAAATTCCAG GAGCTCGTCG TCAAGCTGCG CGAAATCTTC 
CAGATTGACC GCCCGGAACT GGACTTCGGC ATCTACCGCA TCCTCAACGC CCGTGCAGGT
GAGATCAACG ACTACCTGCA AAACCGGCTT GTCGAGAAGG TGCAGGCAGC GCTCAGCAGC
GGCAATGAAT CGCAACGCGA GCAAGTGGCG CGCGAACTGA AGGAAAAAGA AGCCCAGTAC
CAAGCCGATG GGATCAACCC GGAGACGGTG CCCAAGGTTC AGGAACTTCG GCAGAAGCTG
GCGCAGTACA GCACCGGCGC CAGCGAACAC GAGAACGCTG TGTTCTCGCA CCTGCTCACC
TTCTTTTCCC GCTATTACCA GGACGGTGAC TTCATCAGCC AGCGCCGCTA CAAGGGTGAC
ACCTACGCCA TCCCCTATGC GGGCGAAGAA GTGATGCTGC ACTGGGCTAA CAAGGATCAG
TACTACACCA AGAGCGGCGA GAACTTCAGC AACTACAGCT TCAAGCTCGA AGACGGTCGC
ACGGTGCATT TCCGCCTCGC CGCCGCCGAC ACTGCCAAGG ACAACCGCAA GGACAACGAC
AAGGAGCGTC GCTTTGCCCT GGTGGCGGCG AAAACCGTCA CTCGCCTGGA TGAAAATGGC
GACGAATACG AAGAGGAACT GGTGCCCGTG GAAGAGGCAT TGGGCAGTGA CGGCAATAAG
GAGCTGATCA TTCGCTTCGA GTACGCCGCC CAGACCAAAG GCACCAAGCA GGAGGCGCTG
GTTACCAAGG CCGTAGAGGC AGTGCTGGCG GATGCTTCCG TCAAGGCCCG CTGGCTGGCC
CTGGGCAACC GCGCGCCCAC CGAGAAGAAC CCGCAGCGCA CCCTGCTGGA AAAGCACCTG
AGCGACTACA CCACCAAGAA CACGGCGGAC TACTTCATCC ACAAAGATCT GGGTGGTTTC
CTCCGGCGCG AGCTGGACTT CTACATCAAG AACGAGGTCA TGCACCTGGA TGATGTGCAG
AACGCGGGCG CGTTCGCGGA CATCGAGAAG AACCTTCGGA TGATCCAGTG CCTGCGCAGC
ATCGCGCTGG AGCTAATCAC TTTCTTGGCC CAGTTGGAAG ACTTCCAGAA GAAGCTGTGG
CTGAAAAAGA AGTTCGTTGT CTCCAGTCAC TACTGCATCA CGCTGGATCG GGTGCCGGAA
GCGCTGTGGC CGGAAGTGGT AGCGAACGCG CAGCAATGGG CGCGGTGGAA ACAGCTGGGC
GTTTGGGATG GCGACGCACC GGGGACGGTG GAGGACTTGA AGGCTGCGCA GTATCGGATG
GTCGATACGG CTTTATTCAA TGACGACTTT AAGCAGCGCC TTTTGGCAAA AATCGAAGAT
ATCGAAGCAA GCCTCGGCGG CATCGTCATC AACGGGGATA ACTTCCAGGC GCTCAATTTG
GCGAAGTATC GTTACCGAGC GAGCATAGAT TTCACCTATA TTGATCCGCC CTACAACACT
GTCCATTCAA AGATCGCATA CAAGAACCAG TTCGAGCACT CAAGCTGGTT GGCTTTGATT
TCCAACACGC TGCCATTTAC TCGCGATCTA TTCGGGGAAA TTTATTCATT TGGATTCGCC
ATTGACGATT ACGAATATAA CAATGCCTTT CACTGCTTGA GGGGGCATTT CACTGAATGC
GATGTCTCGA CCATCGTGAT CAATCACCAT CCACAAGGAT CGGGCGGAAG GCTGTCACGG
ACGCACGAGT ACTACATCGT CGCCTCTCCC AAAGATGCGC CGCAATACCT TGGTTTTCCG
AAAGAGGACG AGACCGAGGA CAGGCAGTTC ATGCGAAGTG GAACGGCTGA CAATAACTAC
CGCGCGCCGC GTGCTGGGGG AGTTGGTCGT TGGCGTAGCT TCTACGCTCT TCTCGTCGAC
CCATCTACCA AGAAAGTTGT AGGAGCAGAG CCGCCGCCGC CACTTGGAAC TGATTATCCA
ACCGGGCCAA CGGCGGAAGG ATTACAAAGA ATCTACCCAA TCAATACCAC TGGTGAGGAG
CGCGTTTGGC GGTCATCATA CGAGACGGGC AAAGTACGTG CAGCAAATGG CGAGCTGATC
GTTACTGACC GTGGTGCCGT GAAGCAGCTT ATCGATCATC AGGACAAGCG GGAAACGCTC
TTCAGTAATT GGATCGGCGC AGACTTCAAT GCCGGAACCA ATGGTACCAA CGTCTTGGAT
AATCTCGGGC TCGGTGGAAT TTTTGATTAC CCGAAGTCAG TGAAAACCCT CGAACAATCC
TTCTGGATGC AGTCATTCGG GAAGACAAAC TTTACCGTTC TAGATTACTT TGCAGGCTCA
GGAACAACTG CGCATGCAAC AATTTCCCTA AATCGACAGG ACAATGCATC GCGCAAGTAC
GTTCTAGTCG AGCAAGGTGA GTATTTCGAG ACCGTTCTCA AGCCACGAAT TCAGAAAGTC
GTCTTTTCGG CTGATTGGGT TGGCGGCAAG CCGACGTCTT CAGAGACAGG CATTTCGCAT
TGCTTTAAGG CAATCAAACT CGAAAGCTAC GAAGACACAC TGAACAACCT GCAACTGAGC
CGTACGTCCG CGCAGGGCGA TCTGCTGAAC ACCCTGCCGC AGCCGGCCAA GGAGGACTAC
CTGCTCAACT ACGTGCTGGA CGTGGAAAGC CGGGGCTCGT TGCTGTCGGT GGAGGACTTC
AGGAAGCCCT TCGACTACAC CCTCAACGTG GCGGTGGACT CGGCGGGCGC GTTCGAGCCG
CGCAAGATCG ATCTGGTCGA AACTTTCAAT TTCCTGATCG GCCTGCGCGT CAAGCACATC
GATGCCCAGC CGCAGCGCGG CTTCGTCACG GTCACCGGAA CCCTGCCCAG CAATGAGACC
TGCCTCGTGC TGTGGCGCGA TTGCGATGTG CTGGACTACG AAGGCATCAG CAAGCTCTGC
GACAAGCTGG CCATCAACCC GGCGGACAAT GAGTTTGACG TGGTCTACAT CAACGGCGAC
CACAACATTC CCACCGTGCT GACGCAGACG GCCGAGGAAG GCGGTGCCAC CCGCGTGCTC
AAGCTGCGCC AGATCGAGCC GGAGTTTCTG GAGCGCATGT TCTCCGTGGA GGACATCTGA
 
Protein sequence
MTTKQNPKFQ ELVVKLREIF QIDRPELDFG IYRILNARAG EINDYLQNRL VEKVQAALSS 
GNESQREQVA RELKEKEAQY QADGINPETV PKVQELRQKL AQYSTGASEH ENAVFSHLLT
FFSRYYQDGD FISQRRYKGD TYAIPYAGEE VMLHWANKDQ YYTKSGENFS NYSFKLEDGR
TVHFRLAAAD TAKDNRKDND KERRFALVAA KTVTRLDENG DEYEEELVPV EEALGSDGNK
ELIIRFEYAA QTKGTKQEAL VTKAVEAVLA DASVKARWLA LGNRAPTEKN PQRTLLEKHL
SDYTTKNTAD YFIHKDLGGF LRRELDFYIK NEVMHLDDVQ NAGAFADIEK NLRMIQCLRS
IALELITFLA QLEDFQKKLW LKKKFVVSSH YCITLDRVPE ALWPEVVANA QQWARWKQLG
VWDGDAPGTV EDLKAAQYRM VDTALFNDDF KQRLLAKIED IEASLGGIVI NGDNFQALNL
AKYRYRASID FTYIDPPYNT VHSKIAYKNQ FEHSSWLALI SNTLPFTRDL FGEIYSFGFA
IDDYEYNNAF HCLRGHFTEC DVSTIVINHH PQGSGGRLSR THEYYIVASP KDAPQYLGFP
KEDETEDRQF MRSGTADNNY RAPRAGGVGR WRSFYALLVD PSTKKVVGAE PPPPLGTDYP
TGPTAEGLQR IYPINTTGEE RVWRSSYETG KVRAANGELI VTDRGAVKQL IDHQDKRETL
FSNWIGADFN AGTNGTNVLD NLGLGGIFDY PKSVKTLEQS FWMQSFGKTN FTVLDYFAGS
GTTAHATISL NRQDNASRKY VLVEQGEYFE TVLKPRIQKV VFSADWVGGK PTSSETGISH
CFKAIKLESY EDTLNNLQLS RTSAQGDLLN TLPQPAKEDY LLNYVLDVES RGSLLSVEDF
RKPFDYTLNV AVDSAGAFEP RKIDLVETFN FLIGLRVKHI DAQPQRGFVT VTGTLPSNET
CLVLWRDCDV LDYEGISKLC DKLAINPADN EFDVVYINGD HNIPTVLTQT AEEGGATRVL
KLRQIEPEFL ERMFSVEDI