Gene Tmz1t_2371 details


Gene Information

Locus tag: Tmz1t_2371
Symbol:
ID: 7094293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011667
Strand:
Start bp: 33755
End bp: 35014
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 60%
IMG OID: 643701059
Product: DNA-cytosine methyltransferase
Protein accession: YP_002364200
Protein GI: 217980150
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
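
The start/end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(33755, 35014)
print(length_bp)                  # 1260
print(protein_length(length_bp))  # 419
```

Both results agree with the Gene Length and Protein Length fields of the record.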


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value: 0.0692329
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.000000223127
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTAACC CGATCGACCT ACTGAAAGAG GCTCGCCTTC GGTTTTCGCA GCGCGAGATT 
GCCGATTATG TGGGCAAGGA CATCAAAACC GTTCGCCGCT GGGAGAAAGG TGAAACGCCA
TGTCCGGCAA TCCTCGAGCC CGCGCTAAGG GAGATGCTTC GCACGCCGCG CCTCCAAGAG
CTCGCGGAGC CGGACTTCAC CTTCATTGAT CTGTTTGCAG GGGTCGGGGG TATAAGGATG
GGCTTTGAGG CTCATGGTGG GCGCTGTGTC TTCACGAGCG AGTGGGACAG TTATGCTCAG
AAAACCTATG CCGAAAACTT CCCTGCCGAG CACCCCCTGA ACGGCGATAT CACAAAGATT
GAAGCTGCAG ACATCCCTGA CCACGATGTG TTGCTGGCCG GGTTCCCCTG TCAGCCGTTT
TCCATCGCAG GCGTCTCAAA GAAGAATGCG CTAGGGCGCG CGCACGGTTT CGCCTGCGAC
ACTCAAGGCA CGCTGTTCTT TGATGTGTGC AGGATCATCG AGGAGAAGCG CCCGCGTGCC
TTCCTGCTGG AGAACGTCAA GAACCTGATG TCCCACGACA AGGGCCGGAC ATGGGATGTC
ATCAAGAGCT CGCTCATCGA ACTGGGTTAC AACATTTCTC CGCGTGTGGT TGATGGCGCC
CACTTCGTGC CCCAGCACCG TGAACGCATC CTCATCGTGG GCTTCCGGAA TGAGGACGGT
ATCCGCTTCG ATTGGGATGC AGTGGGCCTG CCGCAGAAGG GAGTCCATGT GATGCGTGAC
ATCCTGCACC GTACCGACGG TACCGAACCA GTCCTCCCGT GGGATGGTGA CCGGTTCTTC
GATCATGCCG GTCGTAGGGT TCAGGACAAG TACACACTGA CCCCCAAGCT CTGGCGCTAC
CTGCAGGACT ATGCAGACAA GCACCGTGCA AAGGGCAACG GCTTCGGCTT CGGTCTGGTG
CACCCTGGCA GCGTGGCTCG AACCCTGTCC GCGCGGTACT ACAAGGATGG CTCGGAGATC
CTTGTCTATC AGGGCGAGGG TATCAACCCG CGCAGGCTCA CGCCGCGGGA GTGCGCGCGC
CTGATGGGCT TTCCGGACAG TTTCCGGATC CCGGTCTCCG ATACGCGGGC TTACAAGCAG
TTCGGTAACA GCGTTGTAAT GCCTGTCATG CGTGAGGTGG CCCGGGCCAT GGTTCCGCAC
ATTCTGGCTA GGCGAGAAGA CCGACACGAT GTGCCCGAAG CGCTCGCCTG TGCAGCGTGA
 
Protein sequence
MSNPIDLLKE ARLRFSQREI ADYVGKDIKT VRRWEKGETP CPAILEPALR EMLRTPRLQE 
LAEPDFTFID LFAGVGGIRM GFEAHGGRCV FTSEWDSYAQ KTYAENFPAE HPLNGDITKI
EAADIPDHDV LLAGFPCQPF SIAGVSKKNA LGRAHGFACD TQGTLFFDVC RIIEEKRPRA
FLLENVKNLM SHDKGRTWDV IKSSLIELGY NISPRVVDGA HFVPQHRERI LIVGFRNEDG
IRFDWDAVGL PQKGVHVMRD ILHRTDGTEP VLPWDGDRFF DHAGRRVQDK YTLTPKLWRY
LQDYADKHRA KGNGFGFGLV HPGSVARTLS ARYYKDGSEI LVYQGEGINP RRLTPRECAR
LMGFPDSFRI PVSDTRAYKQ FGNSVVMPVM REVARAMVPH ILARREDRHD VPEALACAA
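
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is one way to reproduce it, assuming the sequence is pasted in the 10-base blocks shown above. Translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in permitted start codons), so the standard codon table is used here. The `translate` helper is hypothetical and written for illustration only.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTCTAACC CGATCGACCT ACTGAAAGAG"))  # MSNPIDLLKE
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should give the complete protein.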