Gene Tmz1t_2740 details


Gene Information

Locus tag: Tmz1t_2740
Symbol:
ID: 7873480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2962775
End bp: 2965831
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 67%
IMG OID: 643699662
Product: hypothetical protein
Protein accession: YP_002889717
Protein GI: 237653403
COG category: [V] Defense mechanisms
COG ID: [COG1002] Type II restriction enzyme, methylase subunits
TIGRFAM ID:
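
The start, end, gene length, and protein length values in this record agree with each other under two assumptions that the record itself does not state: the coordinates are 1-based and inclusive, and the gene length includes one terminal stop codon. The sketch below only illustrates that arithmetic; the function names are hypothetical and do not come from any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    return gene_len_bp // 3 - 1


length = gene_length_bp(2962775, 2965831)
print(length)                     # 3057
print(protein_length_aa(length))  # 1018
```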


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATGACC ACCCCGTGAC CTCGTCATCC ACCTTGCCCC AGCCCGCCGA CAGCGCCAGC 
CCCGCCGCCC TCGACGCCTT CATCGCGCGC TGGCAGCGCG CCGGTGGCAG CGAGCGCGCC
AACTACCAGC TCTTCCTCGC CGAGCTGTGC GAACTGCTCG CGCTGCCGCG CCCCGACCCT
GCCGGCGAGG ACACCCGCGA CAACGCCTAC GTGTTCGAGC GCCGGGTGCT GATGCGCCAG
CCCGACGGCA GCGCCAGCAA CGGCTTCATC GACCTCTACC GCCGCGGCGC CTTCGTGCTC
GAGGCCAAGC AGTCGGGCAG GACGCTCGAC AGCTCGGGCT GGGACAAGGC CATGCTGCGC
GCCCACAACC AGGCCGACCA GTACGCCCGC GCGCTCCCCG CCGACGAGGG CCGGCCGCCC
TTCATCCTGG TGGTCGACGT CGGCCGCAAC ATCGAGCTCT ACGCCGAGTT CAGCCGCTCG
GGCGCCACCT ACACGCCCTA CCCCGACCCC CGCAGCCACC GCATCCGCCT CGACGACCTG
CATCGCGAAG ACATCCGCCA GCGCCTGCGC GAGGTCTGGC TCGACCCGCT CGCGCTCGAC
CCCGCCCGCC GCTCGGCGCG CGTCACGCGC GAGATCGCCG ACCGCCTGGC CTCGCTCGCG
CGCTCGCTCG AAGCCGCCGG TCACGACCCG CAGCAGGTCG CCGGCTTCCT GATGCGCGCA
CTGTTCACCA TGTTCGCCGA GGACGTCGGC CTGCTGCCGC CGCGCGCCCT CACCGAACTG
CTCGAAAGCC TCAAGGGCCA GCCGCACACC TTCGCGCCGA TGCTCGAGCA CCTGTGGCAG
AACATGAACA CCGGCGGCTT CTCGCCGATC CTGCGCAACA AGGTGCTGCG CTTCAACGGC
GGCCTGTTTG CCGAGGCCAG CGCCATCCCG CTCGACCGCG ACCAGCTCGA GCTGCTGCTG
AAGGCATCGA AGGCCGACTG GCGCTACGTC GAACCCGCCA TCTTCGGCAC CCTGCTCGAG
CGCGCGCTCG ACCCGCGCGA ACGCCACAAG CTCGGCGCCC ACTACACCCC GCGCGCCTAC
GTCGAACGCC TGGTGCTGCC CACCGTCATC GAACCGCTGC GCGCGGAATG GCGCGAGGTG
CAGGCGGCCG CGCTCACCTA CGAGCAGCAG GGCAAGCACA GGGAGGCGGT CGCCGAGATC
CGCACTTTCC ACCGCCACCT GTGCACCGTG CGCGTGCTCG ACCCCGCCTG CGGCAGCGGA
AATTTCCTGT ATGTGACGCT GGAACACCTC AAGCGCCTCG AAGGCGAGGT GCTCAACCTG
CTGCACGACC TCGGCGAATC CCAGGGCCTG CTCGAACTCG AAGGCGTCAC CGTCGATCCG
CAGCAGTTCC TCGGCCTCGA GATCAACCCG CGCGCCGCCC GCATCGCCGA GATGGTGCTG
TGGATCGGCT ACCTGCAATG GCACTTCCGC ACCCACGGCT CGGTGAACCC GCCCGAGCCG
GTGCTGCGCG ACTTCCGCAA CATCGAGCAC CGCGACGCGC TGATCGAGTA CGAGCGCGAG
GAACCGGTCA CCGACGAGGC GGGACGCCCG GTCACGCGCT GGGACGGCGT GAGCTACCGG
AAGAGCCCGA TCACCGGCGA GGACATCCCG GACGAGACCG CGCAGGTGGT GCAGATACGC
TACGTGAACC CGCGCAAGGC GGCGTGGCCG CAGGCGGATT ACATCGTGGG GAATCCGCCG
TTCATCGGCG CCGCCACCAT GCGCCGCGCG CTCGGCGACG GCTATGTGGA CGCGGTACGC
CGCACCTGGC CCGAGGTGCC GGAATCGGCC GATTTCGTCA TGTACTGGTG GCACATCGCC
AGCGAGACCG TGCGCGCGGA CAAGGCGCGC CGCTTCGGCT TCATCACCAC CAACAGCATC
AAGCAGACCT TCAACCGCCG CGTCGTGCAG GCGCAGCTCG AGGCGAAGAA CCCGCTGTCG
CTGGCGTTCG CGATTCCGGA TCACCCGTGG GTGGATGCGG CGGACGGGGC GGCGGTGAGG
ATTGCGATGA CGGTGGGGGC GGGGGGTGAG CAGGACGGGC AATTGTCGGA AGTAAAGGAC
GAACGTGAGA CTGACCAAGA CGAAATCGAC GTAACGCTAC AAACACGTAG TGGACGCTTG
CATGCGGATC TCCGCAGCGG CGCAAACGTG ACCGGTGCAA TCTCGCTTCG GTCAAATGTT
GGCATCAGTT CGCCGGGGGT AAAGCTCCAC GGTGCCGGCT TCATCGTCAC GCCCGACGAG
GCAAGGTCAC TTGGCCTCGG CACAATTGGT GGCATTGAGC ACCACATTCG GGCCTATCGG
AATGGACGCG ACCTCACCGA TAGACCCCGC GGAGTAATGG TCATCGATCT CTTCGGCCTC
ACCGTCGACG AAGTGCGAAC CCGATACCCA GCGATTTATC AGTGGGTACT AGAGCGGGTG
AAGCCAGAGC GCGATCAGAA CAACCGTGCA ATCTACCGAG AAAATTGGTG GATCTTTGGT
GAGGCTCGAA GGGACTGGCG CGCGATGTCT GCGGGCTTGA AGGCACATGT CGCGACAGTG
GAAACAATGA AGCATCGGGT CTTCCAGTTG CTCGACGCCA ACATCCTTCC GGACAACAAG
GTGGTGAATG TTGCGACCGA TGACGCCCTG CTACTCGGCA TCCTGGGTAG CAGGCTCCAT
GTTGCGTGGG CCCTTGCGGC TGGTAGTCGC CTTGGCGTTG GCAACGACTC CGTCTATGTA
AAAACCACCT GCTTCGAAAC CTTCCCCTTC CCCGACCCCT CGCCCGCGCA AGCCGCCCGC
ATCCGCGACC TCGCCGAGCA GCTCGACGCT CACCGCAAGC GCCAGCAGGC GCTGCACCCC
GAGCTGACCC TCACCGGCAT GTACAACGTG CTCGAAAAGC TGCGCGCCGG CGACACGCTC
ACGCCCAAGG AACGCACGAT CCACGAGCAG GGCCTGGTGT CGGTGCTGCG CGAGCTGCAC
GACGCGCTCG ACAGCGCAGT GTTCGAGGCC TACGGCTGGT GCGACCTCGC CGCGTAA
 
Protein sequence
MHDHPVTSSS TLPQPADSAS PAALDAFIAR WQRAGGSERA NYQLFLAELC ELLALPRPDP 
AGEDTRDNAY VFERRVLMRQ PDGSASNGFI DLYRRGAFVL EAKQSGRTLD SSGWDKAMLR
AHNQADQYAR ALPADEGRPP FILVVDVGRN IELYAEFSRS GATYTPYPDP RSHRIRLDDL
HREDIRQRLR EVWLDPLALD PARRSARVTR EIADRLASLA RSLEAAGHDP QQVAGFLMRA
LFTMFAEDVG LLPPRALTEL LESLKGQPHT FAPMLEHLWQ NMNTGGFSPI LRNKVLRFNG
GLFAEASAIP LDRDQLELLL KASKADWRYV EPAIFGTLLE RALDPRERHK LGAHYTPRAY
VERLVLPTVI EPLRAEWREV QAAALTYEQQ GKHREAVAEI RTFHRHLCTV RVLDPACGSG
NFLYVTLEHL KRLEGEVLNL LHDLGESQGL LELEGVTVDP QQFLGLEINP RAARIAEMVL
WIGYLQWHFR THGSVNPPEP VLRDFRNIEH RDALIEYERE EPVTDEAGRP VTRWDGVSYR
KSPITGEDIP DETAQVVQIR YVNPRKAAWP QADYIVGNPP FIGAATMRRA LGDGYVDAVR
RTWPEVPESA DFVMYWWHIA SETVRADKAR RFGFITTNSI KQTFNRRVVQ AQLEAKNPLS
LAFAIPDHPW VDAADGAAVR IAMTVGAGGE QDGQLSEVKD ERETDQDEID VTLQTRSGRL
HADLRSGANV TGAISLRSNV GISSPGVKLH GAGFIVTPDE ARSLGLGTIG GIEHHIRAYR
NGRDLTDRPR GVMVIDLFGL TVDEVRTRYP AIYQWVLERV KPERDQNNRA IYRENWWIFG
EARRDWRAMS AGLKAHVATV ETMKHRVFQL LDANILPDNK VVNVATDDAL LLGILGSRLH
VAWALAAGSR LGVGNDSVYV KTTCFETFPF PDPSPAQAAR IRDLAEQLDA HRKRQQALHP
ELTLTGMYNV LEKLRAGDTL TPKERTIHEQ GLVSVLRELH DALDSAVFEA YGWCDLAA
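
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, with a simple GC fraction helper for nucleotide input; the helper names are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace between sequence blocks and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = (
    "ATGCATGACC ACCCCGTGAC CTCGTCATCC "
    "ACCTTGCCCC AGCCCGCCGA CAGCGCCAGC"
)
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applied to the full gene sequence, the joined length can be compared with the Gene Length field and the GC fraction with the GC content field.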