Gene Tmz1t_3150 details

Gene Information

Locus tag: Tmz1t_3150
Symbol:
ID: 7874292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3406955
End bp: 3409957
Gene Length: 3003 bp
Protein Length: 1000 aa
Translation table: 11
GC content: 65%
IMG OID: 643700080
Product: type I site-specific deoxyribonuclease, HsdR family
Protein accession: YP_002890124
Protein GI: 237653810
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
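
The start, end, gene length, and protein length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based inclusive coordinates (which the reported length implies) and a final stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3406955, 3409957)
print(length_bp)                  # 3003
print(protein_length(length_bp))  # 1000
```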


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCACTG AATCCAACAC CGTCGAGGCC TACCTGTCCG ACCTCCTGGC GGACTTAGGC 
AACCCCCTCG CTAATAAGGC CGAAGAGCCC TCTCCCAACT ACCTCCGCAA GCCAAGGCGC
ACTGGTTGGC ACTTCGCAGC CCCCGCCGAC ATCCCCCGCC AGCCGCATGA GCTCCTCGTC
GAACCCTGGC TGCGCGACGC CCTCATCCGC CTCAACCCCG AAATCGCGGC CCGGCCCGAT
CTGGCCGACG AAGTCCTCTA CAAGCTGCGC GCCATCGTGC TGTCGGTGCG TTCCGACGGC
CTGATCCGCG CCAACGAGGA AATGGCCGCC TGGATGCGCG GCGAACGCTC GATGCCGTTC
GGGCCGAACA ACGAGCATGT GCCGGTGCGG TTGATCGACT TCGACGACCT GGGGCGCAAC
GAATTCGTCC TCACCCGTCA ATTCACCTTC CGCGCTGGCC CCGCAGAGCG CCGCGCCGAC
CTGGTGCTGC TGGTCAATGG CTTGCCGCTG GTGTTGATCG AGGCCAAGAC GCCGGTGAAG
AAGTGCATCA GCTGGGTCGA CGGCGCGCTG CAGGTGCATG AGGACTACGA GAAGTTCGTC
CCCGAGCTGT TCGTGTGCAA CGTGTTCTCG GTGGCGACCG AGGGCAAGGA GTTCCGCTAC
GGCTCGATCG GCCTGCCGGT GAAGGACTGG GGGCCGTGGA ACCTGGATGG CGAGGCGGAC
GATGCGCGCG GCCAGACGCA TCCGCTCAAG TCGCTGCGGC AGTCGGTGGA GAGCATGCTG
CGTCCGCAGG TGGTGCTCGA CATCCTCGCC AGCTTCACGC TGTTCGCCAC CGACAAGAAG
AAGCGCCGCA TCAAGATCAT CTGCCGCTAC CAGCAATACG AGGCGGCCAA CAAGATCGTC
GAGCGCGTGC TGGCGGGCCA GCCGAAGAAG GGCCTGATCT GGCACTTCCA GGGTTCGGGC
AAGTCGCTGC TGATGGTGTT CGCGGCGCAG AAGCTGCGCA TGCACCCGCG GCTGAAGAAC
CCCACGGTGC TGATCGTGGT GGACCGCATC GACCTTGATA CCCAGATCAC CGGCACCTTC
ACCGGCGCCG ACATCCCCAA CCTGGAAAAG GCCGACTCGC GCGAGAAGCT GCAGCAACTG
CTGGCGCAGG ACGTGCGCAA GATCATCATC ACGACGATCT TCAAGTTCGG CGAAACGCCA
AACGGGAAAG CCGGGGCGCT GAACGAGCGC AGCAACATCA TCGCCCTGGT CGACGAGGCG
CACCGCACCC AGGAAGGCGA CCTCGGCCGC AAGATGCGCG AGGCGCTGCC CAATGCCTTC
CTGTTCGGCC TCACCGGCAC CCCGATCAAC CGCGCCGACC GCAACACCTT CTACGCCTTC
GGCGCCGACG AGGACGAGAA GGGCTACCTG AGCCGCTACG GCTTCGAGGA GTCGATCCGC
GACGGCGCCA CGCTGCGGTT GCACTTCGAG CCGCGGCTGG TGGACCTGCA CATCGACAAG
GCCGCGATCG ACGAGGCCTA CAAGGACCTG ACCGGCGGCC TGTCCGACCT CGACCGCGAC
AACCTCGCCA AGACCGCCGC CAAGATGGCG GTGCTGGTGA AGACGCCCGA GCGCATCCGC
CGCATCTGCG AGGACATCGT CCAGCACTAC CAGAGCAAGG TCGAGCCCAA CGGCTTCAAG
GGCCAGGTCG TCACCTTCGA TCGCGAGTCC TGCCTGCTGT TCAAGGCCGA GCTCGACAAG
CTGCTGCCCA CCGAGGCGAC CGACATCGTC ATGTCGGTGC AGCCGGGCGA CCGCAAGGAG
CGTCCCGAAT ACGCCCGCTA CGACCGTAGC CGCGACGAGG AAGAGCGCCT GCTCGACCGC
TTTCGCGACC CGGCCGACCC GCTCAAGCTG ATCATCGTCA CCGCCAAGCT GCTCACCGGC
TTCGACGCCC CCATCCTGCA GGCGATTTAC CTCGACAAGC CGCTGCGCGA CCACACGCTG
CTGCAGGCGA TCTGCCGGGT GAACCGCACC TATTCCGAGC AGAAGACGCA CGGCCTCGTC
GTCGATTACC TGGGCATCTT CGACGACGTG GCTGCCGCCC TGGAGTTCGA CGACCAGAGC
GTGAAGCAGG TCATCAGCAA CATCCAGGAG CTGAAGGACA AGCTGCCCGA GGCAATGCAG
AAATGCCTCG CCTTTTTCCC CGGCGTCGAC CGCAGCCAGC AAGGCTACGA AGGCTTGATC
GCCGCCCAGC AGTGCCTGCC GAACAACGAC ACCCGCGACG CCTTCGCCGC GGAATACAGC
GTGCTCGCGC GCATCTGGGA AGCACTGTCG CCCGACCCGC TGCTTGGACA GTACGAGACC
GACTACAAGT GGCTGTCGCA GATCTACCAG TCGGTGCAGC CGTCGAGCGG CCACGGCAAG
CTGATCTGGC ATTCGCTCGG CGCCAAGACC ATCGAGCTGA TCCACCAGAA CGTGCATGTC
GACGCCATCC GCGACGACCT CGACACCTTG GTGCTCGACG CCGACCTGCT CGAGGCCGTG
CTGTCGAACC CTGACCCGAA GAAGGCGAAG GAACTCGAGA TCAAGCTCAA CCGCCGGCTG
CGCAAGCACC AGGGCAATCC GAAGTTCAAG GATCTATCCG AGCGGCTGGA TGCGCTCAAG
GAGCGCTTCG AGTCCGGCCA GATCAACAGC GTCGACTTCC TCAAGCAGCT GCTGCAGATC
GCCAAGGAAA CCCTGCAGGC TGAAAGGGAG ACCCCTCCCG AGGAAGACGA GGACCGCGGC
AAGGCCGCGC TGACCGCGCT ATTCAACGAG GTGAAGACCC CCGAGACGCC GATCATCGTC
GAGCGCGTGG TGACGGACAT CGACGAGATC GTGCGCCTGG TGCGCTTCCC GGGATGGCAG
GGGACGCAGG CGGGGGAACG GGAGGTGAAG AAGGCGCTGC GGAAGGCCTT GTTCAAGTAC
AAGCTGCACG CGGACGAAGA GCTGTTCGAG AAGGCGTTCA GTTATATCCG GCAGTATTAC
TGA
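
The gene sequence is listed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the gene information. The helper names are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, used here as sample input.
first_line = "ATGTTCACTG AATCCAACAC CGTCGAGGCC TACCTGTCCG ACCTCCTGGC GGACTTAGGC"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_percent(seq))   # GC percentage of this 60-base fragment only
```

Passing the full listing through `clean_sequence` first gives the value for the whole gene rather than for this fragment.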
 
Protein sequence
MFTESNTVEA YLSDLLADLG NPLANKAEEP SPNYLRKPRR TGWHFAAPAD IPRQPHELLV 
EPWLRDALIR LNPEIAARPD LADEVLYKLR AIVLSVRSDG LIRANEEMAA WMRGERSMPF
GPNNEHVPVR LIDFDDLGRN EFVLTRQFTF RAGPAERRAD LVLLVNGLPL VLIEAKTPVK
KCISWVDGAL QVHEDYEKFV PELFVCNVFS VATEGKEFRY GSIGLPVKDW GPWNLDGEAD
DARGQTHPLK SLRQSVESML RPQVVLDILA SFTLFATDKK KRRIKIICRY QQYEAANKIV
ERVLAGQPKK GLIWHFQGSG KSLLMVFAAQ KLRMHPRLKN PTVLIVVDRI DLDTQITGTF
TGADIPNLEK ADSREKLQQL LAQDVRKIII TTIFKFGETP NGKAGALNER SNIIALVDEA
HRTQEGDLGR KMREALPNAF LFGLTGTPIN RADRNTFYAF GADEDEKGYL SRYGFEESIR
DGATLRLHFE PRLVDLHIDK AAIDEAYKDL TGGLSDLDRD NLAKTAAKMA VLVKTPERIR
RICEDIVQHY QSKVEPNGFK GQVVTFDRES CLLFKAELDK LLPTEATDIV MSVQPGDRKE
RPEYARYDRS RDEEERLLDR FRDPADPLKL IIVTAKLLTG FDAPILQAIY LDKPLRDHTL
LQAICRVNRT YSEQKTHGLV VDYLGIFDDV AAALEFDDQS VKQVISNIQE LKDKLPEAMQ
KCLAFFPGVD RSQQGYEGLI AAQQCLPNND TRDAFAAEYS VLARIWEALS PDPLLGQYET
DYKWLSQIYQ SVQPSSGHGK LIWHSLGAKT IELIHQNVHV DAIRDDLDTL VLDADLLEAV
LSNPDPKKAK ELEIKLNRRL RKHQGNPKFK DLSERLDALK ERFESGQINS VDFLKQLLQI
AKETLQAERE TPPEEDEDRG KAALTALFNE VKTPETPIIV ERVVTDIDEI VRLVRFPGWQ
GTQAGEREVK KALRKALFKY KLHADEELFE KAFSYIRQYY
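
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the amino acid assignments of translation table 11, which match the standard genetic code (the tables differ in the start codons they permit, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence listing.
print(translate("ATGTTCACTG AATCCAACAC CGTCGAGGCC TACCTGTCCG ACCTCCTGGC GGACTTAGGC"))
# MFTESNTVEAYLSDLLADLG
```

The output matches the first 20 residues of the protein sequence listed above.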