Gene Information |
Locus tag | Tmz1t_3150 |
Symbol | |
ID | 7874292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3406955 |
End bp | 3409957 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643700080 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002890124 |
Protein GI | 237653810 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCACTG AATCCAACAC CGTCGAGGCC TACCTGTCCG ACCTCCTGGC GGACTTAGGC AACCCCCTCG CTAATAAGGC CGAAGAGCCC TCTCCCAACT ACCTCCGCAA GCCAAGGCGC ACTGGTTGGC ACTTCGCAGC CCCCGCCGAC ATCCCCCGCC AGCCGCATGA GCTCCTCGTC GAACCCTGGC TGCGCGACGC CCTCATCCGC CTCAACCCCG AAATCGCGGC CCGGCCCGAT CTGGCCGACG AAGTCCTCTA CAAGCTGCGC GCCATCGTGC TGTCGGTGCG TTCCGACGGC CTGATCCGCG CCAACGAGGA AATGGCCGCC TGGATGCGCG GCGAACGCTC GATGCCGTTC GGGCCGAACA ACGAGCATGT GCCGGTGCGG TTGATCGACT TCGACGACCT GGGGCGCAAC GAATTCGTCC TCACCCGTCA ATTCACCTTC CGCGCTGGCC CCGCAGAGCG CCGCGCCGAC CTGGTGCTGC TGGTCAATGG CTTGCCGCTG GTGTTGATCG AGGCCAAGAC GCCGGTGAAG AAGTGCATCA GCTGGGTCGA CGGCGCGCTG CAGGTGCATG AGGACTACGA GAAGTTCGTC CCCGAGCTGT TCGTGTGCAA CGTGTTCTCG GTGGCGACCG AGGGCAAGGA GTTCCGCTAC GGCTCGATCG GCCTGCCGGT GAAGGACTGG GGGCCGTGGA ACCTGGATGG CGAGGCGGAC GATGCGCGCG GCCAGACGCA TCCGCTCAAG TCGCTGCGGC AGTCGGTGGA GAGCATGCTG CGTCCGCAGG TGGTGCTCGA CATCCTCGCC AGCTTCACGC TGTTCGCCAC CGACAAGAAG AAGCGCCGCA TCAAGATCAT CTGCCGCTAC CAGCAATACG AGGCGGCCAA CAAGATCGTC GAGCGCGTGC TGGCGGGCCA GCCGAAGAAG GGCCTGATCT GGCACTTCCA GGGTTCGGGC AAGTCGCTGC TGATGGTGTT CGCGGCGCAG AAGCTGCGCA TGCACCCGCG GCTGAAGAAC CCCACGGTGC TGATCGTGGT GGACCGCATC GACCTTGATA CCCAGATCAC CGGCACCTTC ACCGGCGCCG ACATCCCCAA CCTGGAAAAG GCCGACTCGC GCGAGAAGCT GCAGCAACTG CTGGCGCAGG ACGTGCGCAA GATCATCATC ACGACGATCT TCAAGTTCGG CGAAACGCCA AACGGGAAAG CCGGGGCGCT GAACGAGCGC AGCAACATCA TCGCCCTGGT CGACGAGGCG CACCGCACCC AGGAAGGCGA CCTCGGCCGC AAGATGCGCG AGGCGCTGCC CAATGCCTTC CTGTTCGGCC TCACCGGCAC CCCGATCAAC CGCGCCGACC GCAACACCTT CTACGCCTTC GGCGCCGACG AGGACGAGAA GGGCTACCTG AGCCGCTACG GCTTCGAGGA GTCGATCCGC GACGGCGCCA CGCTGCGGTT GCACTTCGAG CCGCGGCTGG TGGACCTGCA CATCGACAAG GCCGCGATCG ACGAGGCCTA CAAGGACCTG ACCGGCGGCC TGTCCGACCT CGACCGCGAC AACCTCGCCA AGACCGCCGC CAAGATGGCG GTGCTGGTGA AGACGCCCGA GCGCATCCGC CGCATCTGCG AGGACATCGT CCAGCACTAC CAGAGCAAGG TCGAGCCCAA CGGCTTCAAG GGCCAGGTCG TCACCTTCGA TCGCGAGTCC TGCCTGCTGT TCAAGGCCGA GCTCGACAAG CTGCTGCCCA CCGAGGCGAC CGACATCGTC ATGTCGGTGC AGCCGGGCGA CCGCAAGGAG 
CGTCCCGAAT ACGCCCGCTA CGACCGTAGC CGCGACGAGG AAGAGCGCCT GCTCGACCGC TTTCGCGACC CGGCCGACCC GCTCAAGCTG ATCATCGTCA CCGCCAAGCT GCTCACCGGC TTCGACGCCC CCATCCTGCA GGCGATTTAC CTCGACAAGC CGCTGCGCGA CCACACGCTG CTGCAGGCGA TCTGCCGGGT GAACCGCACC TATTCCGAGC AGAAGACGCA CGGCCTCGTC GTCGATTACC TGGGCATCTT CGACGACGTG GCTGCCGCCC TGGAGTTCGA CGACCAGAGC GTGAAGCAGG TCATCAGCAA CATCCAGGAG CTGAAGGACA AGCTGCCCGA GGCAATGCAG AAATGCCTCG CCTTTTTCCC CGGCGTCGAC CGCAGCCAGC AAGGCTACGA AGGCTTGATC GCCGCCCAGC AGTGCCTGCC GAACAACGAC ACCCGCGACG CCTTCGCCGC GGAATACAGC GTGCTCGCGC GCATCTGGGA AGCACTGTCG CCCGACCCGC TGCTTGGACA GTACGAGACC GACTACAAGT GGCTGTCGCA GATCTACCAG TCGGTGCAGC CGTCGAGCGG CCACGGCAAG CTGATCTGGC ATTCGCTCGG CGCCAAGACC ATCGAGCTGA TCCACCAGAA CGTGCATGTC GACGCCATCC GCGACGACCT CGACACCTTG GTGCTCGACG CCGACCTGCT CGAGGCCGTG CTGTCGAACC CTGACCCGAA GAAGGCGAAG GAACTCGAGA TCAAGCTCAA CCGCCGGCTG CGCAAGCACC AGGGCAATCC GAAGTTCAAG GATCTATCCG AGCGGCTGGA TGCGCTCAAG GAGCGCTTCG AGTCCGGCCA GATCAACAGC GTCGACTTCC TCAAGCAGCT GCTGCAGATC GCCAAGGAAA CCCTGCAGGC TGAAAGGGAG ACCCCTCCCG AGGAAGACGA GGACCGCGGC AAGGCCGCGC TGACCGCGCT ATTCAACGAG GTGAAGACCC CCGAGACGCC GATCATCGTC GAGCGCGTGG TGACGGACAT CGACGAGATC GTGCGCCTGG TGCGCTTCCC GGGATGGCAG GGGACGCAGG CGGGGGAACG GGAGGTGAAG AAGGCGCTGC GGAAGGCCTT GTTCAAGTAC AAGCTGCACG CGGACGAAGA GCTGTTCGAG AAGGCGTTCA GTTATATCCG GCAGTATTAC TGA
|
Protein sequence | MFTESNTVEA YLSDLLADLG NPLANKAEEP SPNYLRKPRR TGWHFAAPAD IPRQPHELLV EPWLRDALIR LNPEIAARPD LADEVLYKLR AIVLSVRSDG LIRANEEMAA WMRGERSMPF GPNNEHVPVR LIDFDDLGRN EFVLTRQFTF RAGPAERRAD LVLLVNGLPL VLIEAKTPVK KCISWVDGAL QVHEDYEKFV PELFVCNVFS VATEGKEFRY GSIGLPVKDW GPWNLDGEAD DARGQTHPLK SLRQSVESML RPQVVLDILA SFTLFATDKK KRRIKIICRY QQYEAANKIV ERVLAGQPKK GLIWHFQGSG KSLLMVFAAQ KLRMHPRLKN PTVLIVVDRI DLDTQITGTF TGADIPNLEK ADSREKLQQL LAQDVRKIII TTIFKFGETP NGKAGALNER SNIIALVDEA HRTQEGDLGR KMREALPNAF LFGLTGTPIN RADRNTFYAF GADEDEKGYL SRYGFEESIR DGATLRLHFE PRLVDLHIDK AAIDEAYKDL TGGLSDLDRD NLAKTAAKMA VLVKTPERIR RICEDIVQHY QSKVEPNGFK GQVVTFDRES CLLFKAELDK LLPTEATDIV MSVQPGDRKE RPEYARYDRS RDEEERLLDR FRDPADPLKL IIVTAKLLTG FDAPILQAIY LDKPLRDHTL LQAICRVNRT YSEQKTHGLV VDYLGIFDDV AAALEFDDQS VKQVISNIQE LKDKLPEAMQ KCLAFFPGVD RSQQGYEGLI AAQQCLPNND TRDAFAAEYS VLARIWEALS PDPLLGQYET DYKWLSQIYQ SVQPSSGHGK LIWHSLGAKT IELIHQNVHV DAIRDDLDTL VLDADLLEAV LSNPDPKKAK ELEIKLNRRL RKHQGNPKFK DLSERLDALK ERFESGQINS VDFLKQLLQI AKETLQAERE TPPEEDEDRG KAALTALFNE VKTPETPIIV ERVVTDIDEI VRLVRFPGWQ GTQAGEREVK KALRKALFKY KLHADEELFE KAFSYIRQYY
|
| |
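The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length (the gene sequence shown ends in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3406955
END_BP = 3409957
GENE_LENGTH_BP = 3003
PROTEIN_LENGTH_AA = 1000


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Under these assumptions, 3409957 - 3406955 + 1 gives 3003 bp, and 3003 / 3 - 1 gives 1000 aa, matching the Gene Length and Protein Length fields.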