Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2740 |
Symbol | |
ID | 7873480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2962775 |
End bp | 2965831 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643699662 |
Product | hypothetical protein |
Protein accession | YP_002889717 |
Protein GI | 237653403 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGACC ACCCCGTGAC CTCGTCATCC ACCTTGCCCC AGCCCGCCGA CAGCGCCAGC CCCGCCGCCC TCGACGCCTT CATCGCGCGC TGGCAGCGCG CCGGTGGCAG CGAGCGCGCC AACTACCAGC TCTTCCTCGC CGAGCTGTGC GAACTGCTCG CGCTGCCGCG CCCCGACCCT GCCGGCGAGG ACACCCGCGA CAACGCCTAC GTGTTCGAGC GCCGGGTGCT GATGCGCCAG CCCGACGGCA GCGCCAGCAA CGGCTTCATC GACCTCTACC GCCGCGGCGC CTTCGTGCTC GAGGCCAAGC AGTCGGGCAG GACGCTCGAC AGCTCGGGCT GGGACAAGGC CATGCTGCGC GCCCACAACC AGGCCGACCA GTACGCCCGC GCGCTCCCCG CCGACGAGGG CCGGCCGCCC TTCATCCTGG TGGTCGACGT CGGCCGCAAC ATCGAGCTCT ACGCCGAGTT CAGCCGCTCG GGCGCCACCT ACACGCCCTA CCCCGACCCC CGCAGCCACC GCATCCGCCT CGACGACCTG CATCGCGAAG ACATCCGCCA GCGCCTGCGC GAGGTCTGGC TCGACCCGCT CGCGCTCGAC CCCGCCCGCC GCTCGGCGCG CGTCACGCGC GAGATCGCCG ACCGCCTGGC CTCGCTCGCG CGCTCGCTCG AAGCCGCCGG TCACGACCCG CAGCAGGTCG CCGGCTTCCT GATGCGCGCA CTGTTCACCA TGTTCGCCGA GGACGTCGGC CTGCTGCCGC CGCGCGCCCT CACCGAACTG CTCGAAAGCC TCAAGGGCCA GCCGCACACC TTCGCGCCGA TGCTCGAGCA CCTGTGGCAG AACATGAACA CCGGCGGCTT CTCGCCGATC CTGCGCAACA AGGTGCTGCG CTTCAACGGC GGCCTGTTTG CCGAGGCCAG CGCCATCCCG CTCGACCGCG ACCAGCTCGA GCTGCTGCTG AAGGCATCGA AGGCCGACTG GCGCTACGTC GAACCCGCCA TCTTCGGCAC CCTGCTCGAG CGCGCGCTCG ACCCGCGCGA ACGCCACAAG CTCGGCGCCC ACTACACCCC GCGCGCCTAC GTCGAACGCC TGGTGCTGCC CACCGTCATC GAACCGCTGC GCGCGGAATG GCGCGAGGTG CAGGCGGCCG CGCTCACCTA CGAGCAGCAG GGCAAGCACA GGGAGGCGGT CGCCGAGATC CGCACTTTCC ACCGCCACCT GTGCACCGTG CGCGTGCTCG ACCCCGCCTG CGGCAGCGGA AATTTCCTGT ATGTGACGCT GGAACACCTC AAGCGCCTCG AAGGCGAGGT GCTCAACCTG CTGCACGACC TCGGCGAATC CCAGGGCCTG CTCGAACTCG AAGGCGTCAC CGTCGATCCG CAGCAGTTCC TCGGCCTCGA GATCAACCCG CGCGCCGCCC GCATCGCCGA GATGGTGCTG TGGATCGGCT ACCTGCAATG GCACTTCCGC ACCCACGGCT CGGTGAACCC GCCCGAGCCG GTGCTGCGCG ACTTCCGCAA CATCGAGCAC CGCGACGCGC TGATCGAGTA CGAGCGCGAG GAACCGGTCA CCGACGAGGC GGGACGCCCG GTCACGCGCT GGGACGGCGT GAGCTACCGG AAGAGCCCGA TCACCGGCGA GGACATCCCG GACGAGACCG CGCAGGTGGT GCAGATACGC TACGTGAACC CGCGCAAGGC GGCGTGGCCG CAGGCGGATT ACATCGTGGG GAATCCGCCG TTCATCGGCG CCGCCACCAT GCGCCGCGCG CTCGGCGACG GCTATGTGGA CGCGGTACGC CGCACCTGGC CCGAGGTGCC GGAATCGGCC GATTTCGTCA TGTACTGGTG GCACATCGCC AGCGAGACCG TGCGCGCGGA CAAGGCGCGC CGCTTCGGCT TCATCACCAC CAACAGCATC AAGCAGACCT TCAACCGCCG CGTCGTGCAG GCGCAGCTCG AGGCGAAGAA CCCGCTGTCG CTGGCGTTCG CGATTCCGGA TCACCCGTGG GTGGATGCGG CGGACGGGGC GGCGGTGAGG ATTGCGATGA CGGTGGGGGC GGGGGGTGAG CAGGACGGGC AATTGTCGGA AGTAAAGGAC GAACGTGAGA CTGACCAAGA CGAAATCGAC GTAACGCTAC AAACACGTAG TGGACGCTTG CATGCGGATC TCCGCAGCGG CGCAAACGTG ACCGGTGCAA TCTCGCTTCG GTCAAATGTT GGCATCAGTT CGCCGGGGGT AAAGCTCCAC GGTGCCGGCT TCATCGTCAC GCCCGACGAG GCAAGGTCAC TTGGCCTCGG CACAATTGGT GGCATTGAGC ACCACATTCG GGCCTATCGG AATGGACGCG ACCTCACCGA TAGACCCCGC GGAGTAATGG TCATCGATCT CTTCGGCCTC ACCGTCGACG AAGTGCGAAC CCGATACCCA GCGATTTATC AGTGGGTACT AGAGCGGGTG AAGCCAGAGC GCGATCAGAA CAACCGTGCA ATCTACCGAG AAAATTGGTG GATCTTTGGT GAGGCTCGAA GGGACTGGCG CGCGATGTCT GCGGGCTTGA AGGCACATGT CGCGACAGTG GAAACAATGA AGCATCGGGT CTTCCAGTTG CTCGACGCCA ACATCCTTCC GGACAACAAG GTGGTGAATG TTGCGACCGA TGACGCCCTG CTACTCGGCA TCCTGGGTAG CAGGCTCCAT GTTGCGTGGG CCCTTGCGGC TGGTAGTCGC CTTGGCGTTG GCAACGACTC CGTCTATGTA AAAACCACCT GCTTCGAAAC CTTCCCCTTC CCCGACCCCT CGCCCGCGCA AGCCGCCCGC ATCCGCGACC TCGCCGAGCA GCTCGACGCT CACCGCAAGC GCCAGCAGGC GCTGCACCCC GAGCTGACCC TCACCGGCAT GTACAACGTG CTCGAAAAGC TGCGCGCCGG CGACACGCTC ACGCCCAAGG AACGCACGAT CCACGAGCAG GGCCTGGTGT CGGTGCTGCG CGAGCTGCAC GACGCGCTCG ACAGCGCAGT GTTCGAGGCC TACGGCTGGT GCGACCTCGC CGCGTAA
|
Protein sequence | MHDHPVTSSS TLPQPADSAS PAALDAFIAR WQRAGGSERA NYQLFLAELC ELLALPRPDP AGEDTRDNAY VFERRVLMRQ PDGSASNGFI DLYRRGAFVL EAKQSGRTLD SSGWDKAMLR AHNQADQYAR ALPADEGRPP FILVVDVGRN IELYAEFSRS GATYTPYPDP RSHRIRLDDL HREDIRQRLR EVWLDPLALD PARRSARVTR EIADRLASLA RSLEAAGHDP QQVAGFLMRA LFTMFAEDVG LLPPRALTEL LESLKGQPHT FAPMLEHLWQ NMNTGGFSPI LRNKVLRFNG GLFAEASAIP LDRDQLELLL KASKADWRYV EPAIFGTLLE RALDPRERHK LGAHYTPRAY VERLVLPTVI EPLRAEWREV QAAALTYEQQ GKHREAVAEI RTFHRHLCTV RVLDPACGSG NFLYVTLEHL KRLEGEVLNL LHDLGESQGL LELEGVTVDP QQFLGLEINP RAARIAEMVL WIGYLQWHFR THGSVNPPEP VLRDFRNIEH RDALIEYERE EPVTDEAGRP VTRWDGVSYR KSPITGEDIP DETAQVVQIR YVNPRKAAWP QADYIVGNPP FIGAATMRRA LGDGYVDAVR RTWPEVPESA DFVMYWWHIA SETVRADKAR RFGFITTNSI KQTFNRRVVQ AQLEAKNPLS LAFAIPDHPW VDAADGAAVR IAMTVGAGGE QDGQLSEVKD ERETDQDEID VTLQTRSGRL HADLRSGANV TGAISLRSNV GISSPGVKLH GAGFIVTPDE ARSLGLGTIG GIEHHIRAYR NGRDLTDRPR GVMVIDLFGL TVDEVRTRYP AIYQWVLERV KPERDQNNRA IYRENWWIFG EARRDWRAMS AGLKAHVATV ETMKHRVFQL LDANILPDNK VVNVATDDAL LLGILGSRLH VAWALAAGSR LGVGNDSVYV KTTCFETFPF PDPSPAQAAR IRDLAEQLDA HRKRQQALHP ELTLTGMYNV LEKLRAGDTL TPKERTIHEQ GLVSVLRELH DALDSAVFEA YGWCDLAA
|
| |