Gene Information |
Locus tag | Tmz1t_0005 |
Symbol | |
ID | 7085103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 7271 |
End bp | 8407 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643697055 |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_002353704 |
Protein GI | 217968470 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAGT TTTTCGATGT AACCCTTGGC GAGGTCGTCG ATTTTTTCAA TGGCAAGGCC ATCAAGCCGG GTCAGGACGG AGAGTATCCA GCGTATGGCT CGAATGGGCT GATCGGAGGC GCACCGGACT GGAAGTATGA AAACTCCATC ATCATCGGGC GCGTGGGGGC GTACTGCGGT TCGGTTGCAT ACTGCAAGAG TCGGTTCTGG GCTTCTGATA ACACGATCGT GGCAAGGCCC AAGAGCGGGG ATGTCGGGTA TTTCTACTAT CTCCTGAAAG CACTGGAACT CAACCGTTAT GCCGGAGGTG CGGCGCAGCC ACTTGTCACA CAAACGGTTC TAAAAGGTGT TCCTGCAAGA GTTCCTGACA TCCCAACCCA GCGCCGCATT GCCTCCATCC TGTCCGCCTA CGACGACCTG ATCGAAAACA ACACGCGACG GATCGCCATC CTTGAGGAAA TGGCCCGGAG AATCTACGAG GAGTGGTTCG TCCGCTTCCG TTTTCCGGGG CATGAACAGG TGAAGATGGT GGAGTCTGAG CTGGGGTTGA TCCCGGAGGG GTGGAAGGCG ACGAATATCG GAGAGGTTGC CGAGAATCAC GATAGAAAGC GCAAACCTTT ATCGAAGATG CAGCGGGAGA AGTTCAAGGG GCCATATCCG TACTATGGCG CTGCAAAAAT CTTTGACTAC GTTGAGGATT ACATTTTTGA TGGGCGATTC GTCCTCATGG CAGAAGACGG TAGCGTCATC ACCCCCGATG GATTTCCCGT TCTTCAGTTG GCCAATGGGA GATTCTGGGC GAATAACCAT ACGCACATTT TGCGCGGAAC GCCGGATGCA TCGACTGAGT TTATTTACCT CAGACTGTCT TCGCAAAAGG TAAGTGGCTA CATAACCGGA GCTGCACAGC CGAAGATCAC ACAGGCAAAC ATGAATCGAA TACCGGTTTG TCTGCCGCCG CGAGACTTGA TGGCGCGATT TACGGAATTG GTGGGGCCGA AGTTCGATCT CATCGACTGC TTGGAAAGGA AACACACCAA TCTCAGAGCT ACCCGAGACC TCCTGCTCCC CAAGCTGATC TCCGGCGAAC TCGACGTTTC CACCCTGCCC GAACCTGAGG AGGCCATCGC GGCATGA
|
Protein sequence | MNEFFDVTLG EVVDFFNGKA IKPGQDGEYP AYGSNGLIGG APDWKYENSI IIGRVGAYCG SVAYCKSRFW ASDNTIVARP KSGDVGYFYY LLKALELNRY AGGAAQPLVT QTVLKGVPAR VPDIPTQRRI ASILSAYDDL IENNTRRIAI LEEMARRIYE EWFVRFRFPG HEQVKMVESE LGLIPEGWKA TNIGEVAENH DRKRKPLSKM QREKFKGPYP YYGAAKIFDY VEDYIFDGRF VLMAEDGSVI TPDGFPVLQL ANGRFWANNH THILRGTPDA STEFIYLRLS SQKVSGYITG AAQPKITQAN MNRIPVCLPP RDLMARFTEL VGPKFDLIDC LERKHTNLRA TRDLLLPKLI SGELDVSTLP EPEEAIAA
|
| |
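The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS includes its stop codon (the gene sequence above ends in TGA); neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for locus tag Tmz1t_0005.
length_bp = gene_length(7271, 8407)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields listed in the record.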