Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1993 |
Symbol | hsdR |
ID | 6314968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2100137 |
End bp | 2103385 |
Gene Length | 3249 bp |
Protein Length | 1082 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642644380 |
Product | type I restriction enzyme EcoKI subunit R |
Protein accession | YP_001918148 |
Protein GI | 188586603 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000937848 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.215415 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAGA GCAACTTTGA TTTCTTAACA GGTGACTGGT CTATTCTTTC AGAATTAGGA GAGATAGCCG AAAAAAACCT ACATACTGAC CCCAATACGA CAATAATTAA ACTTCGCATG TTTGCAGAAA CATTAGTGAA GTTTATTTTT GCTGTAGAAA ATCTTGAAGA ACCTGAAGAT CAGAAACAAG TATCTCGATT AGCCATATTG AAAAAAGAAG ATTTACTTAC AGATGACTTG CTAGATTTAT TTCATACAAT CAGAAAAATA GGAAACAAGG CCGCTCATGC TGGGTATGGG AATATTGATG ATGCAAAAAC TCTTCTGAGG ATGGCTTTTC GTATATCAGT TTGGTTTATG CAAGTTTATG GCAGATGGGA TTTTGAACCA CCTGCATATG TTGAACCTGA AAAAATAGAT TTTGAATCGC TTGAAAAAGA AAGAGAAAGA TTAGAGAAAA TTGCATCTTC TTATGAGAAG AAGGTTAAAA AACTAGAAAA TGAATTAAAT AGTTTAAGAA GTAAAGCTAG TCGGGATGAT GAGAAAAAAG AGCGAAAAAC AAAAGCCAGG CAACTAGGCA CAGGCTTGGA ATTAACTGAA GCTGAAACAC GTACTATTAT AGATGATAAA TTAAGAGCTG TTGGTTGGGA AGCTGATACA GAAACTATCC GATATAGTAA GGGTATACGT CCAGAAAAGG GACGAAACCT TGCTATTGCT GAATGGCCTG TCAAAAATGG CTCTGTTGAC TATGCCTTGT TTGTAGGAAT GCAATTTATA GGGATTATAG AAGCTAAGCG AAAAAGTAAG AACATACAAT CTGATATAGA GCAGGCTAAG CAATACTCCA AACGAATCTA TCAGGTGGAA AATGAAGAAA TATCAGGTCC ATGGAACGGA TACAAAGTAC CTTTTCTATT TGCTACTAAT GGTCGTCCTT ATTTAAAGCA ACTAGAACAC AAATCTGGTA TTTGGTTTTT AGATGCAAGA AAGAAAACTA ATCATCCTCG GCCTCTGAAG GATTGGTACT CTCCAGAAGG ACTAACAGAT TTATCAAAGA AAAATATAGA AGAAGCAACT AAAGAACTGA AAGAAGAACC GTTAGATTAT CTTGGATTAC GCTACTATCA AGAAGAAGCT ATAAAAGCTA TTGAAAAGGG ATTAGAAGAA GGAAGACAAA GTCTTCTTAT AGCTATGGCA ACAGGAACAG GTAAAACGAG AATGGCAATT GGGTTAATAT ACCGTTTGGT TAAGTCCAGA AGATTTAAGA GAGTTTTATT TCTAGTTGAC AGAAACGCTC TTGGAAAGCA AGCTGAGAAT GCCTTTAAAG ATTCAAAGCT AGAAAGTTTT AACTCGTTTA CTGAAATATT TGAGCTACAG TCTCTTAAAG CACCTAACCC AAATCCAGAA ACTAAAGTCC ATATTTCTAC TGTGCAAGGA ATGATGCGAA GAATATATTA TAATGATCCT GAAAAGGACA AACCTACAGT TGATCAATAT GATTGTATAG TGGTAGATGA AGCACATCGA GGCTATACTT TAGACAGCGA AATGGAAGAG CTAGAACTAT ACTTTAGAGA TCATAATGAC TATGTTAGTA AATATCGTCA GGTACTTGAT TACTTTGATG CTGTTAGAAT TGGGCTTACT GCAACACCAG CTCTTCATAC TGTGGAAATT TTTGGGAGCC CAATATATAC CTATTCCTAT CGTGAAGCAG TGGTAGATGG TTATTTAATT GACCATGAGC CCCCATATCA AATTAATACT AAATTAAAGA AAGAAGGGAT TAATTGGCAA GTTGGAGAAG AAGTAGATGT TTATGATGCC AAAAGTGGAG ATATAAAGAA AGAAGTTATG GAAGATGAGG TAGAAATAGA TGTCTCCCAA TTTAACAAAC AAGTAATAAC TGAAAGTTTT AACAGAACTG TCATTAAAGA ACTCGTGAAC TATATTCATC CGGAGCTAGA GGGAAAAACT TTAATATTTG CTGCTAATGA TGATCATGCT GACACAATTG TAAGAATATT AAAGGAAGAA TTAGAAAATA AATATGGTCC AATTGAAGAC AATGCAGTTA TGAAAATAAC TGGGTCCCTA AAAAATCCAC TCCAAGCAAT AAAATTATTT CAAAACGAAC GTTTACCTAA TATTGTTGTG ACTGTTGATT TACTTACTAC AGGCGTTGAT GTGCCACCTA TTTGTAGTCT TGTGTTTATG CGAAGGGTAC GGTCACGAAT CTTATATGAA CAAATGCTAG GTAGGGCAAC TAGAAGATGT GATGAGATTG ATAAGGATCA CTTTAATATA TTTGATGCAG TAGAAATTTA CGAAGCTTTA AAGCCTTATA CTGATATGAA ACCCGTTGTA AAAAAACCAA ATGTACCAGT GAAACAGTTG GTAAATGAAC TAGAACAGAT AAACTCAACT GATAAGCAAA AAAATCATAT AGATCAAATT AAAGCAAAAA TTCAACGTAA AAGTAAACAT TGGACCGAAG AAGAAAAAGA AAATTTTAAA GCATTGACTG GCGGAAAAAC TGTTGATGAA TATATTGATT GGATGAAAAA CTCAGAAACA GAAGAAATTG TAAATGAGTT GCAAAAAAGT GAAAGTATAG TTAATTACAT TGATGAAAAC CGTGAAAGAC CTCAATATCA GTATATCTCA AACCATAAGG ATGAACACTT ATCTACAACT AGGGGTTATG GGAACGCAGA AAAGCCTGAA GATTATTTAG AAGGGTTTAA ACAGTTTATT GAAGAAAATA TAAATCACAT CCCTGCATTA AAAGTAGTTT GTCAGAGACC TAAGGAGCTT ACTAGAGAGG ATTTAAGAAA ACTTAAAATA GAACTTGATC AACAAGGTTA TAACGAAAAG AACTTGCAAG CTGCTTGGAG AGATGCTAAA AATGAAGATA TAGCAGCTGA TATTATAAGC TTTATTAGAC AGTTAACAGT GGGTGATGCA TTAGTTAGTC ATGAAGAGAG AGTCAAAAAT GCAATGAAAA GAATCTATAG AATGAAAGCA TGGCCACCTG TACAAAAAAA ATGGCTTGAA AGAATAGAAA AGCAACTTTT AGAAGAAAAG GTACTAGGCC CGGATCCAGA AGAAGCTTTC GAAGTCCAAC CATTTAAAAG ACATGGAGGT TATAAGCAAT TAAATAAGAT TTTTGATGGA CAAATAGATA TGATTGTTGA AAAAATTAAT GAGGAACTAT TTAACCAGGA AGGAAGGGAT CACGTCTAA
|
Protein sequence | MQKSNFDFLT GDWSILSELG EIAEKNLHTD PNTTIIKLRM FAETLVKFIF AVENLEEPED QKQVSRLAIL KKEDLLTDDL LDLFHTIRKI GNKAAHAGYG NIDDAKTLLR MAFRISVWFM QVYGRWDFEP PAYVEPEKID FESLEKERER LEKIASSYEK KVKKLENELN SLRSKASRDD EKKERKTKAR QLGTGLELTE AETRTIIDDK LRAVGWEADT ETIRYSKGIR PEKGRNLAIA EWPVKNGSVD YALFVGMQFI GIIEAKRKSK NIQSDIEQAK QYSKRIYQVE NEEISGPWNG YKVPFLFATN GRPYLKQLEH KSGIWFLDAR KKTNHPRPLK DWYSPEGLTD LSKKNIEEAT KELKEEPLDY LGLRYYQEEA IKAIEKGLEE GRQSLLIAMA TGTGKTRMAI GLIYRLVKSR RFKRVLFLVD RNALGKQAEN AFKDSKLESF NSFTEIFELQ SLKAPNPNPE TKVHISTVQG MMRRIYYNDP EKDKPTVDQY DCIVVDEAHR GYTLDSEMEE LELYFRDHND YVSKYRQVLD YFDAVRIGLT ATPALHTVEI FGSPIYTYSY REAVVDGYLI DHEPPYQINT KLKKEGINWQ VGEEVDVYDA KSGDIKKEVM EDEVEIDVSQ FNKQVITESF NRTVIKELVN YIHPELEGKT LIFAANDDHA DTIVRILKEE LENKYGPIED NAVMKITGSL KNPLQAIKLF QNERLPNIVV TVDLLTTGVD VPPICSLVFM RRVRSRILYE QMLGRATRRC DEIDKDHFNI FDAVEIYEAL KPYTDMKPVV KKPNVPVKQL VNELEQINST DKQKNHIDQI KAKIQRKSKH WTEEEKENFK ALTGGKTVDE YIDWMKNSET EEIVNELQKS ESIVNYIDEN RERPQYQYIS NHKDEHLSTT RGYGNAEKPE DYLEGFKQFI EENINHIPAL KVVCQRPKEL TREDLRKLKI ELDQQGYNEK NLQAAWRDAK NEDIAADIIS FIRQLTVGDA LVSHEERVKN AMKRIYRMKA WPPVQKKWLE RIEKQLLEEK VLGPDPEEAF EVQPFKRHGG YKQLNKIFDG QIDMIVEKIN EELFNQEGRD HV
|
| |