Gene Nther_1993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1993 
SymbolhsdR 
ID6314968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2100137 
End bp2103385 
Gene Length3249 bp 
Protein Length1082 aa 
Translation table11 
GC content34% 
IMG OID642644380 
Producttype I restriction enzyme EcoKI subunit R 
Protein accessionYP_001918148 
Protein GI188586603 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000937848 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.215415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAGA GCAACTTTGA TTTCTTAACA GGTGACTGGT CTATTCTTTC AGAATTAGGA 
GAGATAGCCG AAAAAAACCT ACATACTGAC CCCAATACGA CAATAATTAA ACTTCGCATG
TTTGCAGAAA CATTAGTGAA GTTTATTTTT GCTGTAGAAA ATCTTGAAGA ACCTGAAGAT
CAGAAACAAG TATCTCGATT AGCCATATTG AAAAAAGAAG ATTTACTTAC AGATGACTTG
CTAGATTTAT TTCATACAAT CAGAAAAATA GGAAACAAGG CCGCTCATGC TGGGTATGGG
AATATTGATG ATGCAAAAAC TCTTCTGAGG ATGGCTTTTC GTATATCAGT TTGGTTTATG
CAAGTTTATG GCAGATGGGA TTTTGAACCA CCTGCATATG TTGAACCTGA AAAAATAGAT
TTTGAATCGC TTGAAAAAGA AAGAGAAAGA TTAGAGAAAA TTGCATCTTC TTATGAGAAG
AAGGTTAAAA AACTAGAAAA TGAATTAAAT AGTTTAAGAA GTAAAGCTAG TCGGGATGAT
GAGAAAAAAG AGCGAAAAAC AAAAGCCAGG CAACTAGGCA CAGGCTTGGA ATTAACTGAA
GCTGAAACAC GTACTATTAT AGATGATAAA TTAAGAGCTG TTGGTTGGGA AGCTGATACA
GAAACTATCC GATATAGTAA GGGTATACGT CCAGAAAAGG GACGAAACCT TGCTATTGCT
GAATGGCCTG TCAAAAATGG CTCTGTTGAC TATGCCTTGT TTGTAGGAAT GCAATTTATA
GGGATTATAG AAGCTAAGCG AAAAAGTAAG AACATACAAT CTGATATAGA GCAGGCTAAG
CAATACTCCA AACGAATCTA TCAGGTGGAA AATGAAGAAA TATCAGGTCC ATGGAACGGA
TACAAAGTAC CTTTTCTATT TGCTACTAAT GGTCGTCCTT ATTTAAAGCA ACTAGAACAC
AAATCTGGTA TTTGGTTTTT AGATGCAAGA AAGAAAACTA ATCATCCTCG GCCTCTGAAG
GATTGGTACT CTCCAGAAGG ACTAACAGAT TTATCAAAGA AAAATATAGA AGAAGCAACT
AAAGAACTGA AAGAAGAACC GTTAGATTAT CTTGGATTAC GCTACTATCA AGAAGAAGCT
ATAAAAGCTA TTGAAAAGGG ATTAGAAGAA GGAAGACAAA GTCTTCTTAT AGCTATGGCA
ACAGGAACAG GTAAAACGAG AATGGCAATT GGGTTAATAT ACCGTTTGGT TAAGTCCAGA
AGATTTAAGA GAGTTTTATT TCTAGTTGAC AGAAACGCTC TTGGAAAGCA AGCTGAGAAT
GCCTTTAAAG ATTCAAAGCT AGAAAGTTTT AACTCGTTTA CTGAAATATT TGAGCTACAG
TCTCTTAAAG CACCTAACCC AAATCCAGAA ACTAAAGTCC ATATTTCTAC TGTGCAAGGA
ATGATGCGAA GAATATATTA TAATGATCCT GAAAAGGACA AACCTACAGT TGATCAATAT
GATTGTATAG TGGTAGATGA AGCACATCGA GGCTATACTT TAGACAGCGA AATGGAAGAG
CTAGAACTAT ACTTTAGAGA TCATAATGAC TATGTTAGTA AATATCGTCA GGTACTTGAT
TACTTTGATG CTGTTAGAAT TGGGCTTACT GCAACACCAG CTCTTCATAC TGTGGAAATT
TTTGGGAGCC CAATATATAC CTATTCCTAT CGTGAAGCAG TGGTAGATGG TTATTTAATT
GACCATGAGC CCCCATATCA AATTAATACT AAATTAAAGA AAGAAGGGAT TAATTGGCAA
GTTGGAGAAG AAGTAGATGT TTATGATGCC AAAAGTGGAG ATATAAAGAA AGAAGTTATG
GAAGATGAGG TAGAAATAGA TGTCTCCCAA TTTAACAAAC AAGTAATAAC TGAAAGTTTT
AACAGAACTG TCATTAAAGA ACTCGTGAAC TATATTCATC CGGAGCTAGA GGGAAAAACT
TTAATATTTG CTGCTAATGA TGATCATGCT GACACAATTG TAAGAATATT AAAGGAAGAA
TTAGAAAATA AATATGGTCC AATTGAAGAC AATGCAGTTA TGAAAATAAC TGGGTCCCTA
AAAAATCCAC TCCAAGCAAT AAAATTATTT CAAAACGAAC GTTTACCTAA TATTGTTGTG
ACTGTTGATT TACTTACTAC AGGCGTTGAT GTGCCACCTA TTTGTAGTCT TGTGTTTATG
CGAAGGGTAC GGTCACGAAT CTTATATGAA CAAATGCTAG GTAGGGCAAC TAGAAGATGT
GATGAGATTG ATAAGGATCA CTTTAATATA TTTGATGCAG TAGAAATTTA CGAAGCTTTA
AAGCCTTATA CTGATATGAA ACCCGTTGTA AAAAAACCAA ATGTACCAGT GAAACAGTTG
GTAAATGAAC TAGAACAGAT AAACTCAACT GATAAGCAAA AAAATCATAT AGATCAAATT
AAAGCAAAAA TTCAACGTAA AAGTAAACAT TGGACCGAAG AAGAAAAAGA AAATTTTAAA
GCATTGACTG GCGGAAAAAC TGTTGATGAA TATATTGATT GGATGAAAAA CTCAGAAACA
GAAGAAATTG TAAATGAGTT GCAAAAAAGT GAAAGTATAG TTAATTACAT TGATGAAAAC
CGTGAAAGAC CTCAATATCA GTATATCTCA AACCATAAGG ATGAACACTT ATCTACAACT
AGGGGTTATG GGAACGCAGA AAAGCCTGAA GATTATTTAG AAGGGTTTAA ACAGTTTATT
GAAGAAAATA TAAATCACAT CCCTGCATTA AAAGTAGTTT GTCAGAGACC TAAGGAGCTT
ACTAGAGAGG ATTTAAGAAA ACTTAAAATA GAACTTGATC AACAAGGTTA TAACGAAAAG
AACTTGCAAG CTGCTTGGAG AGATGCTAAA AATGAAGATA TAGCAGCTGA TATTATAAGC
TTTATTAGAC AGTTAACAGT GGGTGATGCA TTAGTTAGTC ATGAAGAGAG AGTCAAAAAT
GCAATGAAAA GAATCTATAG AATGAAAGCA TGGCCACCTG TACAAAAAAA ATGGCTTGAA
AGAATAGAAA AGCAACTTTT AGAAGAAAAG GTACTAGGCC CGGATCCAGA AGAAGCTTTC
GAAGTCCAAC CATTTAAAAG ACATGGAGGT TATAAGCAAT TAAATAAGAT TTTTGATGGA
CAAATAGATA TGATTGTTGA AAAAATTAAT GAGGAACTAT TTAACCAGGA AGGAAGGGAT
CACGTCTAA
 
Protein sequence
MQKSNFDFLT GDWSILSELG EIAEKNLHTD PNTTIIKLRM FAETLVKFIF AVENLEEPED 
QKQVSRLAIL KKEDLLTDDL LDLFHTIRKI GNKAAHAGYG NIDDAKTLLR MAFRISVWFM
QVYGRWDFEP PAYVEPEKID FESLEKERER LEKIASSYEK KVKKLENELN SLRSKASRDD
EKKERKTKAR QLGTGLELTE AETRTIIDDK LRAVGWEADT ETIRYSKGIR PEKGRNLAIA
EWPVKNGSVD YALFVGMQFI GIIEAKRKSK NIQSDIEQAK QYSKRIYQVE NEEISGPWNG
YKVPFLFATN GRPYLKQLEH KSGIWFLDAR KKTNHPRPLK DWYSPEGLTD LSKKNIEEAT
KELKEEPLDY LGLRYYQEEA IKAIEKGLEE GRQSLLIAMA TGTGKTRMAI GLIYRLVKSR
RFKRVLFLVD RNALGKQAEN AFKDSKLESF NSFTEIFELQ SLKAPNPNPE TKVHISTVQG
MMRRIYYNDP EKDKPTVDQY DCIVVDEAHR GYTLDSEMEE LELYFRDHND YVSKYRQVLD
YFDAVRIGLT ATPALHTVEI FGSPIYTYSY REAVVDGYLI DHEPPYQINT KLKKEGINWQ
VGEEVDVYDA KSGDIKKEVM EDEVEIDVSQ FNKQVITESF NRTVIKELVN YIHPELEGKT
LIFAANDDHA DTIVRILKEE LENKYGPIED NAVMKITGSL KNPLQAIKLF QNERLPNIVV
TVDLLTTGVD VPPICSLVFM RRVRSRILYE QMLGRATRRC DEIDKDHFNI FDAVEIYEAL
KPYTDMKPVV KKPNVPVKQL VNELEQINST DKQKNHIDQI KAKIQRKSKH WTEEEKENFK
ALTGGKTVDE YIDWMKNSET EEIVNELQKS ESIVNYIDEN RERPQYQYIS NHKDEHLSTT
RGYGNAEKPE DYLEGFKQFI EENINHIPAL KVVCQRPKEL TREDLRKLKI ELDQQGYNEK
NLQAAWRDAK NEDIAADIIS FIRQLTVGDA LVSHEERVKN AMKRIYRMKA WPPVQKKWLE
RIEKQLLEEK VLGPDPEEAF EVQPFKRHGG YKQLNKIFDG QIDMIVEKIN EELFNQEGRD
HV