Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0797 |
Symbol | |
ID | 6314479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 831523 |
End bp | 834525 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642643172 |
Product | protein of unknown function DUF450 |
Protein accession | YP_001916972 |
Protein GI | 188585427 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00985204 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTGTTG ATCACAGTGA GATCTCCCTT GAAAAAACTA TAGAATCTTA TCTAATAAAC CATGGTGGAT ATATTAAAGG TGACCCCTCT GAATTTGCTA GGGAAAAAGC ACTATTTCCC AAAACCTTTA TAGCATTCAT CAAAGATACC CAACCAGATA AGTGGGAAAA ACTAGAGAGA ATGCACGGTG ATCAAGTAGA AGAAAAAGTT ATCTACCGTC TAACTAGAGA ACTTTCACAG CGGGGAATGT TAGATGTACT GAGAAAAGGG ATTACTGATT TTGGTGTAAA ACTAACAACA GCTTATTTTC CACCAGCCAG TGGATTAAAT CCTGAAATAC AAGAACTATA TGATAAAAAC AGGTTAACAG CTACCAGACA AGTGAGATAT TCTACTAAGA ATGAAAACTC TTTAGATTTA GTGTTGAGCA TTAATGGTTT ACCAGTTGCA ACAGCAGAGC TTAAAAACCC CTCCACCGGT CAGACATTCC AAGATGCAAA AAAACAATAC AAAGAAGACC GTGACCCAAA AGAACCAATT TTCCAGTTTA AAAAACGAAC CCTCGTTCAT TTTGCTTTGG ATACTGATGA AGTATATATG ACAACAAAAC TTGCAAAAGG GAATACTGTA TTTCTCCCCT TTAACAAAGG AAAAGATGGC GGTAAAGGGA ATCCTGAAAA TCCTAATGGA TACCGGACAG CTTATTTGTG GGAAGAAATA TGGGAAAAAG ATACCTGGAT GAAGATTGTC TCCCGATTTA TGAACTTAGA AGTTAAAAAA GAAGAAAAGG ATGGCAAACA GATTACTAAG GAAAACATTA TCTTTCCTAG ATATCACCAA TTAGAAGCAG TGTTTAATAT AACAAATGAT GCTCAAAAGA GAGGACCAGG TAAAAACTAT TTAGTTCAAC ACTCCGCAGG AAGTGGAAAA AGTTTAACCA TAGGGTGGCT TGCTCATCGA TTATCAAATT TACATGATGA CAATAATGCT CCTGTCTTTA ACAGTGTGAT AGTTATTACC GATCGTGTTG TCCTTGATAA ACAGCTTCAA GAAACTATAT ATCAGATTGA CCATCAACAG GGAGTAGTAG AGCGGATAGA TAGACATTCA TCTCAGTTAA AGGATGCCCT TGAAAAAGGA AAGAAAATAA TAATCACCAC ACTGCAAAAA TTCCCTGTGA TATTAGATGA AATTAAAGGC TTGCCTGAAA GAAATTACGC GGTAATAGTA GATGAAGCCC ACAGTTCACA AACCGGAAAA TCCGCAAGAG CAGTCAGAGA AGTTCTATCT GCCGAAACAT TAGAAGAGGC AGAACAAAGA CAGCAGGACA TAGAAGATGA ATTTGATCCA GAAGAAGAAA TTATAAAATC TATGAAAAAA CAAGGCAAGC AGGAAAATCT TAGCTTTTTT GCATTCACAG CAACCCCAAA GGCTAAAACA TTAGAAACCT TTGGCCATAA AGACAAAGAA GGGAAACCTG TTCCCTTTCA TTTATATCCC ATGAGACAAG CTATTGAAGA AGGGTTTATA TTGGACGTAT TGAAAAACTA TACCACTTAT AAAACCTATT TTAAACTGGC TAAAAGAATT GAAGATGATC CAAACTTAGA TAAGAAAAAA GCAGCAAAAG AAGTAGCTAG ATTTGTTAGC CTTCACCCGC ACAATCTAGC GCAAAAAACT GAAGTGATGA TAGAACACTT CCGAAATGTA ACTATGCACA AAATAAATGG TAAAGCTAAG GCAATGCTGG TGACAAGCTC TAGATTTCAT GCACTAAGAT ATAAATTTGA ATTTGATAGA TATTTAAAAG AAAACGGGTA CGATGATATT AAAACCCTAG TTGCTTTTTC AGGAACAGTA CAAGACCCTG AACTTGATAA AGAATATACA GAATCGGAAA TTAATCAGAT AAGAGAATCT GAACTTCCAC AAAAATTTGA TACTGATGGA TACCGTCTTC TTTTAGTTGC AGACAAATAT CAAACTGGAT TTGATCAGCC ATTACTTCAT ACAATGTACG TTGATAAAAA ACTGACTGAT GTTAAAGCAG TCCAAACCCT GTCCAGGTTA AATCGCCCTT ATCCAGGTAA AGATGATACA TTTGTCCTCG ATTTTGTCAA TGAAGCGGAT GATATTAAAA AAGCCTTTCA ACCTTTTTAT GAGCAAACAA CCGTTGAAGA AACCACTGAT CCCAATCTGC TTTATGACTT GAAGTATAAG CTGGATGATT TTCAGGTATA TTGGCAAACT GAGATAGATA ATTTCTGTAA AGTATTCTTT AAACCACCAG AAAAGCAAAA TAAAGATAAA GATGCAGCTA TGTTAAATTC ATATGTAGAT CCTGCGGTAG ATAGATTTAA AAATAAACCA AAAGAAGAGC AGGAGGAATT TCACAAGACA CTTACAAGCT TTATCAGAAC TTATGCTTTT TTACTGCAAA TCATCCCCTT TCAAGACCAA GAGCTTCATA AACTAGATGC TTATGGACGA TTTTTACTAA CTAAACTACC TAGTCAAAGA AACCATGAAG ATACAGTAGA ATTAAATAAT GAAATTGATT TAGAATATTA CAGGTTAGAA AAAACCGGTG ACCACAGTGT TGTATTGGAA GAACAAGGTG AATATGGAGT TAGGGGTACC ACCCATGCAG GAACAGGACA AAATAAAGAC AAAGAAGATG AAAAAGCTCC TTTGTCAGAG ATTATAGAAA TAATTAATGA ACGATTTGGA ACTGACTTCA CAGAAAATGA CTGGCTATTC TTTGAACAAA TTAAAAATGA CATGTTAGAG GATGAGGAAT TAGAGAAGCA TGCACGGAAC AATAGGAAAG ATAATTTTTA CTACGCTTTT AAAGACCACT TTGATAAAAA GACTATAAAG AGAAGAACTC AGAATATGGA ACTGTTTGCC ATGTTGATGG ATAATGAAGA ATTCAAAGAA AAGGTAATGG ACTTCTATAT GAATGAAGTA TTTAAGGAAT TTAAAAATAA AGCTAGTTCA TAA
|
Protein sequence | MSVDHSEISL EKTIESYLIN HGGYIKGDPS EFAREKALFP KTFIAFIKDT QPDKWEKLER MHGDQVEEKV IYRLTRELSQ RGMLDVLRKG ITDFGVKLTT AYFPPASGLN PEIQELYDKN RLTATRQVRY STKNENSLDL VLSINGLPVA TAELKNPSTG QTFQDAKKQY KEDRDPKEPI FQFKKRTLVH FALDTDEVYM TTKLAKGNTV FLPFNKGKDG GKGNPENPNG YRTAYLWEEI WEKDTWMKIV SRFMNLEVKK EEKDGKQITK ENIIFPRYHQ LEAVFNITND AQKRGPGKNY LVQHSAGSGK SLTIGWLAHR LSNLHDDNNA PVFNSVIVIT DRVVLDKQLQ ETIYQIDHQQ GVVERIDRHS SQLKDALEKG KKIIITTLQK FPVILDEIKG LPERNYAVIV DEAHSSQTGK SARAVREVLS AETLEEAEQR QQDIEDEFDP EEEIIKSMKK QGKQENLSFF AFTATPKAKT LETFGHKDKE GKPVPFHLYP MRQAIEEGFI LDVLKNYTTY KTYFKLAKRI EDDPNLDKKK AAKEVARFVS LHPHNLAQKT EVMIEHFRNV TMHKINGKAK AMLVTSSRFH ALRYKFEFDR YLKENGYDDI KTLVAFSGTV QDPELDKEYT ESEINQIRES ELPQKFDTDG YRLLLVADKY QTGFDQPLLH TMYVDKKLTD VKAVQTLSRL NRPYPGKDDT FVLDFVNEAD DIKKAFQPFY EQTTVEETTD PNLLYDLKYK LDDFQVYWQT EIDNFCKVFF KPPEKQNKDK DAAMLNSYVD PAVDRFKNKP KEEQEEFHKT LTSFIRTYAF LLQIIPFQDQ ELHKLDAYGR FLLTKLPSQR NHEDTVELNN EIDLEYYRLE KTGDHSVVLE EQGEYGVRGT THAGTGQNKD KEDEKAPLSE IIEIINERFG TDFTENDWLF FEQIKNDMLE DEELEKHARN NRKDNFYYAF KDHFDKKTIK RRTQNMELFA MLMDNEEFKE KVMDFYMNEV FKEFKNKASS
|
| |