Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0365 |
Symbol | |
ID | 6316198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 381686 |
End bp | 384622 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642642750 |
Product | type III restriction protein res subunit |
Protein accession | YP_001916550 |
Protein GI | 188585005 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.000898416 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAATTAC AATTTAATCC TAACTTAGAC TTTCAACAAG AGGCTATTAG ATCAATTGTA GATATATTTG AAGGACAACC TATTACGCAT TCAAACTTTA CTGTTGCTAA TCTATCAGGA CAAATTGGTA TTCATGAAAC AAATATAGGG GTTGGGAATA AATTAGACCC TAGCTTTGAT GAAGAAGATA TACTTAAAAA TGTTAGAAAG ATTCAATTAA GAAATGGATT ACCACAGACT GAAAATATCG AAAAAGATGA CTATCATTTT ACAGTAGAAA TGGAAACGGG AACGGGTAAG ACCTATGTTT ATTTGAGAAC GCTATTTGAA CTTAATCAGA AATATGGATT TAAGAAATTT ATTATTGTTG TTCCTTCAGT TGCTATTAAA GAAGGTGTGG TTAAATCTAT AAATATTATG TCAGATCATT TCAAGCTCTT GTATGACAAT GTGATGTTTA GAGCTTATGA ATATCAATCC CAAAATATTG AAAGAATCAG AGACTTTGCC ACTAGTGACC ATATTCAAAT TATGGTTATG ACTATTCAGT CTTTTAATAA GGATAAGAAT GTCATAAATA ACGACCATGA AAGGACTAAT GGATTAAAAC CAATAGAATT TATACGGGAT ACTAACCCTA TTGTAGTAAT AGATGAACCT CAGTCAACTG TTTCTACCAA AAAAGCAGAA GATGCGGTTA TGTCTTTAAA TCCTTTGTGC ACATTGAGAT ATTCTGCTAC TCACAGGAAG AAACATAACT TAATGTATAA ATTAGATGCA GTGGATGCTT ATCAAAGACA GCTTGTAAAA CAAATCGAAG TAGCTAGTGT CACATCAAAA GATTACCATA ATGACGCTTA TTTGCGGTTA GTAAGTGTAG ATAATAGTAA GACACCAATT ACAGCAAAAA TTGAAATTGA TAAGCGGACT AAAAACGGAG GGATTAAAAG ACAAAGTGTA CAAGTTAAAA AAGGTGATGA CTTATTCGAA AAATCTGGAG GTCGTGAACA GTATAGTGGA TATATTGTGA GTGAGATATA TGCTAAAGAA GGTTCTGAAT ATGTTGATTT TACAAGTCGA AAACATATTG AATTGGGTGA AGTCAGAGGT GAGCTAGACG ATGAAGTAAT TAAGCGTACT CAGATTAGGA AGACCATTGA AGAGCATTTG GAGAAAGAAT TGAGACTTAA ACAAGAAGGT ATTAAAATTT TGAGTCTATT CTTTATAGAT AGAGTTTCTA ATTACAGATA TTACGATGAA GAAGGAAATC CCCAGAAAGG GAAATACGCT ATTTGGTTTG AAGAGGAATA TAAGGATATT ATTCAAAAGC CAAAGTATAG AACATTGTTT AATGACGTTG ATATTGAAAC GGAAGCTGAA GCTGTTCATA ACGGGTATTT TTCAAAAGAT AGAAAGGGTA AAGTAAAAGA TACCAGAGGT AATACCCAGG CTGATGAAGA TACTTATAAC CTGATAATGA AAGATAAAGA GAGATTACTT GACTTCAACT CAAAACTCAA ATTTATCTTT TCACATTCAG CTCTAAAAGA AGGTTGGGAC AACCCCAATG TTTTCCAAAT TTGCACTTTG AATGAAACAA AATCTGAAAT AAAGAAAAGA CAAGAAATCG GTAGAGGACT TAGGTTAGCA GTAGATCAAA ATGGTGAGAG AAGACATGGC TTTAATATAA ACACTCTAAC TGTAATGGCT AATGAATCTT ATGAAGACTT TGCAAAAGCT CTTCAGAAAG AAATTGAGGA AGAAGAAGGC ATTAAGTTTG GAGTGGTTGA AAAACATACT TTTGCTAATT TAAAAGTCGA GAGAGAGGGC GAATATCAGT ATCTAGAACA AAATGCTTCC GAAGAATTAT GGAATGATCT GAAATCTAAA GAATATATTG ATGATCAAGG CAAAATTACC GACAAATTAA AAGAAGATAT AAAAAACAAA AATTTCGAGG TACCAGAAGA ATATAAAGAA GTAGAGGACC AGGTAGTTGC AACTCTTAAA AAGATTGCAG GAAGTCTGCG TATCAATAAC GCCGATGATA AGAAGGAAAT CAAATTGAAT AAGCAAAGGT ATTTGAGCCC TGAATTTAAA GAACTTTGGG ATAGGATTAA ATATAAAACT ACTTATAATG TTGAATTTGA TACGGAAGAA CTGATTCAAG AATGTGTAGA AGAGATTAAA AAGAACTTAA TGATTGATAA AGCTAAAGTT ATATATACAA AAGGAGAAGT AGATATTAGT GCAGCAGGGA CTGTAGCGGA AGAAAAAGGT CGTTATGCTA TGGTAGTTGA TGATGCTAAA TTTAGACTTC CTGATATTAT AACTTACCTG CAAAATGAAA CTAGCCTTAC AAGAAAAACC ATAGTACGGA TACTAAAAGA ATCAGGCAAA CTTTATCAGT TTAAAAATAA TCCTCAAAAA TTTATGGATG AAGTTAGCAA GATTATTAAA ACGAAGATGA GACATTTAAT TGTTGACGGG ATAAAATATG AAAAAATCGG TGAAGAGGCT TACTATGCTC AAGAGCTATT TGAAAATGAA GAACTTTTTG GTTATCTTTC CAAGAATCTC GTGAAAAGTG AAAAGTCTGT TTATGATCAT GTGATTTGTG ATTCTGATGT TGAAGCCGAT TTTGCCCAAA AATTTGAAAA TAATGATCTA GTTAAAGTGT ACGCAAAGCT CCCTGATTGG TTTAAAATAG ACACACCGTT AGGAGATTAT AATCCTGATT GGGCAGTATT AATTGATAAG GACGGTGAAG AAAGGCTCTA TTTTGTAGTT GAAACTAAAG GAAGCGTTTT ATTCGAAGAA TTAAGACCGA GAGAAGAAGG AAAAATCAAA TGTGGTGAGA AGCATTTTGA AGCACTAGGA AATCACATTG AATTTGAGAA AAAAGATAAT TTTGAAGAGT TTATTGAGAA TGTTTGA
|
Protein sequence | MKLQFNPNLD FQQEAIRSIV DIFEGQPITH SNFTVANLSG QIGIHETNIG VGNKLDPSFD EEDILKNVRK IQLRNGLPQT ENIEKDDYHF TVEMETGTGK TYVYLRTLFE LNQKYGFKKF IIVVPSVAIK EGVVKSINIM SDHFKLLYDN VMFRAYEYQS QNIERIRDFA TSDHIQIMVM TIQSFNKDKN VINNDHERTN GLKPIEFIRD TNPIVVIDEP QSTVSTKKAE DAVMSLNPLC TLRYSATHRK KHNLMYKLDA VDAYQRQLVK QIEVASVTSK DYHNDAYLRL VSVDNSKTPI TAKIEIDKRT KNGGIKRQSV QVKKGDDLFE KSGGREQYSG YIVSEIYAKE GSEYVDFTSR KHIELGEVRG ELDDEVIKRT QIRKTIEEHL EKELRLKQEG IKILSLFFID RVSNYRYYDE EGNPQKGKYA IWFEEEYKDI IQKPKYRTLF NDVDIETEAE AVHNGYFSKD RKGKVKDTRG NTQADEDTYN LIMKDKERLL DFNSKLKFIF SHSALKEGWD NPNVFQICTL NETKSEIKKR QEIGRGLRLA VDQNGERRHG FNINTLTVMA NESYEDFAKA LQKEIEEEEG IKFGVVEKHT FANLKVEREG EYQYLEQNAS EELWNDLKSK EYIDDQGKIT DKLKEDIKNK NFEVPEEYKE VEDQVVATLK KIAGSLRINN ADDKKEIKLN KQRYLSPEFK ELWDRIKYKT TYNVEFDTEE LIQECVEEIK KNLMIDKAKV IYTKGEVDIS AAGTVAEEKG RYAMVVDDAK FRLPDIITYL QNETSLTRKT IVRILKESGK LYQFKNNPQK FMDEVSKIIK TKMRHLIVDG IKYEKIGEEA YYAQELFENE ELFGYLSKNL VKSEKSVYDH VICDSDVEAD FAQKFENNDL VKVYAKLPDW FKIDTPLGDY NPDWAVLIDK DGEERLYFVV ETKGSVLFEE LRPREEGKIK CGEKHFEALG NHIEFEKKDN FEEFIENV
|
| |