Gene Nther_0365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0365 
Symbol 
ID6316198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp381686 
End bp384622 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content32% 
IMG OID642642750 
Producttype III restriction protein res subunit 
Protein accessionYP_001916550 
Protein GI188585005 
COG category[V] Defense mechanisms 
COG ID[COG3587] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.000898416 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAATTAC AATTTAATCC TAACTTAGAC TTTCAACAAG AGGCTATTAG ATCAATTGTA 
GATATATTTG AAGGACAACC TATTACGCAT TCAAACTTTA CTGTTGCTAA TCTATCAGGA
CAAATTGGTA TTCATGAAAC AAATATAGGG GTTGGGAATA AATTAGACCC TAGCTTTGAT
GAAGAAGATA TACTTAAAAA TGTTAGAAAG ATTCAATTAA GAAATGGATT ACCACAGACT
GAAAATATCG AAAAAGATGA CTATCATTTT ACAGTAGAAA TGGAAACGGG AACGGGTAAG
ACCTATGTTT ATTTGAGAAC GCTATTTGAA CTTAATCAGA AATATGGATT TAAGAAATTT
ATTATTGTTG TTCCTTCAGT TGCTATTAAA GAAGGTGTGG TTAAATCTAT AAATATTATG
TCAGATCATT TCAAGCTCTT GTATGACAAT GTGATGTTTA GAGCTTATGA ATATCAATCC
CAAAATATTG AAAGAATCAG AGACTTTGCC ACTAGTGACC ATATTCAAAT TATGGTTATG
ACTATTCAGT CTTTTAATAA GGATAAGAAT GTCATAAATA ACGACCATGA AAGGACTAAT
GGATTAAAAC CAATAGAATT TATACGGGAT ACTAACCCTA TTGTAGTAAT AGATGAACCT
CAGTCAACTG TTTCTACCAA AAAAGCAGAA GATGCGGTTA TGTCTTTAAA TCCTTTGTGC
ACATTGAGAT ATTCTGCTAC TCACAGGAAG AAACATAACT TAATGTATAA ATTAGATGCA
GTGGATGCTT ATCAAAGACA GCTTGTAAAA CAAATCGAAG TAGCTAGTGT CACATCAAAA
GATTACCATA ATGACGCTTA TTTGCGGTTA GTAAGTGTAG ATAATAGTAA GACACCAATT
ACAGCAAAAA TTGAAATTGA TAAGCGGACT AAAAACGGAG GGATTAAAAG ACAAAGTGTA
CAAGTTAAAA AAGGTGATGA CTTATTCGAA AAATCTGGAG GTCGTGAACA GTATAGTGGA
TATATTGTGA GTGAGATATA TGCTAAAGAA GGTTCTGAAT ATGTTGATTT TACAAGTCGA
AAACATATTG AATTGGGTGA AGTCAGAGGT GAGCTAGACG ATGAAGTAAT TAAGCGTACT
CAGATTAGGA AGACCATTGA AGAGCATTTG GAGAAAGAAT TGAGACTTAA ACAAGAAGGT
ATTAAAATTT TGAGTCTATT CTTTATAGAT AGAGTTTCTA ATTACAGATA TTACGATGAA
GAAGGAAATC CCCAGAAAGG GAAATACGCT ATTTGGTTTG AAGAGGAATA TAAGGATATT
ATTCAAAAGC CAAAGTATAG AACATTGTTT AATGACGTTG ATATTGAAAC GGAAGCTGAA
GCTGTTCATA ACGGGTATTT TTCAAAAGAT AGAAAGGGTA AAGTAAAAGA TACCAGAGGT
AATACCCAGG CTGATGAAGA TACTTATAAC CTGATAATGA AAGATAAAGA GAGATTACTT
GACTTCAACT CAAAACTCAA ATTTATCTTT TCACATTCAG CTCTAAAAGA AGGTTGGGAC
AACCCCAATG TTTTCCAAAT TTGCACTTTG AATGAAACAA AATCTGAAAT AAAGAAAAGA
CAAGAAATCG GTAGAGGACT TAGGTTAGCA GTAGATCAAA ATGGTGAGAG AAGACATGGC
TTTAATATAA ACACTCTAAC TGTAATGGCT AATGAATCTT ATGAAGACTT TGCAAAAGCT
CTTCAGAAAG AAATTGAGGA AGAAGAAGGC ATTAAGTTTG GAGTGGTTGA AAAACATACT
TTTGCTAATT TAAAAGTCGA GAGAGAGGGC GAATATCAGT ATCTAGAACA AAATGCTTCC
GAAGAATTAT GGAATGATCT GAAATCTAAA GAATATATTG ATGATCAAGG CAAAATTACC
GACAAATTAA AAGAAGATAT AAAAAACAAA AATTTCGAGG TACCAGAAGA ATATAAAGAA
GTAGAGGACC AGGTAGTTGC AACTCTTAAA AAGATTGCAG GAAGTCTGCG TATCAATAAC
GCCGATGATA AGAAGGAAAT CAAATTGAAT AAGCAAAGGT ATTTGAGCCC TGAATTTAAA
GAACTTTGGG ATAGGATTAA ATATAAAACT ACTTATAATG TTGAATTTGA TACGGAAGAA
CTGATTCAAG AATGTGTAGA AGAGATTAAA AAGAACTTAA TGATTGATAA AGCTAAAGTT
ATATATACAA AAGGAGAAGT AGATATTAGT GCAGCAGGGA CTGTAGCGGA AGAAAAAGGT
CGTTATGCTA TGGTAGTTGA TGATGCTAAA TTTAGACTTC CTGATATTAT AACTTACCTG
CAAAATGAAA CTAGCCTTAC AAGAAAAACC ATAGTACGGA TACTAAAAGA ATCAGGCAAA
CTTTATCAGT TTAAAAATAA TCCTCAAAAA TTTATGGATG AAGTTAGCAA GATTATTAAA
ACGAAGATGA GACATTTAAT TGTTGACGGG ATAAAATATG AAAAAATCGG TGAAGAGGCT
TACTATGCTC AAGAGCTATT TGAAAATGAA GAACTTTTTG GTTATCTTTC CAAGAATCTC
GTGAAAAGTG AAAAGTCTGT TTATGATCAT GTGATTTGTG ATTCTGATGT TGAAGCCGAT
TTTGCCCAAA AATTTGAAAA TAATGATCTA GTTAAAGTGT ACGCAAAGCT CCCTGATTGG
TTTAAAATAG ACACACCGTT AGGAGATTAT AATCCTGATT GGGCAGTATT AATTGATAAG
GACGGTGAAG AAAGGCTCTA TTTTGTAGTT GAAACTAAAG GAAGCGTTTT ATTCGAAGAA
TTAAGACCGA GAGAAGAAGG AAAAATCAAA TGTGGTGAGA AGCATTTTGA AGCACTAGGA
AATCACATTG AATTTGAGAA AAAAGATAAT TTTGAAGAGT TTATTGAGAA TGTTTGA
 
Protein sequence
MKLQFNPNLD FQQEAIRSIV DIFEGQPITH SNFTVANLSG QIGIHETNIG VGNKLDPSFD 
EEDILKNVRK IQLRNGLPQT ENIEKDDYHF TVEMETGTGK TYVYLRTLFE LNQKYGFKKF
IIVVPSVAIK EGVVKSINIM SDHFKLLYDN VMFRAYEYQS QNIERIRDFA TSDHIQIMVM
TIQSFNKDKN VINNDHERTN GLKPIEFIRD TNPIVVIDEP QSTVSTKKAE DAVMSLNPLC
TLRYSATHRK KHNLMYKLDA VDAYQRQLVK QIEVASVTSK DYHNDAYLRL VSVDNSKTPI
TAKIEIDKRT KNGGIKRQSV QVKKGDDLFE KSGGREQYSG YIVSEIYAKE GSEYVDFTSR
KHIELGEVRG ELDDEVIKRT QIRKTIEEHL EKELRLKQEG IKILSLFFID RVSNYRYYDE
EGNPQKGKYA IWFEEEYKDI IQKPKYRTLF NDVDIETEAE AVHNGYFSKD RKGKVKDTRG
NTQADEDTYN LIMKDKERLL DFNSKLKFIF SHSALKEGWD NPNVFQICTL NETKSEIKKR
QEIGRGLRLA VDQNGERRHG FNINTLTVMA NESYEDFAKA LQKEIEEEEG IKFGVVEKHT
FANLKVEREG EYQYLEQNAS EELWNDLKSK EYIDDQGKIT DKLKEDIKNK NFEVPEEYKE
VEDQVVATLK KIAGSLRINN ADDKKEIKLN KQRYLSPEFK ELWDRIKYKT TYNVEFDTEE
LIQECVEEIK KNLMIDKAKV IYTKGEVDIS AAGTVAEEKG RYAMVVDDAK FRLPDIITYL
QNETSLTRKT IVRILKESGK LYQFKNNPQK FMDEVSKIIK TKMRHLIVDG IKYEKIGEEA
YYAQELFENE ELFGYLSKNL VKSEKSVYDH VICDSDVEAD FAQKFENNDL VKVYAKLPDW
FKIDTPLGDY NPDWAVLIDK DGEERLYFVV ETKGSVLFEE LRPREEGKIK CGEKHFEALG
NHIEFEKKDN FEEFIENV