Gene Nther_0796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0796 
Symbol 
ID6314478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp830208 
End bp831533 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content33% 
IMG OID642643171 
Productrestriction modification system DNA specificity domain 
Protein accessionYP_001916971 
Protein GI188585426 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00343889 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGAAAT TTAAGCAATA CAAAAAATAC AAGGACTCTG GTATTGAATG GCTTGGAAAA 
GTTCCGAGTC ACTGGGACAT AAACCGGATG GATGCTTATA CAAAATATTA CAAGAAAAGT
ATTGAACGAG AAGCACTTCG CGGAAAGACA GTTTTTTACT ATAGTATTCC TGCAATTGAA
GAAACAGGCG ATGGTGTGGT AGAAGAAGGC TCAAACATTG ATAGTAATAA ATTACTACTT
AAGGGAGAAG AATTACTGGT ATCAAAGTTG AACCCAAGAA AAGGTAGAAT AATTCCTACT
AAAGAAAAAG AAATGCCAAT TATTTGCTCT TCAGAATTTG TTCCCCTGGT ACCCCGCAAT
TGTAGTAGGG AATTTATTAG ATATATATAT CAGTCAGAAC TAGTAAAACA AAAACTAAGT
AGTGCTGTAC AATCTGCTAC TAATAGTCAT CAAAGAGTTA ACCCTAGAGA TATATCAAAA
ATATATTTTG CGTTTCCAAG TAAAAGTGAA CAGGATAATA TAGTAAAATA CTTAAATTCA
AAAACATCTC AAATAGATTC CCTAATCAAC AAAAAACAAA ACCTCATCGA AAAACTCCAA
GAATACAAGC AATCCCTTAT AACCCACACC GTCACCAAAG GACTTGACCC CAATGTAAAA
ATGAAAGATT CGGGTGTTGA ATGGATAGGG GAAGTGCCGG AGCATTGGGA GATTTTAAAA
GGGAAATATC TGCTAGATAT TTATAACGGG TATCCTCCTG AAGAATTAAG TTTAAGTGCT
AATGGTCAGG TGAAATATAT TCAAGTGGAT GACTTAAACA CAGAAAATGA TGAATTAGTA
ATAAAAGATT CTAAGTTAAA ACTTAAAAAT AAAAAGACAG AAGCATTAGA TCACCCAATT
ATATTGATAC CTAAAAGAGG TGCTGCAATT TTTACAAATA AAGTCAAGAT TTTAGTTGAT
AAAGGACTTA TTGACTCTAA TATAATGGGG TTAAAACCTA AGAAAAACTG TAATATACAT
TATTTAGTCT ATATGATTAA GGCGAGGAAA GTAGATGATA TTGCGGATAC ATCTACAATA
CCTCAAATTA ACAATAAGCA TATTAACCCT CTACCACTAA CTATACCACC CATCGAAGAA
CAAAATAAAA TAGCAGAATA TCTAGATGAA AAAGTTGATA ATATAAATAA TTGTATTCTT
AATATAAAAG TAGCTATCCA AAAACTCAAA GAATACCGCC AATCCCTTAT CACCCACGCA
GTCACTGGCA AGATTGACGT CAGGGACTGG GCAGATGCAA AGGAAGGTGA AGATAATGTC
TGTTGA
 
Protein sequence
MEKFKQYKKY KDSGIEWLGK VPSHWDINRM DAYTKYYKKS IEREALRGKT VFYYSIPAIE 
ETGDGVVEEG SNIDSNKLLL KGEELLVSKL NPRKGRIIPT KEKEMPIICS SEFVPLVPRN
CSREFIRYIY QSELVKQKLS SAVQSATNSH QRVNPRDISK IYFAFPSKSE QDNIVKYLNS
KTSQIDSLIN KKQNLIEKLQ EYKQSLITHT VTKGLDPNVK MKDSGVEWIG EVPEHWEILK
GKYLLDIYNG YPPEELSLSA NGQVKYIQVD DLNTENDELV IKDSKLKLKN KKTEALDHPI
ILIPKRGAAI FTNKVKILVD KGLIDSNIMG LKPKKNCNIH YLVYMIKARK VDDIADTSTI
PQINNKHINP LPLTIPPIEE QNKIAEYLDE KVDNINNCIL NIKVAIQKLK EYRQSLITHA
VTGKIDVRDW ADAKEGEDNV C