Gene Nther_0806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0806 
Symbol 
ID6314488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp840455 
End bp841606 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content29% 
IMG OID642643180 
ProductMcrBC 5-methylcytosine restriction system component-like protein 
Protein accessionYP_001916980 
Protein GI188585435 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0894928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.000053751 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGGATA ATACTCTAAA GCTAAAAGAG TATGATGAAA GTAATAAACT TCAATTAAGA 
CAGGACCATC TGGATTTATT ACTAGCTAAA TTTGATAATC AACTAAAGGT CAAGTCAGCT
ATTGATAGAT CAGGATATAT TATATGTAGT AACAGTTATG TGGGAGTTTA CGATTTAGAT
GACTTTAAAA TCGTAGTAGA GCCTAAAATT GATACAGCCA ATGTGTTTAA AATGCTTTCC
TACTCTTATG ATTTAATTTT TTGGCATGAT GAAAAAGCAC AATTCGCTAA TATTCAGGAA
CTACTAGATT ATTTAGTGTT GGTATTTTGT AATCAGGTTA ATAGGCTTAT CAAAAAAGGA
CTACATGCTG ATTATGTACT TGTTAATGAT AAATTAAGCT ATGCTAAAGG GCGCATGAAT
GTTAGAGAAT TGGTAGAAAA GCCTTGGGAA AAGCATAAAA TTGATTGTTA TTATGACAAT
TATCAAGTTG ATATTTTAGA GAATCAAATT ATAAAGTTTA CAATTGATTT GTTAAAAAGG
TATATCCAAA ATAATTGGAT AAGAAGATCA CTGTTAAATA CAAACAGATA CTTTGATTCT
GTTTCCTTAA GGCCTATTAC AGTAGAAGAT ATTGATCAAG TTCAGTACAC CACATTAAAT
AAACATTACA AACACATTCA TAACTTCTGT AAAATGTTTT TAGAGTTAAT GGGTATAAAT
GAACAAATTG GAGAAACTCT TTTTAATCAG TTTCACTTAG AAATGAACAA CCTATATGAA
AAATATGTAG GGAAATTATT AAAGGAAGAG TTACCAAATA ATTATTGTGT TATTCTCCAG
GATAAGCTTC ACTTAGATGA ATATGATCAG ATAAGCATTA GGCCGGATAT TGTAATTTAT
AATGATGTAA AGCCTTATTT AGTTATTGAT ACCAAGTATA AGGGTTCCAA AGATATTACG
AATAATGACA TTTACCAAAT GGCAGCTTAT ATGAGTAAAA CAAAAACAGA TGGTGTATTA
TTGTATCCTG CTCAAGAAGT GGCTGAAACA GAATATATTA TAAATGGTAG GAGCCTCAAT
ATAAAAACTA TTGACTTACA AAACCTTGAT GATGGTGCAA AGGATTTAAT AAACTGGATA
ATTAAAGTTT GA
 
Protein sequence
MMDNTLKLKE YDESNKLQLR QDHLDLLLAK FDNQLKVKSA IDRSGYIICS NSYVGVYDLD 
DFKIVVEPKI DTANVFKMLS YSYDLIFWHD EKAQFANIQE LLDYLVLVFC NQVNRLIKKG
LHADYVLVND KLSYAKGRMN VRELVEKPWE KHKIDCYYDN YQVDILENQI IKFTIDLLKR
YIQNNWIRRS LLNTNRYFDS VSLRPITVED IDQVQYTTLN KHYKHIHNFC KMFLELMGIN
EQIGETLFNQ FHLEMNNLYE KYVGKLLKEE LPNNYCVILQ DKLHLDEYDQ ISIRPDIVIY
NDVKPYLVID TKYKGSKDIT NNDIYQMAAY MSKTKTDGVL LYPAQEVAET EYIINGRSLN
IKTIDLQNLD DGAKDLINWI IKV