Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0806 |
Symbol | |
ID | 6314488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 840455 |
End bp | 841606 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642643180 |
Product | McrBC 5-methylcytosine restriction system component-like protein |
Protein accession | YP_001916980 |
Protein GI | 188585435 |
COG category | [V] Defense mechanisms |
COG ID | [COG4268] McrBC 5-methylcytosine restriction system component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0894928 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.000053751 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGGATA ATACTCTAAA GCTAAAAGAG TATGATGAAA GTAATAAACT TCAATTAAGA CAGGACCATC TGGATTTATT ACTAGCTAAA TTTGATAATC AACTAAAGGT CAAGTCAGCT ATTGATAGAT CAGGATATAT TATATGTAGT AACAGTTATG TGGGAGTTTA CGATTTAGAT GACTTTAAAA TCGTAGTAGA GCCTAAAATT GATACAGCCA ATGTGTTTAA AATGCTTTCC TACTCTTATG ATTTAATTTT TTGGCATGAT GAAAAAGCAC AATTCGCTAA TATTCAGGAA CTACTAGATT ATTTAGTGTT GGTATTTTGT AATCAGGTTA ATAGGCTTAT CAAAAAAGGA CTACATGCTG ATTATGTACT TGTTAATGAT AAATTAAGCT ATGCTAAAGG GCGCATGAAT GTTAGAGAAT TGGTAGAAAA GCCTTGGGAA AAGCATAAAA TTGATTGTTA TTATGACAAT TATCAAGTTG ATATTTTAGA GAATCAAATT ATAAAGTTTA CAATTGATTT GTTAAAAAGG TATATCCAAA ATAATTGGAT AAGAAGATCA CTGTTAAATA CAAACAGATA CTTTGATTCT GTTTCCTTAA GGCCTATTAC AGTAGAAGAT ATTGATCAAG TTCAGTACAC CACATTAAAT AAACATTACA AACACATTCA TAACTTCTGT AAAATGTTTT TAGAGTTAAT GGGTATAAAT GAACAAATTG GAGAAACTCT TTTTAATCAG TTTCACTTAG AAATGAACAA CCTATATGAA AAATATGTAG GGAAATTATT AAAGGAAGAG TTACCAAATA ATTATTGTGT TATTCTCCAG GATAAGCTTC ACTTAGATGA ATATGATCAG ATAAGCATTA GGCCGGATAT TGTAATTTAT AATGATGTAA AGCCTTATTT AGTTATTGAT ACCAAGTATA AGGGTTCCAA AGATATTACG AATAATGACA TTTACCAAAT GGCAGCTTAT ATGAGTAAAA CAAAAACAGA TGGTGTATTA TTGTATCCTG CTCAAGAAGT GGCTGAAACA GAATATATTA TAAATGGTAG GAGCCTCAAT ATAAAAACTA TTGACTTACA AAACCTTGAT GATGGTGCAA AGGATTTAAT AAACTGGATA ATTAAAGTTT GA
|
Protein sequence | MMDNTLKLKE YDESNKLQLR QDHLDLLLAK FDNQLKVKSA IDRSGYIICS NSYVGVYDLD DFKIVVEPKI DTANVFKMLS YSYDLIFWHD EKAQFANIQE LLDYLVLVFC NQVNRLIKKG LHADYVLVND KLSYAKGRMN VRELVEKPWE KHKIDCYYDN YQVDILENQI IKFTIDLLKR YIQNNWIRRS LLNTNRYFDS VSLRPITVED IDQVQYTTLN KHYKHIHNFC KMFLELMGIN EQIGETLFNQ FHLEMNNLYE KYVGKLLKEE LPNNYCVILQ DKLHLDEYDQ ISIRPDIVIY NDVKPYLVID TKYKGSKDIT NNDIYQMAAY MSKTKTDGVL LYPAQEVAET EYIINGRSLN IKTIDLQNLD DGAKDLINWI IKV
|
| |