Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1391 |
Symbol | |
ID | 5774546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1273832 |
End bp | 1275244 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641317037 |
Product | Pre-mRNA processing ribonucleoprotein |
Protein accession | YP_001582725 |
Protein GI | 161528899 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATTCTG TGATTTTAAC AGAATTGGGA ATCTCAGTTT TTGATGATGA AAAACTAGAA AAATCATTTT CATTTTCAAA TCCTGTAAAA GAGTATCTCT CAATAAAAAA TAAAGAATCA AAATTAAACG AGGTTATCAA TTATTTGGCT TCAATTCAAA GAGGAGTCTC AGTAAGTGAT GATTCACTAC TTGCTATTTT GAAAAAAAAT AATATAGATT CTCAAATTAT GGAAGATTCA GAATTAGAGA GAATTCAAGC ATCAAAACCA CAAATCATTG TTGATTCAGG TTTTGCATCA AATCCTCAAG ATGCTTTAGG AAAGTTAAGA GAATTTGCTT TAGGATTATC ATCTTCAAAG GTTACTGAAG TTTCTGAAAG TCCAGATCTT CATATCATTC AAGCAATCAA TTCACTAGAT GAAATTGACA AAATTGCAAA TGCTCTTAGT TCAAGATTAA GAGAATGGTA TGGATTACAT TTTCCAGAAT TAGATAACAT CATTGATAGT ATTAATGGAT ATGCACAAAT TGTTATGGCA GGAAAACGAG AGTCGCTAAC AAAACAAGTT TTTGAAGATG CAGGTTTTCC AGAATCAAAA GTAGAGATGT TGTCTTTAAT TTCTACAAAA AGCAGAGGGG GAGATATCTC TGATGTAAAT CTTGCAATTG TTCAATCAAT TGCAAAACAA ATTTTAGATT TTCATGATTT GCGTAAAAAA CTTGAAGAAC ATGTTGAATC AGAAATGGAA ACTATTGCAC CAAATCTTTC TGCAATACTT GGTACAACAG TAGGTGCTAG AATTTTAGGA AGAGCTGGCA GTCTTAAGAG ATTAGCTTCA CTTCCAGCAA GCACAATTCA AGTTCTTGGA GCAGAAAAAG CATTGTTTAG ATCATTGAAA ACTGGTTCTC AACCACCAAA ACACGGACTA TTATTCCAAC ATGCAATGGT TCATGCTGCA CCTAGATGGC AAAGAGGAAA AATTGCACGT GCCATTGCTG CAAAAGCAGT CATTGCTGCT AGAGTTGACG TCTACGGAGA GGGCCTAAAC AGTACACTAC TTGAGAAACT CAATGTCAGG GTTGATGAAA TTGGAAAGAA ATACGAAAAC CCAACTGAAA AAGATACCAG AAGACCAGAA TCATTTAGAC GAGATGGAGG CAATTTCAGA GATAATCGCA GACGTAGAGA TGACAGAGGC GGACGTAGAG ATGACAGAGG CGGACGTAGA GATGACAGAG GCGGACGTAG AGATGACAGA GGCGGACGTA GAGATGACAG AGGCGGACGT AGAGATGACA GAGGCGGACG TAGAGATGAC AGAGGCGGAC GTAGAGATGA CAGAGGCGGA CGTAGAGATG ACAGACCAAG TTCAAATAAA AATAAAAAGA GAAAACAGTT TGGAAGAAGA TAA
|
Protein sequence | MYSVILTELG ISVFDDEKLE KSFSFSNPVK EYLSIKNKES KLNEVINYLA SIQRGVSVSD DSLLAILKKN NIDSQIMEDS ELERIQASKP QIIVDSGFAS NPQDALGKLR EFALGLSSSK VTEVSESPDL HIIQAINSLD EIDKIANALS SRLREWYGLH FPELDNIIDS INGYAQIVMA GKRESLTKQV FEDAGFPESK VEMLSLISTK SRGGDISDVN LAIVQSIAKQ ILDFHDLRKK LEEHVESEME TIAPNLSAIL GTTVGARILG RAGSLKRLAS LPASTIQVLG AEKALFRSLK TGSQPPKHGL LFQHAMVHAA PRWQRGKIAR AIAAKAVIAA RVDVYGEGLN STLLEKLNVR VDEIGKKYEN PTEKDTRRPE SFRRDGGNFR DNRRRRDDRG GRRDDRGGRR DDRGGRRDDR GGRRDDRGGR RDDRGGRRDD RGGRRDDRGG RRDDRPSSNK NKKRKQFGRR
|
| |