Gene Nmar_1391 details


Gene Information

Locus tag: Nmar_1391
Symbol:
ID: 5774546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1273832
End bp: 1275244
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 37%
IMG OID: 641317037
Product: Pre-mRNA processing ribonucleoprotein
Protein accession: YP_001582725
Protein GI: 161528899
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog
TIGRFAM ID:
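
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below is an illustration only; it assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 1273832
end_bp = 1275244

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1413
print(protein_length)  # 470
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.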


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATTCTG TGATTTTAAC AGAATTGGGA ATCTCAGTTT TTGATGATGA AAAACTAGAA 
AAATCATTTT CATTTTCAAA TCCTGTAAAA GAGTATCTCT CAATAAAAAA TAAAGAATCA
AAATTAAACG AGGTTATCAA TTATTTGGCT TCAATTCAAA GAGGAGTCTC AGTAAGTGAT
GATTCACTAC TTGCTATTTT GAAAAAAAAT AATATAGATT CTCAAATTAT GGAAGATTCA
GAATTAGAGA GAATTCAAGC ATCAAAACCA CAAATCATTG TTGATTCAGG TTTTGCATCA
AATCCTCAAG ATGCTTTAGG AAAGTTAAGA GAATTTGCTT TAGGATTATC ATCTTCAAAG
GTTACTGAAG TTTCTGAAAG TCCAGATCTT CATATCATTC AAGCAATCAA TTCACTAGAT
GAAATTGACA AAATTGCAAA TGCTCTTAGT TCAAGATTAA GAGAATGGTA TGGATTACAT
TTTCCAGAAT TAGATAACAT CATTGATAGT ATTAATGGAT ATGCACAAAT TGTTATGGCA
GGAAAACGAG AGTCGCTAAC AAAACAAGTT TTTGAAGATG CAGGTTTTCC AGAATCAAAA
GTAGAGATGT TGTCTTTAAT TTCTACAAAA AGCAGAGGGG GAGATATCTC TGATGTAAAT
CTTGCAATTG TTCAATCAAT TGCAAAACAA ATTTTAGATT TTCATGATTT GCGTAAAAAA
CTTGAAGAAC ATGTTGAATC AGAAATGGAA ACTATTGCAC CAAATCTTTC TGCAATACTT
GGTACAACAG TAGGTGCTAG AATTTTAGGA AGAGCTGGCA GTCTTAAGAG ATTAGCTTCA
CTTCCAGCAA GCACAATTCA AGTTCTTGGA GCAGAAAAAG CATTGTTTAG ATCATTGAAA
ACTGGTTCTC AACCACCAAA ACACGGACTA TTATTCCAAC ATGCAATGGT TCATGCTGCA
CCTAGATGGC AAAGAGGAAA AATTGCACGT GCCATTGCTG CAAAAGCAGT CATTGCTGCT
AGAGTTGACG TCTACGGAGA GGGCCTAAAC AGTACACTAC TTGAGAAACT CAATGTCAGG
GTTGATGAAA TTGGAAAGAA ATACGAAAAC CCAACTGAAA AAGATACCAG AAGACCAGAA
TCATTTAGAC GAGATGGAGG CAATTTCAGA GATAATCGCA GACGTAGAGA TGACAGAGGC
GGACGTAGAG ATGACAGAGG CGGACGTAGA GATGACAGAG GCGGACGTAG AGATGACAGA
GGCGGACGTA GAGATGACAG AGGCGGACGT AGAGATGACA GAGGCGGACG TAGAGATGAC
AGAGGCGGAC GTAGAGATGA CAGAGGCGGA CGTAGAGATG ACAGACCAAG TTCAAATAAA
AATAAAAAGA GAAAACAGTT TGGAAGAAGA TAA
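
A GC content figure like the one in the Gene Information section can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch; the function name `gc_content` is hypothetical, and the example uses only the first row of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above, kept in its block layout.
first_row = "ATGTATTCTG TGATTTTAAC AGAATTGGGA ATCTCAGTTT TTGATGATGA AAAACTAGAA"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the complete gene sequence to the same function gives a value that can be compared with the GC content field.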
 
Protein sequence
MYSVILTELG ISVFDDEKLE KSFSFSNPVK EYLSIKNKES KLNEVINYLA SIQRGVSVSD 
DSLLAILKKN NIDSQIMEDS ELERIQASKP QIIVDSGFAS NPQDALGKLR EFALGLSSSK
VTEVSESPDL HIIQAINSLD EIDKIANALS SRLREWYGLH FPELDNIIDS INGYAQIVMA
GKRESLTKQV FEDAGFPESK VEMLSLISTK SRGGDISDVN LAIVQSIAKQ ILDFHDLRKK
LEEHVESEME TIAPNLSAIL GTTVGARILG RAGSLKRLAS LPASTIQVLG AEKALFRSLK
TGSQPPKHGL LFQHAMVHAA PRWQRGKIAR AIAAKAVIAA RVDVYGEGLN STLLEKLNVR
VDEIGKKYEN PTEKDTRRPE SFRRDGGNFR DNRRRRDDRG GRRDDRGGRR DDRGGRRDDR
GGRRDDRGGR RDDRGGRRDD RGGRRDDRGG RRDDRPSSNK NKKRKQFGRR
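
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in their permitted start codons). The function name `translate` is hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTATTCTG TGATTTTAAC AGAATTGGGA"))  # MYSVILTELG
```

The output matches the first ten residues of the protein sequence listed above.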