Gene Nmar_0291 details


Gene Information

Locus tag: Nmar_0291
Symbol:
ID: 5774209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 257852
End bp: 258958
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 34%
IMG OID: 641315916
Product: peptidase M50
Protein accession: YP_001581625
Protein GI: 161527799
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0750] Predicted membrane-associated Zn-dependent proteases 1
TIGRFAM ID:
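
The coordinate, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
# Assumption: coordinates are 1-based and inclusive on the replicon.
start_bp = 257852
end_bp = 258958

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # assumption: the last codon is a stop

print(gene_length, protein_length)    # 1107 368
```

Both results agree with the Gene Length (1107 bp) and Protein Length (368 aa) fields.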


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.00037958
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACGAGG AATCTCAAGA CGACATAATT TCTTTAGTAA ATTCCATCTT TGATGTAAGT 
GATTTTATAA AAACTGAATT TTCAATGGAG TTTCGAATTG AAGATATTGA GTTCAAATCC
AAATTTGAAA AATTAGCAAG AAGATTAGAA GGAATGAGTT TTGCATGTAG ATTAGAGCAA
AAAGATGGTG GAAAGTTTGT TATTATTCAA AAGTTTGCGA TAAAAAAACA AAGAAGGTGG
ATGAAAACTG CATGGACACC AAGAGCTTTG TTTGCAATTG TAGTTGCATT TGTTATGGTT
GATGGATACT ATAGAACATC TGGAACAAAT TCTATTGTTG AAATTGGAGA ACCACTTGAG
ATGGCAGCAG TTTACACATT ATCTTTGCTA GGAATTTTAG GAATTCATGA ACTAGGACAC
ATAATTGCAG CAAAAGCCCA CAGATTAAAA ACTACATGGC CATACTTTAT TCCAGGTCTA
CCAGTAATAG GAATCCCAAC ATTTGGGGCA TTTATTCAAT CAAGGGGATT GACCATCAAC
AGAGAAATTT TGTTTGATGT TGCAATAGCC GGTCCAATAG CAGGATTAGT GATTGCAGTA
ATTGTTTCAA TATATGGAGC ATATACTGCA CCAATTTTAG AACCTGAAAT TGCTGCAGGG
TTATTTGAAG AATCTAGACT AATGGAATGG GAGCAAGGAG AGCCATTGTT AATGACTGCA
AGTCTTGCAA TGTTTGGAAA AGGAGGTTCA GGACATGAAG TAATTATGAC TCCAATAATG
TTTGCAGCAT GGATTGGATT TCTAATTACA TTTTTGAATT TACTTCCAGC ATGGCAACTA
GATGGAGGTC ATATGGCCAG AACTTTGTTG GGTCCAAAAT TACATAGATA TGCAACTTTT
GGCAGTATGG CAATTCTAGT TTTGTTAAAT TATTGGTTAA TGGCAATTTT AATTCTAATA
ATGAGTTCAA GAAATCCTAG TGCAATGCCA TTAGATGATA TTTCGCCACT TTCAAGAAAT
AGAAAATTAG CATATATTGG AATTATTGGA TTGGCAATTT TATGTGCACC ATTACCATCA
GATTTTTTGC CTAATTTCCT ACCTTAG
 
Protein sequence
MDEESQDDII SLVNSIFDVS DFIKTEFSME FRIEDIEFKS KFEKLARRLE GMSFACRLEQ 
KDGGKFVIIQ KFAIKKQRRW MKTAWTPRAL FAIVVAFVMV DGYYRTSGTN SIVEIGEPLE
MAAVYTLSLL GILGIHELGH IIAAKAHRLK TTWPYFIPGL PVIGIPTFGA FIQSRGLTIN
REILFDVAIA GPIAGLVIAV IVSIYGAYTA PILEPEIAAG LFEESRLMEW EQGEPLLMTA
SLAMFGKGGS GHEVIMTPIM FAAWIGFLIT FLNLLPAWQL DGGHMARTLL GPKLHRYATF
GSMAILVLLN YWLMAILILI MSSRNPSAMP LDDISPLSRN RKLAYIGIIG LAILCAPLPS
DFLPNFLP
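
The sequence blocks above are grouped in runs of 10 characters for display. The following is a rough sketch of how the gene sequence could be cleaned and translated to compare it with the protein sequence. The helper names `clean` and `translate` are hypothetical. The codon table is the standard one in NCBI order; translation table 11 uses the same amino acid assignments and differs only in the start codons it permits, so this sketch does not model alternative start codons.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: table 11 amino acid assignments match this table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence block, copied from the record.
first_line = "ATGGACGAGG AATCTCAAGA CGACATAATT"
last_line = "GATTTTTTGC CTAATTTCCT ACCTTAG"

print(translate(clean(first_line)))  # MDEESQDDII
print(translate(clean(last_line)))   # DFLPNFLP
```

The two outputs match the first 10 and the last 8 residues of the protein sequence. The last line can be translated on its own because the 18 preceding lines hold 1080 bases, a multiple of 3, so it starts in frame.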