Gene Nmar_1533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1533 
Symbol 
ID5773286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1396322 
End bp1397305 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content37% 
IMG OID641317184 
Productmetalloendopeptidase glycoprotease family 
Protein accessionYP_001582867 
Protein GI161529041 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000105035 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGTTAGGTT TAGGAATAGA AAGTACTGCT CATACCTTCT CATGTGCAGT TATAGAAATG 
AAGGGAAAGA AAGGAAAAAT TTTATCTGAT GTTCGTAAAA TTTATCGTCC TGCTGATGGA
GAGGGAATTC ATCCACGAGA GGCTTCAAGA CACCACATTG AAAATAGTTC TCTAGTATTG
TCTGAATGTC TTGATGAGGC AAATATCAAA GTAAATGACT TGGATATTGT ATCTTATGCT
GGTGGTCCTG GTTTAGGACC TTGTTTACGT GTTGGCGCTG TAGTGGCAAG ATCATTGGCA
TCTTTTTACA AAATTCCAAT TTATCCTGTC AATCATGCAT TGGGTCATAT AGAATTAGGA
AAGTTGCTAA CTGGTGCAAC CAATCCTTTA GTCCTCTTAG TCTCTGGAGG TCACACAATG
CTTTTGGCAT TTTTAAATAA ACAATGGAGA GTGTTTGGTG AAACTTTGGA TATTACTTTG
GGGCAACTAC TTGATCAGTT TGGAAGATCA ATTGGTTTTG CTTCTCCTTG TGGAAAAAAT
ATTGAGGAAT TGGCAACAAC ATCTTCTAAC TATGTTACAT TGCCATATTC TGTAAAGGGA
AATGATGTCT CATTTTCTGG ATTGCTATCT GCAACAAAAT CAGTAGCAAA AAAAAGTAAA
GTTGATGCAT GCTATTCTCT TCAAGAAACT GCTTTTGCAA TGATAGCAGA AGCTGTAGAA
CGTGCCTTGT CTTTTACGAG AAAAAAAGAA CTGATGATTG TAGGTGGTGT TGCAGCTAAC
AAACGATTAT CTGAAATGCT ACAAGATGTC TGTAAACGAC ATGGTGCAAA ATTCTTTGTT
GTCCCTTTGA AATATGCTGG GGATTGCGGT AGCCAAATAT GTTGGACTGG ACTTTTAGAA
TCTCAAATCA AGAAAGGCGT GTCATTAAAA GATACTTTTG TTACTCAGTC TTGGAGATTA
GATACTGTTA AAGTGAATTA CTAA
 
Protein sequence
MLGLGIESTA HTFSCAVIEM KGKKGKILSD VRKIYRPADG EGIHPREASR HHIENSSLVL 
SECLDEANIK VNDLDIVSYA GGPGLGPCLR VGAVVARSLA SFYKIPIYPV NHALGHIELG
KLLTGATNPL VLLVSGGHTM LLAFLNKQWR VFGETLDITL GQLLDQFGRS IGFASPCGKN
IEELATTSSN YVTLPYSVKG NDVSFSGLLS ATKSVAKKSK VDACYSLQET AFAMIAEAVE
RALSFTRKKE LMIVGGVAAN KRLSEMLQDV CKRHGAKFFV VPLKYAGDCG SQICWTGLLE
SQIKKGVSLK DTFVTQSWRL DTVKVNY