Gene Nmar_1177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1177 
Symbol 
ID5774555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1075014 
End bp1076786 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content31% 
IMG OID641316820 
ProductpepF/M3 family oligoendopeptidase 
Protein accessionYP_001582511 
Protein GI161528685 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAAT ACCAGTTAGG AATGTGGAAT CTTTCAGAAT TAGCAAAAAA TCCAAAAAGT 
TCAGCATTTC AAAAACAGAT CAAAGAATTA GAAAATCAGG CTGAAAAATT TGAGAAAATT
AAATCAAAAC TGGACCCAAA AATGTCATCA AAGAAATTCA TGGAAATACT TAGTCAAGTA
GAAGAAATTT CTGAGAAAAT GAGTAAAATT GGTGGATATG CATCTCTTTC ATATTCTTCA
GATACACAAT CAGATGAAGC AACATCATTA ATGACTAGAA TGTCAAAATT AGGATCAGAC
ATTTCAAACA AAATATTGTT TTTTGATTTA TGGTGGAAAA CGCAAGTTGA TGAAAAAAAT
GCAAAGAGAT TGATTAAAGA TGCAGGAGAA CTCTCAGAAT ATCTAGCACA CAAAAGACTA
ATTGCAAAAT ACTCACTCAG TGAACCTGAA GAGAGAATCA TTAACACATT AGATGTTACA
GGTATTTCTG CACTTGTAAA ATTGTATGAC AAGATAACTA ATGCATTTGA ATACCAAATG
AAAATCGGAA ACAAAACAAA AAAGATGACA AGAGAAGAAT TAACAAATTA TGTTCGACAT
ACAAATCCAA AAATTCGTGA AACAGCTTAC AAAACAATCC TAGGAAAATA TAATGAAAAC
AAAGGTGTTG TTGGAGAAAT TTATCAAAAT ATTGCACTTA ATTGGAAAGA TGAAGGAATC
GATATTCGAG GCTACAGAAC GCCAATTTCT ATGAGGAATA TTGGAAATGA TGTAGATGAC
AAAACAATAG AATCACTACT TCTAGTTTGC AAAAAAAATG CTCCAGTCTT TCAAAAATTC
TTTGTGCAAA AAGCAAAGAT GCTCAAAATG AAAAAGCTTA GAAGATATGA CATCTATGCA
CCTGCTGCTG CAAACATTAA AGAAAAAAAT TATTCATACA ACAAATCTGT AAAACTAGTT
TTTGAATCAC TAGGCAAATT TAGTAACACA TTAGAAGATT TTGCAAGAAA GGTTTTCAAT
GAAAATCATA TTGACTCAGA AGTAAGACAA GGAAAAAGAG ATGGAGCATT TTGTAGTACA
TTAACACCCA AAATCACGCC TTATGTATTG GTCAATTTTA CAGGAAAATC AAGAGACGTA
TTTACATTAG CTCATGAGTT AGGTCACGCG GTCCACAGTC AAGCTGCACA AGATAGATCA
ATTCTAGTCC AAGATGCACC ATTACCATTA GCTGAAACAG CATCAACATT TTCTGAATTA
CTGCTTTATG ACAATATTTC AGACAAGATT TCAGATGATG AAAAGAAAAT AATGTTATCT
GAAAAAATTG ATGATTTGTA TGCAACAATT CTAAGACAAT CATTTTTTAC AATTTTTGAG
ATTGATGCTC ATAAACAAAT TGGCGAAGGA ACAACCATAG ATGAAATTTC AAAAACATAT
TTACAAAATC TCAAACAACA ATTTGGAAAA TCAGTTGATG TTACAGATGA CTTTGCAATA
GAATGGAGTT GTATCCCACA TTTCTATCAC ACACCATTTT ATTGCTATGC ATATTCATTT
GGAAATCTTC TTGCATTATC ATTATTCCAA AGATACAAAA AAGAAGGTAA AGACTTTGTT
CCAGCATACA TTGACATTCT TGCAGCAGGG GGTTCAAAAA AACCTGAAAA ACTCCTTAAA
GAACATGGAT TAGATATACA ATCTACCAAG TTTTGGCAAG AAGGTTTTGA TTATGTTAAC
GGACAGGTAA AAGCACTATC ATCACTAAAC TAG
 
Protein sequence
MSQYQLGMWN LSELAKNPKS SAFQKQIKEL ENQAEKFEKI KSKLDPKMSS KKFMEILSQV 
EEISEKMSKI GGYASLSYSS DTQSDEATSL MTRMSKLGSD ISNKILFFDL WWKTQVDEKN
AKRLIKDAGE LSEYLAHKRL IAKYSLSEPE ERIINTLDVT GISALVKLYD KITNAFEYQM
KIGNKTKKMT REELTNYVRH TNPKIRETAY KTILGKYNEN KGVVGEIYQN IALNWKDEGI
DIRGYRTPIS MRNIGNDVDD KTIESLLLVC KKNAPVFQKF FVQKAKMLKM KKLRRYDIYA
PAAANIKEKN YSYNKSVKLV FESLGKFSNT LEDFARKVFN ENHIDSEVRQ GKRDGAFCST
LTPKITPYVL VNFTGKSRDV FTLAHELGHA VHSQAAQDRS ILVQDAPLPL AETASTFSEL
LLYDNISDKI SDDEKKIMLS EKIDDLYATI LRQSFFTIFE IDAHKQIGEG TTIDEISKTY
LQNLKQQFGK SVDVTDDFAI EWSCIPHFYH TPFYCYAYSF GNLLALSLFQ RYKKEGKDFV
PAYIDILAAG GSKKPEKLLK EHGLDIQSTK FWQEGFDYVN GQVKALSSLN