Gene Nmar_1724 details


Gene Information

Locus tag: Nmar_1724
Symbol:
ID: 5774125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1580685
End bp: 1581749
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 34%
IMG OID: 641317378
Product: peptidase M24
Protein accession: YP_001583058
Protein GI: 161529232
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
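
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts one stop codon that is not part of the protein. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1580685
end_bp = 1581749
protein_length_aa = 354

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue, plus one untranslated stop codon.
expected_bp = (protein_length_aa + 1) * 3

print(gene_length_bp, expected_bp)  # 1065 1065
```

Both values agree with the listed gene length of 1065 bp.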


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Fosmid Coverage fields:
Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0292375
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAACAAC GTAGAAAAAA TCTACTAAAA CACGCCCAAA AGATCGGTTG TGATACATTA 
GTTACTTTTG AGCCTGAAAA CCTCTTTTAC ATGACTGGGT TTTGGGGCGA AGCAATAGGC
CTGTTAGAAA AAAATGGCAA AACCACCATT ATTGCACCTG AACTTGAGGT TGGAAGAGCA
AAAGATGAAT CTGAAGACTG TGATGTAATT ACAGCAGAAC GTGGAACTGG TCTTGTAACT
TCGCTTGTAA AGAAAATAAA GAAAAATCGC GTTTGTACTG ATTGCCAAAA TTACTCTATA
ATGACATCTT TGAAAAAATC TATTCCAAAA ATAAAATCCT CTACAGAACC ATTTTACAAC
GCTCGTATAA TCAAAGACGA AAATGAGATC AAAATCCTCA AAAAAGCATC CAAAATCATT
GATGAAATGT TTGAAACCTG TTCAAAAAAG ATCAAAGTGG GCCAAAAAGA GTCAGAATTA
CAAACAATTT TGATGACTTA TGCAATGGAG CAACAAATGT TTGATACTGG ATACAAATCT
ACTCTGAATC CTCTAATTAT CGCTGGAGGC CCCAATGGTG CATTGCCTCA TGCTCAAGTA
ACACAAAGGA AGTTCAAAAA AGGTGATCTT GTTGTAACTG ATCTTACACT AAGATACAAA
GGATATGTTT CTGATGCAAC AAGAACATTT GCAATAGGAA ATGTTTCATC GCAAACTAAA
GAAGCATATG AAATTGTTAA AGAATCTCAA AAACTTGGAT TAAAAGCTGT AAAACCAAAT
GCAAATTGTA AGGATGTTGA TTTTGCATGC AGAAAATACA TTGATGATAA AAATTATGGA
CAATACTTTA TTCATTCAAC TGGTCATGGA ATTGGATTGG AAGTTCACGA ACTTCCTACT
GTTTCATACA GGAGTGACAC AAAACTTAAA GAAAATATGG CAATTACTGT AGAACCTGGA
ATCTATATCG AAAATAAATT TGGAATACGA ATAGAAGATT CTTTGATTGT AAAGGAAAGA
CCTATTGTTA TGCACAAATT CACTAAAGAT TTAATCACAA TTTGA
 
Protein sequence
MKQRRKNLLK HAQKIGCDTL VTFEPENLFY MTGFWGEAIG LLEKNGKTTI IAPELEVGRA 
KDESEDCDVI TAERGTGLVT SLVKKIKKNR VCTDCQNYSI MTSLKKSIPK IKSSTEPFYN
ARIIKDENEI KILKKASKII DEMFETCSKK IKVGQKESEL QTILMTYAME QQMFDTGYKS
TLNPLIIAGG PNGALPHAQV TQRKFKKGDL VVTDLTLRYK GYVSDATRTF AIGNVSSQTK
EAYEIVKESQ KLGLKAVKPN ANCKDVDFAC RKYIDDKNYG QYFIHSTGHG IGLEVHELPT
VSYRSDTKLK ENMAITVEPG IYIENKFGIR IEDSLIVKER PIVMHKFTKD LITI
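
The relationship between the two sequences can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes that translation table 11 assigns amino acids the same way as the standard table and differs only in its start codons, and it handles only a subset of those start codons (ATG, GTG, TTG), reading a leading start codon as M. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the table 11 start codons is handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Drop the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(seq, is_gene_start=True):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon such as GTG is read as methionine.
        if i == 0 and is_gene_start and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 and last 15 bases of the gene sequence above.
first_block = "GTGAAACAAC GTAGAAAAAA TCTACTAAAA"
last_block = "TTAATCACAA TTTGA"

print(translate(first_block))                       # MKQRRKNLLK
print(translate(last_block, is_gene_start=False))   # LITI
```

The two fragments reproduce the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.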