Gene Nmar_1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1472 
Symbol 
ID5774667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1337614 
End bp1340079 
Gene Length2466 bp 
Protein Length821 aa 
Translation table11 
GC content34% 
IMG OID641317120 
Producthypothetical protein 
Protein accessionYP_001582806 
Protein GI161528980 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAAG AAAACATGTC TTCTGAGATA GGATTAGGAC TATCCCTTGC AGAAATGATA 
TTCAAGTTTG CTCAAAAATC ATTCAAGTGG AATTGGAATG AATATCGTAC AGAAATTATT
GATGAATTTG GCGCCAAATA TCCAGAAAGT AGAGATATAC TAATTGATGT TTTGTCAAAA
CAAGAATTTT CAGACAATTT CAAAAAATTA TCTGAAGGAT CTGTTCAAGA CTTCCTGAAC
CTGCTTAAAG GACTAGAAAA TGCATTTCAA GATACAGAAT ACAAATTTGA TTCAGAAAAA
CTGTTAACAG AAATAAAACA AAAAATTGAG ATGGAAGGAG CTGAAACATC TTTTCAGAAA
ATTATGACAG CATATAATCA AAAAATTCTG AATATTGCCA CAAAGTTAGA TAGTAAAATG
GATAAAGCAA TTGGAATACT TAAAGAAGTT CATGAACAAA CTGTTACAAA AGATTCTGGA
GACCAAGATC AACAATTTCA GCAAGGAAAT CTAAACCTTT TATCATATCT TAAAGAACTT
ACTACAAAAC TTGAAGCTAA ACCCCCACCA GAACATAAGT TTGGGATTAC ATTTTCTGAT
GAAAACGGAG AAAAATTTCC AATAACTGAA TTACATTCAA TCATAAAATC AAAAAGAAAA
GTAATCCTAA AAGGCGCTGC AGGAAGCGGA AAGAGTAACA CAGTTACAGA GTTTCTCAAA
TCTATTGTTA AAGAAAAGGT TCTTCCTGTT TTTATCCCCA TTAATGGAAG TGAAACAGAA
TTTTTTAAAG AATTGGAGGA TGCGAGAGAT GAAAGTGCTG AGTCACGCAT AGACAAACTG
CTGAAAATGT CTAGTGTGGA AATTACAGTG GAACGTCTCA AAAGTTTTGA GGGAGATGTC
ATAATAGTGA TTGATGGCAT AAATGAATTC AAATCAGGCG ATTTTGGTGA ATTAACAGAA
AAAATAATTC AAGATGTAGG AGAATATGTA AGACAAACTT CTCCTCACAC ATATGTTCTG
CTAACGGACA GGATGGATGT ACGTGGTGCA CTTACAGGAT GGAACAAAAT TACATTGAAT
CCTTTGAGTG AAGAAGAGAT AAGAAAAAAT CTAGACATTA CTATTGGTAA AGAACAATAT
GAGTTGAATG ATGTAAGCAT GAGTTTGTTA TCCAGACCAT TCTTTCTCAA TATTGCCATG
GAGAGATGTT CAGCTTCTAT GACTTCTGAA TCTTCCGTAC TAAATGATTT CTTCAAAGAG
AGATTGAAGA TGAATGAGGA GGAAATGAAG AAATTAGCAA AAGCGTCATT TTTTGCATAT
CAAGAAGGTT CGGTATCATT TGAGTTGAAA AACTTTGTAG AGATTACAGG ACAAGAGACA
TTTGATAAAC TGTTAGGTGG AGGTGCGGCA TTTACGACAA GTGACACCCA TGCAACATTT
AGTCATCAAT TATTACATGA TTTTCTTGTA TCCGAATATT TGGCATCACA CCCAGAGAAT
TGGGATGTAG ATGAATTTGA CATAGCTACT TTGGAGTCCA GTTCACTAGA ATCTATCCAT
CTGGTAATGC AACAGATCGA AAATAAAGAA CTTGCTGACA GATTTTTGTT GGAGGTATAT
GATTGGAATT GGAAAGCTGC AATTGCATCC ATCATATTAA ATTCCAAGTT AGGAATTAAC
AATCATTCTG AAGATATAGT TACTGCAATG TTAGTATTGG TTGCACAAAA ACAATTTGAT
AATGGTTATG AAACTTCTGA ATCTGCAAAA AATCACTTGG GAGATTATGA GAAAACCTTT
GGTGAACAAT TGACTAAAGC AAATAATCTT AAAGAATTAA TAGATATGAT AAGTGTCATG
AAATCTGAAT CAGATTTGTT TGATGAATGG AAACAATTGT TTACCATTAA ACGTGATTCA
GAAATCAATG ATGAAACTCT CAAGTTGTTA AAATCTGACA ATTCTTTGAT TGGTTGGACT
GCATCAAATG TTTTACGTGA ATGCAATCTA AGTGAGGAAA ACCAAAAAGA TCTAAGAGAC
ATGTATGATT ATTCTGATGC AGATGTTCCA AATCAGAACA CCCGTAGATG GAGGATTATT
CATCCACTTG GAAAGTGCCC CTCAGAGGAA AATAAAAAAC TACTATTCAT TGCTCTAGAA
ACAGACCCCT ATCATTGGGT ACGATATGGT GCTGCTCGTG CATTGGTGGA AATGGCTGCA
ATCACTGAAG ATGCGGATAT GCGTAAAGAT ATCCTAGACA AATTAGGAAA TATGGCTTCG
AAATTAAGGA AAAACATTCT TGATGAAATT GGAAAGACAG TCTTTTACAG AAATGCATAT
GGGCAATGGG ATGAGTCTGT GATTCCCTTG CTCGAAAAGG TAAAGGAATC AAAATCAGAT
CATGTGCATG ATGAAGATTG GGAAGAACGA TTATCAAAAT ACAAAAAGGG TGAATGGAAA
AGTTGA
 
Protein sequence
MNEENMSSEI GLGLSLAEMI FKFAQKSFKW NWNEYRTEII DEFGAKYPES RDILIDVLSK 
QEFSDNFKKL SEGSVQDFLN LLKGLENAFQ DTEYKFDSEK LLTEIKQKIE MEGAETSFQK
IMTAYNQKIL NIATKLDSKM DKAIGILKEV HEQTVTKDSG DQDQQFQQGN LNLLSYLKEL
TTKLEAKPPP EHKFGITFSD ENGEKFPITE LHSIIKSKRK VILKGAAGSG KSNTVTEFLK
SIVKEKVLPV FIPINGSETE FFKELEDARD ESAESRIDKL LKMSSVEITV ERLKSFEGDV
IIVIDGINEF KSGDFGELTE KIIQDVGEYV RQTSPHTYVL LTDRMDVRGA LTGWNKITLN
PLSEEEIRKN LDITIGKEQY ELNDVSMSLL SRPFFLNIAM ERCSASMTSE SSVLNDFFKE
RLKMNEEEMK KLAKASFFAY QEGSVSFELK NFVEITGQET FDKLLGGGAA FTTSDTHATF
SHQLLHDFLV SEYLASHPEN WDVDEFDIAT LESSSLESIH LVMQQIENKE LADRFLLEVY
DWNWKAAIAS IILNSKLGIN NHSEDIVTAM LVLVAQKQFD NGYETSESAK NHLGDYEKTF
GEQLTKANNL KELIDMISVM KSESDLFDEW KQLFTIKRDS EINDETLKLL KSDNSLIGWT
ASNVLRECNL SEENQKDLRD MYDYSDADVP NQNTRRWRII HPLGKCPSEE NKKLLFIALE
TDPYHWVRYG AARALVEMAA ITEDADMRKD ILDKLGNMAS KLRKNILDEI GKTVFYRNAY
GQWDESVIPL LEKVKESKSD HVHDEDWEER LSKYKKGEWK S