Gene Nmar_0336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0336 
Symbol 
ID5773772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp298821 
End bp300401 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content36% 
IMG OID641315964 
Productradical SAM domain-containing protein 
Protein accessionYP_001581670 
Protein GI161527844 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCGGCA AACGTGTTGT ACTTACTGCT GATCGTAGTT TAATGACAAA TTACAGGGGA 
AACTTTCTGT ATGGATTTAT TGCATGTGGA CCATATGAAG TTCTGCCAGA ATGGGTTTTT
GACAAAGTGT TTTGTCCATC AGTTGAAACA GATCCAATCA CGGGGGAAGC AAAGGTTGCA
CAAATTGGAT TAAGAAGAAT TGAAAGTTCA TTGATTCAAG GAGGTTACAA TAGAGAAGAT
GTATTCATTG GACACCCAGA TATGTTGCAC AAATCAATTG GTCCAGATAC CAAAGTTGTA
GGAATCAATG TGATGGATCC ATTGGGAATG GCACCAGTTA CCACAACAAT GTCACCAGAA
AAATTGTCGT ATGTAGCAAT GAAATTTAAA AAAATGTGTG CAAGTATAAT TCAGCTCAAA
AAGAAATATG ATTTCAAAGT TGTTGTTGGA GGAAACGGAG CATGGGAATT AGCAAAATCA
GATAGAATGA AGATTCATGG AATAGACACA GTAGTAGTTG GAGAGGCAGA TGAATTAGCA
GTTGATTTGT TCCAAGATTT GGAGAAAAAT GATGCACCAG AATTGATGCA CTGTTTTGTA
AGAAACCTTG AAAATATTCC AGTTATTGAA GGTCCTACAA TCAACTCATT GATTGAAGCA
ATGAGAGGTT GTGGAAGAGG TTGTGATTTT TGTGATGTAA ATAAGAGATC AAAAAAAGAT
TTACCTATAG ATAGATTACA ACACGAAGCA AAAACTAATT TAGATTACGG TTTTGACTCA
ATCTGGTTAC ATTCTGATGA AATGTTACTT TATGGATGTG ATAACAGAGA CTTTGTTCCA
AACAGAGATG CAATTACAGA TTTGTGGAAG TCACTAAAAG GACTAGGTGC AAACTTTATT
GGAACTACAC ATATGACATT TTCTGCAGTT GCTGCAGATC CTACACTAAT GCAACAAATT
TCTCATGTAA ATGGACAGGA CCAATCAGGA AGATGGCTTG CAACCAATTT AGGAATTGAA
ACAGTTGCAC CAGATATGGT AAAAAAACAC CTAGGTGTTA AAACAAGACC ATTCTCAACA
GAAGAATGGG GCAGTGTAGT TAGAGAAGGT GCAAAAATTC TTAATGAGAA CCACTGGTTC
CCAGCAGCTA CAATCATTAT TGGTTGGCCA GATGAAACAC CGGATGATAT TCAATATACA
ATTGACATGA TGAGCGACTT TAGAGAAATG GACTTTAGAG GATTAGTAGC ACCATTATTG
TATCAAGATT TTAGTGAAAA GAATTCAATG CACTTTGGAA ACTTGAATGA AGCTCAATTT
ACACTATTTT GGAAATGCTG GGAAAACAAC CTTAGAGTAA TTAATGACAT TATTCCAATT
ATTCTCAGAA ACAAGACCTA CGGTCCACCA ATGAAAGTTT TCATGTATGG AATTTTGAAG
GCAGGAACTT GGGCAATTAT GAGATATCTC AGAGGATTGT GCAAGGATCT CTTTAATGGA
AGAACTCCTG ATGAGATAAT TGACAAATAT GCTAGAAGTA GATCAGTATC TGCTCCTAAA
ATTCAAACAA AGAAATTATA G
 
Protein sequence
MSGKRVVLTA DRSLMTNYRG NFLYGFIACG PYEVLPEWVF DKVFCPSVET DPITGEAKVA 
QIGLRRIESS LIQGGYNRED VFIGHPDMLH KSIGPDTKVV GINVMDPLGM APVTTTMSPE
KLSYVAMKFK KMCASIIQLK KKYDFKVVVG GNGAWELAKS DRMKIHGIDT VVVGEADELA
VDLFQDLEKN DAPELMHCFV RNLENIPVIE GPTINSLIEA MRGCGRGCDF CDVNKRSKKD
LPIDRLQHEA KTNLDYGFDS IWLHSDEMLL YGCDNRDFVP NRDAITDLWK SLKGLGANFI
GTTHMTFSAV AADPTLMQQI SHVNGQDQSG RWLATNLGIE TVAPDMVKKH LGVKTRPFST
EEWGSVVREG AKILNENHWF PAATIIIGWP DETPDDIQYT IDMMSDFREM DFRGLVAPLL
YQDFSEKNSM HFGNLNEAQF TLFWKCWENN LRVINDIIPI ILRNKTYGPP MKVFMYGILK
AGTWAIMRYL RGLCKDLFNG RTPDEIIDKY ARSRSVSAPK IQTKKL