Gene Nmar_0284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0284 
Symbol 
ID5773304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp249075 
End bp250628 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content38% 
IMG OID641315908 
Productproton-translocating NADH-quinone oxidoreductase subunit M 
Protein accessionYP_001581618 
Protein GI161527792 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.06102e-08 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGAATACG CATTATTACA GGCAGTTTTC TTGCCACTAC TCTTATCTCC AATAGCCTAC 
ATCTTGGGAA GAAAAGTAGG ACCAACTCCT GCCATGTGGT TCACATTTGC AATATTACTT
TACACCACAA TCCTTGTAGT TAATGCAGCA TTATCTGGAA CTGTAGAAGA ACACTATCCA
TGGACTGAAC AATTTGGTGA ATTTGGTTTC TTACTTGATG GTTTGGCATC TCCATTTGCA
ATAATCATCT ATGTGTTATC CACAATTTTG GTACTTTATT CAAAACCATA CATGATTCAC
AAATTCCATG AACAATTTGA AGAAGAACAA AAAATCAATC CTTCTTCAAG TGGACAAAGT
TCTGTTGTTG AATCCTCTGC TCTTTCTAAT TATGTAAATG CAAAATCTGG TCTTTACTTT
GCACTTTATC TTGTATTTGC AATGGGAATG CTTGGAACTG TCATGTCCAC AAACCTGATT
GAATTTTACA TATTCTTTGA GGTTATGTTG ATTCCTGGTT TCTTCTTAGT TGCTCTTTGG
GGTGATGGTC CAAGAAGAAA GATTGGTTTA ATGTTCCTCT TTTGGACTCA CGCTGGTGCA
GTTGTTTTAC TATTAGGATT CTTAATGATT GGACTGACGA TTGGTAGCTT TGATTTTGCT
GATATCAAAG AATCTGAAAT TCCTCAAGAT GTCGTAATGA TTTCTGCAAT TGCTATTGCG
ATTGGTCTTG GAGTAAAACT GGCTGTCTTT ATGTTCCATG TTTGGTTACC TTATGTCCAC
GGTTCAGCAC CTACACCAAT TAGTGCACTT TTGTCCCCTG CAATGATCGG AATTGGTGCA
TATGGTGTCT TTAGATTAAT TGTTGAATTC TTACCGTCCA CATTTGCAGA GTTGTCTATT
TGGTTCCACA TTTGGGGTCT TGTTACAATG CTTTATGGTG GTGCAATGGC CTTGATGCAA
GATGATCTAA AACGATTACT TGCTTATTCT AGTATCAGTC AGATGGGATA CCTGCTATTT
GGTATTGGCT CCATGTCTGC AATGGGTCTT GCAGGTGCTG AGATGATGTA CATCACTCAC
GGACTTGGAA AAGGTATTCT CTTCATGATG GCTGGAATTA TAATTGTCAA AGTCGGTACA
CGTAGTATCT CTAAACTTGG TGGTCTTGCT GGAAAGATGC CAATTACTGC AGTTTGTGCA
GTTATTGGTG CACTTACAAT TATGGGTGTT CCACCAACCA GTGGATTTAT GGGAGAATGG
ATTCTATTTT ACGGAGCATT AGAAACTGCT ATTGAAGAAG GTTCTACACT TAGAGCAGTA
ACATTTGGTC TTGGACTTGT TGCAACTGCA CTAACAATGG CTTACATGTT ATGGATGCTA
AAACGTGTCT TCTTTGGTAA GACACCTGAA CATCTAGAGA AAGTCAAAGA AGGAAGTTGG
TATATGACAG CACCAATGAT GGTACTAGCT GGATTTACTG TTGTACTTGG AATTTATCCA
GATATTTTCT TGAAGACAAT CTTACCTTAC ATGAACGGAG TGTTGGGAAT CTAA
 
Protein sequence
MEYALLQAVF LPLLLSPIAY ILGRKVGPTP AMWFTFAILL YTTILVVNAA LSGTVEEHYP 
WTEQFGEFGF LLDGLASPFA IIIYVLSTIL VLYSKPYMIH KFHEQFEEEQ KINPSSSGQS
SVVESSALSN YVNAKSGLYF ALYLVFAMGM LGTVMSTNLI EFYIFFEVML IPGFFLVALW
GDGPRRKIGL MFLFWTHAGA VVLLLGFLMI GLTIGSFDFA DIKESEIPQD VVMISAIAIA
IGLGVKLAVF MFHVWLPYVH GSAPTPISAL LSPAMIGIGA YGVFRLIVEF LPSTFAELSI
WFHIWGLVTM LYGGAMALMQ DDLKRLLAYS SISQMGYLLF GIGSMSAMGL AGAEMMYITH
GLGKGILFMM AGIIIVKVGT RSISKLGGLA GKMPITAVCA VIGALTIMGV PPTSGFMGEW
ILFYGALETA IEEGSTLRAV TFGLGLVATA LTMAYMLWML KRVFFGKTPE HLEKVKEGSW
YMTAPMMVLA GFTVVLGIYP DIFLKTILPY MNGVLGI