Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0284 |
Symbol | |
ID | 5773304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 249075 |
End bp | 250628 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641315908 |
Product | proton-translocating NADH-quinone oxidoreductase subunit M |
Protein accession | YP_001581618 |
Protein GI | 161527792 |
COG category | [C] Energy production and conversion |
COG ID | [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) |
TIGRFAM ID | [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000000106102 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGAATACG CATTATTACA GGCAGTTTTC TTGCCACTAC TCTTATCTCC AATAGCCTAC ATCTTGGGAA GAAAAGTAGG ACCAACTCCT GCCATGTGGT TCACATTTGC AATATTACTT TACACCACAA TCCTTGTAGT TAATGCAGCA TTATCTGGAA CTGTAGAAGA ACACTATCCA TGGACTGAAC AATTTGGTGA ATTTGGTTTC TTACTTGATG GTTTGGCATC TCCATTTGCA ATAATCATCT ATGTGTTATC CACAATTTTG GTACTTTATT CAAAACCATA CATGATTCAC AAATTCCATG AACAATTTGA AGAAGAACAA AAAATCAATC CTTCTTCAAG TGGACAAAGT TCTGTTGTTG AATCCTCTGC TCTTTCTAAT TATGTAAATG CAAAATCTGG TCTTTACTTT GCACTTTATC TTGTATTTGC AATGGGAATG CTTGGAACTG TCATGTCCAC AAACCTGATT GAATTTTACA TATTCTTTGA GGTTATGTTG ATTCCTGGTT TCTTCTTAGT TGCTCTTTGG GGTGATGGTC CAAGAAGAAA GATTGGTTTA ATGTTCCTCT TTTGGACTCA CGCTGGTGCA GTTGTTTTAC TATTAGGATT CTTAATGATT GGACTGACGA TTGGTAGCTT TGATTTTGCT GATATCAAAG AATCTGAAAT TCCTCAAGAT GTCGTAATGA TTTCTGCAAT TGCTATTGCG ATTGGTCTTG GAGTAAAACT GGCTGTCTTT ATGTTCCATG TTTGGTTACC TTATGTCCAC GGTTCAGCAC CTACACCAAT TAGTGCACTT TTGTCCCCTG CAATGATCGG AATTGGTGCA TATGGTGTCT TTAGATTAAT TGTTGAATTC TTACCGTCCA CATTTGCAGA GTTGTCTATT TGGTTCCACA TTTGGGGTCT TGTTACAATG CTTTATGGTG GTGCAATGGC CTTGATGCAA GATGATCTAA AACGATTACT TGCTTATTCT AGTATCAGTC AGATGGGATA CCTGCTATTT GGTATTGGCT CCATGTCTGC AATGGGTCTT GCAGGTGCTG AGATGATGTA CATCACTCAC GGACTTGGAA AAGGTATTCT CTTCATGATG GCTGGAATTA TAATTGTCAA AGTCGGTACA CGTAGTATCT CTAAACTTGG TGGTCTTGCT GGAAAGATGC CAATTACTGC AGTTTGTGCA GTTATTGGTG CACTTACAAT TATGGGTGTT CCACCAACCA GTGGATTTAT GGGAGAATGG ATTCTATTTT ACGGAGCATT AGAAACTGCT ATTGAAGAAG GTTCTACACT TAGAGCAGTA ACATTTGGTC TTGGACTTGT TGCAACTGCA CTAACAATGG CTTACATGTT ATGGATGCTA AAACGTGTCT TCTTTGGTAA GACACCTGAA CATCTAGAGA AAGTCAAAGA AGGAAGTTGG TATATGACAG CACCAATGAT GGTACTAGCT GGATTTACTG TTGTACTTGG AATTTATCCA GATATTTTCT TGAAGACAAT CTTACCTTAC ATGAACGGAG TGTTGGGAAT CTAA
|
Protein sequence | MEYALLQAVF LPLLLSPIAY ILGRKVGPTP AMWFTFAILL YTTILVVNAA LSGTVEEHYP WTEQFGEFGF LLDGLASPFA IIIYVLSTIL VLYSKPYMIH KFHEQFEEEQ KINPSSSGQS SVVESSALSN YVNAKSGLYF ALYLVFAMGM LGTVMSTNLI EFYIFFEVML IPGFFLVALW GDGPRRKIGL MFLFWTHAGA VVLLLGFLMI GLTIGSFDFA DIKESEIPQD VVMISAIAIA IGLGVKLAVF MFHVWLPYVH GSAPTPISAL LSPAMIGIGA YGVFRLIVEF LPSTFAELSI WFHIWGLVTM LYGGAMALMQ DDLKRLLAYS SISQMGYLLF GIGSMSAMGL AGAEMMYITH GLGKGILFMM AGIIIVKVGT RSISKLGGLA GKMPITAVCA VIGALTIMGV PPTSGFMGEW ILFYGALETA IEEGSTLRAV TFGLGLVATA LTMAYMLWML KRVFFGKTPE HLEKVKEGSW YMTAPMMVLA GFTVVLGIYP DIFLKTILPY MNGVLGI
|
| |