Gene Information |
Locus tag | Nmar_0280 |
Symbol | |
ID | 5773320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 246489 |
End bp | 247787 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641315904 |
Product | NADH dehydrogenase |
Protein accession | YP_001581614 |
Protein GI | 161527788 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
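The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Nmar_0280.
length = gene_length_bp(246489, 247787)
print(length)                     # 1299
print(protein_length_aa(length))  # 432
```

Both results match the Gene Length (1299 bp) and Protein Length (432 aa) fields reported in the record.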
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0000247954 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAGCTC TGGCACCAAA ATTCAAACTA AGTGAGTTTA TCAAATCTCT TCTTGATAAT GCATTTTGGG TAGTTCTGCT AGTATCGTTA ATTGGATTAC CTGCAATTCA AGTAGCATTC TTCTACATTG AAATGCCTGT AATCAATGGC GAATTACTTA CACCATTCCT TGCAATGACT TGGTTGGCAG ATCCTACACG TAGTTTACCA ATTCTCAAAG CATTCATGGC AACTGATATT TTTAGAGTAA TGGCATTTCC AGGATTTGGA TTTGCAGCAC TCATAGCTGC TGCAACAATT TTTGTTGAAA GAAAAATGCT TGCTAAATTA CAACTTAGAG TTGGTCCCTT CTATTGTGGA AAGATTGAAG GTATTTTACA ATTAATGGGT GATGGTCTAA AACTAATTTC AAAGGAGATT ATTATTCCAG CAAAGGCTGA CAAGCCAATA TTTTGGGCAG CTCCTGTAAT CTTTGTTGCA ACCGCTGCAG CATTTGTTGC GTTAATCCCT GTAGCTCCTG GTTGGGTAGT TGCAGATGTA GACTTGGGAT TGCTAGGTGT TTTTGCAGTA ATTGGTTTCT TCCCAATCAT AACAATTCTT TCAGCATGGT CTGCTAACAG TAAATTCCCA TTCATTGGCG GTATTAGAGC ACTATTCCAG ATGGTCTCAT TTGAAATTCC ACTGATATTG TCCTTGTTGG GAGTTGTAAT TCTTACAGGT ACTCTTAACT TATCTGAAAT TGCAGCTAGC CAATCAAACT TCCCATGGAT TATATTTTTG CCAGTTGGCG CAATTGTATT CTTTATCACA ATGCTTGCAG AACTAGAAAG AATTCCATTT GATTTGCCAG AAGCAGAAAG TGAAATTGTT GCCGGTTGGT TAACTGAGTT CTCTGGAATG ATGTATGGTC TAGTTCAATT AGGAACCTAT CTGAAACTTT ATGCATTTGC AGGATTGTTT GTTGTTTTGT TCTTAGGTGG CTGGAACGGT CCAATGGTTG TTCCTCCATT CCCAGAAGAA TTCCTTACTG GAGTTGAAAT GGGTCCACTT ACTGTAGGTC CATTCCCTGG ATTGCCATTG TTTGATCAAG AAATGCTAAA TGGAACATTA TGGTTTGTTC TCAAAACTGT TGGAGTTATC TTCTTTATTC TCTTACCAAG AGGTGTCTTC CCAAGAATTA GAATTGATAT GTTGTTGAGT CTTGGTTGGA CCAAACTAAT TGGACTTGCT TTCGTTAACA TCTTTATTGC ACTAGGCTTG CTTTACGCTG GAGTCTTGGG ACCAGGAGGA TTACAATAA
|
Protein sequence | MSALAPKFKL SEFIKSLLDN AFWVVLLVSL IGLPAIQVAF FYIEMPVING ELLTPFLAMT WLADPTRSLP ILKAFMATDI FRVMAFPGFG FAALIAAATI FVERKMLAKL QLRVGPFYCG KIEGILQLMG DGLKLISKEI IIPAKADKPI FWAAPVIFVA TAAAFVALIP VAPGWVVADV DLGLLGVFAV IGFFPIITIL SAWSANSKFP FIGGIRALFQ MVSFEIPLIL SLLGVVILTG TLNLSEIAAS QSNFPWIIFL PVGAIVFFIT MLAELERIPF DLPEAESEIV AGWLTEFSGM MYGLVQLGTY LKLYAFAGLF VVLFLGGWNG PMVVPPFPEE FLTGVEMGPL TVGPFPGLPL FDQEMLNGTL WFVLKTVGVI FFILLPRGVF PRIRIDMLLS LGWTKLIGLA FVNIFIALGL LYAGVLGPGG LQ
|
| |
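The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in the set of permitted start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are hypothetical and written for this example only. The input is the first 30 bases of the gene sequence from the record, including the spaces that separate blocks of ten.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Nmar_0280 gene sequence as printed in the record.
print(translate("ATGTCAGCTC TGGCACCAAA ATTCAAACTA"))  # MSALAPKFKL
```

The output matches the first ten residues of the protein sequence in the record. Applying the same function to the last nine bases (`TTACAATAA`) gives `LQ`, which agrees with the C-terminal residues, with the final TAA read as the stop codon.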