Gene Information |
Locus tag | Nmar_0616 |
Symbol | |
ID | 5773001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 554316 |
End bp | 557606 |
Gene Length | 3291 bp |
Protein Length | 1096 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641316251 |
Product | hypothetical protein |
Protein accession | YP_001581950 |
Protein GI | 161528124 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
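The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Nmar_0616
length_bp = gene_length(554316, 557606)
print(length_bp)                  # 3291
print(protein_length(length_bp))  # 1096
```

Both results match the Gene Length (3291 bp) and Protein Length (1096 aa) fields in the record.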
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAGA CTCTGCATAT TGTATTTTTC ATATTCTTAC TTGTAGGGGT AATTGTTCCT GCATATGCTC AAACCGCAGA AAATGTTGTG ATTAACGAAG TTGATATTAA TCCTCCTGGC GATGATTCAA AATCCATTTC TGAGTGGGTT GAACTTTACA ATCCTACTGA TTCTGAAATT GATTTGAGTG GGTGGCAAAT TGCGTCAACA ACTGTTCTAA AGAAAACTAT GACAATTGGC TCTGGAACTA CTATTGAACC TGGACAATTT TTAACTTTTT CTTATCAAAG TGTTTGGTTT ACTGACATCA ATGAATCTGT TGAATTACGA GATGAAAATG GAATTGTAAT TGATAAGACT CCAATACTCT CAGATATTAA AAATGACTTT ACATCTTGGC AAAGGATCTA TGATGGTTAT GATTTGGACA ATCCTGATGA TTGGAAATTT GTAACATCTA CCTCTGGTTC TACTAATGGA AAACTAGTAC AAGAACAAAA ACAAGATGAA ATTTCTCTTT CTCTTTCTTC AGATAAATCA TTTTACTTGT TTGGAGAAAC TGCTGTAATT GAAGGAAGTG TTTCTAAAGC AGTATTTGTT GAGAAACCAT TCTTTCAACC TGAAGTAATT ACTGTAAAAA TTAGTGGTCC TAATTTTGAC AAAACATTAA CTTTGTACCC TGATCTAAAC AAAAATTTTA AAACAACTTT GGGGCTGCAA AAAGTTTTAG GAATCAATGA AGGCGATTAC ACAATAAATG CATCTTATGC AGGATCAACT GTTAATACGT CATTCTCAGT GGGATATGAA ATTACTGAAG AACAAGTACA ACAAGACAGT TTTCTCAATT TGAACACTGA CAAAACTCAA TACATTCCCG GTCAAATGGT TTCAATTACC GGTACTGCTA CCGATATTGT TGAATTTCAG GGGATGAAAT TCACTGTTAC TAATTCTGAG GGTACAATTG TGTATAATGG AAATTTGTTT CCTGTAAATG GTCAATTTAA AACCAGTATT TTCTTGTCAA CTGTAAATCC TGTATATGGT ACTTATGAAA TAATTGGTGA ATATGTTGAC AAATCTGTAA TCACAACATT TGAAGTAATA GAAGATGCAA AAGAAACTGT TCCAATATCT CTGTGGACTA ATAAGGATGT TTATGGAACT GGTGAAGTAG TAACCATAAC TGGAAGACTA AATGATGTGT GGGTTGCTTC CCTTGATTTA GAAATAGTAC AAACAAAGAA TCTTGCTTTG GGCACTGGCA GCCAACTTGG TGGTGGTAAT GTTCTAAAAA TTCTTGATGT TGTCCGAATT GACGGTGATG GTAAATTCAA GTATTCTTTT ACAATTCCTG ATGTGGATAC TCGATTAGGT GATTATAAAA TCAAAGTTTC TAAAGATATT GGCTCTGCAA AAAAGACTGT CATGGTAGTA AAAGAACCTG AAAACTTTGT CCCAATCACT GATCCACTAA TTGTTACAAC AAACAAGCTT GTTTATGATT TCACTTTGGA TAAAGAACTC GTAATCCGTG GTCAAATAAA GAACCCTGTA GATCGAACAA GTTTTGAAAC TCCTACCGTG TTAATTTCAT TTAAAGATGA AAACGGTAAA CCCCTTTCTA TTATTGGTGT TCCTGAGGGT GTTAATCAGG GAGCTGCTGG CGGCACAGGT AGTGTGACTG CAAAATATCA GTTCACTGCA ATACCTGAAT CTGGTGGAAC ATTTTCTGTA ACTGCTGACA TTAGTAGAGG TATATTTTCT GAAGGTACGT ACACTATAAC TGCACAATAT 
CTTGATCTTA CTTCAACAAC TTCGTTTGAT ATTGTTGATG ACTTAGCAGG CGGAGGTGTG GTATCTCTTG ATAAAGATGT TTATGGTTTA GGTGAACAGG TGGTTGTTAG TGGTATTATT CCGACAAGTG ATCGTTCTGT AACTATTTCT GTTACAAGAC CTGATGGTAC AAAAACCACA TATGGCGAAG CTGTTGATAA CCAAAGATTC TCTTGGTCTT GGACTACCCC TGTTTCAGAA CGATATCAAA CGCTGAAATC TGACGGTGAA CGTGGTGTTA CATTTTCAAA TTTCGGTATT TATAAAATCA AAGTAGCAGG TGATACATAC AGCAAAGATT TACTATTCAA AGTATCTACT GATCCTGAAA ATGATTCTTT ATCTGCCACC CCTCTTTTTA TAACTACAGA CAAATCTCTG TACCAAGCAG GAGATAAACT CAAAGTAATT GGAAATGTTA TTCCTCGTGA TCAAGGTGAT GAGGGATTAG TAGTTCCCGA CCGTGTAACT ATCAAAGTTT TGGATGGAAC ATTCCCGTAC AAGCAAATTC ATGAAGCATC TGTTTACCCA AAACAAGGTG GTGAATTTTC AAGCTTGTTT GAGTTACCTG CAACTATCTT TAGTGAGGGC ATGTATACTG TAAAAGCAAT TTATTCCACA AAGCAAGTAA CATCAACATT TAGTGTTGCT AATGATTTTA CATTTGGTAT TGATGAACCT GTATCGTTAT TAGCATCAAC TGATAAATCT GAATATTATC CTGGTGATAC CGTAATTATT TCTGGAAAAC CAAACAAATT GATTTATCTT GAAGCCTATG ATGTAAGTAT CATTAAAAAG TCTGATACTG AAATTACTTG TGGTTCTTTT ATTTGTGGAA CTCATGTTGG ACAAGTAAAA TCCATTCGTC CTAGCCCATC TGGTTCTTTT ACTCATGAAT TCCCAATCAA GAATACATTA TCTTCAATTG GAACATATGA AGTAACTATT GATGCTGACT TTGAAACAAA ACATATTAGA TTTAACGTTG TAGAAGAACC TCTTGCTCCT AAATTAGAAA CAGTGATTGA AAAAGAAAAC AGAATTCCTG ACAAAACAAT TTCTGTATCT ACTCCAGAAA AAACTGTAGA TGATGCAACC TTTGCTCCTA GAGTTGTTTC TGGCTCCTTA CTTACTCCAA TTAAAGATGA AGCCTCTAAT GTAAATCTCA AAGTATCTTC TGAGAGTGGT GTTTGTATAA TTGGTCCTGA TGCTGATTGT CTTGTTAGTG AATCCACTAG AAAACCTGGA CAAATTTATG ATGTCGTTGA AGTAGATGGA ATGAGTCTAA ATGTAAGGTA TAGTGGCCCT GATGTACGCC TAGAAAAATT CAGCATACTA CCTGAATCTT CTGAATCATT CTTGCCTGAT TCTGATTGGA ATGTAGAAAT ACTCAAAGAT GAACAAGTAT CCAGGTTCTA TTACAAAGTA ACTTACAAAA CAATAGAGTA A
|
Protein sequence | MNQTLHIVFF IFLLVGVIVP AYAQTAENVV INEVDINPPG DDSKSISEWV ELYNPTDSEI DLSGWQIAST TVLKKTMTIG SGTTIEPGQF LTFSYQSVWF TDINESVELR DENGIVIDKT PILSDIKNDF TSWQRIYDGY DLDNPDDWKF VTSTSGSTNG KLVQEQKQDE ISLSLSSDKS FYLFGETAVI EGSVSKAVFV EKPFFQPEVI TVKISGPNFD KTLTLYPDLN KNFKTTLGLQ KVLGINEGDY TINASYAGST VNTSFSVGYE ITEEQVQQDS FLNLNTDKTQ YIPGQMVSIT GTATDIVEFQ GMKFTVTNSE GTIVYNGNLF PVNGQFKTSI FLSTVNPVYG TYEIIGEYVD KSVITTFEVI EDAKETVPIS LWTNKDVYGT GEVVTITGRL NDVWVASLDL EIVQTKNLAL GTGSQLGGGN VLKILDVVRI DGDGKFKYSF TIPDVDTRLG DYKIKVSKDI GSAKKTVMVV KEPENFVPIT DPLIVTTNKL VYDFTLDKEL VIRGQIKNPV DRTSFETPTV LISFKDENGK PLSIIGVPEG VNQGAAGGTG SVTAKYQFTA IPESGGTFSV TADISRGIFS EGTYTITAQY LDLTSTTSFD IVDDLAGGGV VSLDKDVYGL GEQVVVSGII PTSDRSVTIS VTRPDGTKTT YGEAVDNQRF SWSWTTPVSE RYQTLKSDGE RGVTFSNFGI YKIKVAGDTY SKDLLFKVST DPENDSLSAT PLFITTDKSL YQAGDKLKVI GNVIPRDQGD EGLVVPDRVT IKVLDGTFPY KQIHEASVYP KQGGEFSSLF ELPATIFSEG MYTVKAIYST KQVTSTFSVA NDFTFGIDEP VSLLASTDKS EYYPGDTVII SGKPNKLIYL EAYDVSIIKK SDTEITCGSF ICGTHVGQVK SIRPSPSGSF THEFPIKNTL SSIGTYEVTI DADFETKHIR FNVVEEPLAP KLETVIEKEN RIPDKTISVS TPEKTVDDAT FAPRVVSGSL LTPIKDEASN VNLKVSSESG VCIIGPDADC LVSESTRKPG QIYDVVEVDG MSLNVRYSGP DVRLEKFSIL PESSESFLPD SDWNVEILKD EQVSRFYYKV TYKTIE
|
| |
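The gene sequence above is shown in space-separated blocks of ten bases, so it needs to be cleaned before any analysis. The following is a minimal sketch, not the database's own method, of how one might strip the whitespace, compute GC content, and translate the CDS. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs only in the start codons it allows, which does not matter here because this gene begins with ATG.

```python
# Standard codon assignments (shared by NCBI translation tables 1 and 11),
# listed in TCAG order for the first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on codons with ambiguous bases such as N.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record
prefix = clean_sequence("ATGAATCAGA CTCTGCATAT TGTATTTTTC")
print(translate(prefix))  # MNQTLHIVFF
```

The translated prefix agrees with the first block of the protein sequence in the record. Applying `gc_fraction` to the full cleaned gene sequence should give a value close to the 35% reported in the GC content field.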