Gene Information |
Locus tag | Nmar_1328 |
Symbol | |
ID | 5774745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1218185 |
End bp | 1219645 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641316973 |
Product | hypothetical protein |
Protein accession | YP_001582662 |
Protein GI | 161528836 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
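The length fields in this record follow from the coordinates. The sketch below is a consistency check, not part of the database itself. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminating stop codon; both assumptions match the values listed here. The function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for Nmar_1328.
length_bp = gene_length(1218185, 1219645)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the coordinates above, this gives 1461 bp and 486 aa, matching the Gene Length and Protein Length fields.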
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00000298022 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAGCA TTAGATTGTT TTCAATATCT AGTGTAACAG CAATTTTTTT GTTGCTTTCT TTGTCCGTGT TTGTTCCCGT GTTAGGAGAA ATTGCACCGC CAAACAAACA GATGAAATTA GGAGTACTTG CCGAGGATGT AATTTGCAAG GAAGGATTAC AGCTAATGAT TAGAAGCAGT GGAAATGCCA TATGTGTAAA AGATTCTTCT ACAACTCGCC TAGCTGACTC TGGTTCTGCA ATAGTTGTCC CAAACAATCC AATTCCTCTC ATAGAAAAAC AAACAGAGGA ATCTGAAAAT ACAGTGCAAG ATGAAATTAC TCAAAAGGCT GAAAAAACCA TAGTAATTGA GCCTACAAAC GTAGGTTACT TTCCTGGCGA TACAATTGAT TTTTCAGGCA AGGCAAGAGG TAACCAGAAT CTTGAGATAA CCTTGATTGA TCCTAACGAA AAAGAGGTTT TCACGGAGGT TTTTGAATTG GACTCTTCTG GTAATGTGAG CTTCCAAATA GTAACTGAGA CTTTTTTTGT AGAGGGACGT TACTTTCTGA TTGCAGAACA AAGTGATGAC TCTGAAATAA CACCTGTAAC TATTGGACAT AATTCTAATG ACATTCAATT GGTAGTGGAT GATTTTTACA ACAAATTAGA TTCCGAATTA TTAATTGAGA TGTATAGTGA TCCTCTGTCC ACAGTAGAGT TGGCAGTGTA TGATTATGCA GAACATGAAA AATTTAGATC TGACATCACC GTCAATTCAA ATGGCTATGC GGAAACTACC ATAGATCTGA GCGGATACAA GTCAGGAGTT TACTCTGCAG TGCTAACACA TGGAACCGAA GATGTTGATG TAGATTTTGC AGTGGGTCTA AAAACAGGTT CCACCTTACC AATAACACTC AGTGTCACCA AAGAACACTA TGTTCCTAAT GAACGGGTTC TAATTTTTGG AACAGCAAGT AGTAATTCTG CAGTTACATT AGATGTGATA AATCCTGATG GTGATCTTGT TCACGAGATT GAATCATACT CTGATGCATC TGGAAAAATT TCAACCATGT TTAATCTTCC TACAAGTGCA GCTACAGGAG AATGGAAGAT AGCAACTACA GGAAGATCCA TAGTTAACGA AGCCACTTTT CATGTAACCG GAAATGATCC CCCATTGTCT TTTGAGATAG ACAAGGTAGA GCCATACGAG ACAGGAGATG TTTTAACATT GGTGGGAAGT GGCGTAGACA CCGTATCTAA AATTGCAATT AGCATCACAT CAGGTGAAGT AATAGAAAAA TTCGAGGTGT TTGCTACAGA AGACGGGGAT TTCTCTTTAA GTTGGAGCAT TCCTGAGGAT TTAGAGTCAG GAACACATAC CATTACTGTT GATGACAACA TTACAACAGT TACCAAAAAC TTTGAGGTCA TTAACCCTCT CTATGAGAAA AGTTATTTTA CATCTCCATG A
|
Protein sequence | MNSIRLFSIS SVTAIFLLLS LSVFVPVLGE IAPPNKQMKL GVLAEDVICK EGLQLMIRSS GNAICVKDSS TTRLADSGSA IVVPNNPIPL IEKQTEESEN TVQDEITQKA EKTIVIEPTN VGYFPGDTID FSGKARGNQN LEITLIDPNE KEVFTEVFEL DSSGNVSFQI VTETFFVEGR YFLIAEQSDD SEITPVTIGH NSNDIQLVVD DFYNKLDSEL LIEMYSDPLS TVELAVYDYA EHEKFRSDIT VNSNGYAETT IDLSGYKSGV YSAVLTHGTE DVDVDFAVGL KTGSTLPITL SVTKEHYVPN ERVLIFGTAS SNSAVTLDVI NPDGDLVHEI ESYSDASGKI STMFNLPTSA ATGEWKIATT GRSIVNEATF HVTGNDPPLS FEIDKVEPYE TGDVLTLVGS GVDTVSKIAI SITSGEVIEK FEVFATEDGD FSLSWSIPED LESGTHTITV DDNITTVTKN FEVINPLYEK SYFTSP
|
| |
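The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the + strand sequence is given 5' to 3' as listed. Translation table 11 assigns the same amino acids to codons as the standard genetic code during elongation (the two differ only in permitted start codons), so the standard assignments are used here. The `translate` helper is illustrative and only short excerpts of the record are translated.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of all three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string; '*' marks a stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted in directly. A trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 18 and last 21 bases of the gene sequence in this record.
print(translate("ATGAATAGCA TTAGATTG"))
print(translate("AGTTATTTTA CATCTCCATG A"))
```

The first excerpt translates to MNSIRL and the second to SYFTSP followed by a stop, which agree with the start and end of the listed protein sequence.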