Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1007 |
Symbol | |
ID | 5773093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 881616 |
End bp | 882884 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641316646 |
Product | hypothetical protein |
Protein accession | YP_001582341 |
Protein GI | 161528515 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCTTAG AATATCAACT AGCTGCACTT GCAGGTCTAA TTGGTCTTTC TGGTTTTTTC AGTGGTCTTG AAGTTGCACT TGTGGGTACA AGTCAAGCCA CAATTGAGAG ACTCGTCAAA GATAATGTAA AGGGCGCAAA ATCTCTTCAG AAATTAAAGG CCAATCCTGG ATGGATGATG TCTAGTGTCA ATCTTGGAAA TAATCTTGTC AATATAGGCT CTGCATCCCT TGCTACTATA GTTGCAATTG AAATTTTTGG AGATAACGGA GTGGGAATTG CAGTTGGTAT TATGACTTTT CTTGTAATAA TATTTGGTGA AGTAACTCCA AAAACTTATT GTAATGCAAA TGCCACAAAA GTTGCTTTAC GATGCAGTAG AATTTTGTTA ACATTCAGTT ATGTTTTCTA TCCTGCAGTT TGGATTCTTG AAAAGATAAC TCGTGGAATT ATCAAAATAA CTGGAAGTGA TTATCAACCT CCTGCCCTAA CTGAGGATGA AATTAAAGGA ATTATTGCTC AGGGTCATAG AGATGAAGCT TTAGAAAAAT CTGAACGAGA TTTACTTTAC GGTGCTCTCA AATTTGATGA TACTGTGATA AGATCTGTAA TGATGCCAAG AACTAGAATG TTTAGTTTGC ATGGAGATAT GGAACTAATT ACAGCTGCAG ATAAGATTCA CAAGAGTGGT CATTCTAGAA TCCCCATATA TGGAAAGGAT CATGATGACA TACTTGGTAT TCTTCATGTA AGAGATATTC TCAAACATCT AAAAGATAAA GAACTGCAAA AAATGAAACT ACGAGAATTT GTAAGAGAAC CAATCTATGT GTCTCAGGAA AAACGAATGA GCGAACTTCT CAAACAAATG CAGGCAAAAA ATACCCATAT GGCCATAGTT GTTGATGAAT TTGGTGGCGT TGAAGGCCTT GTTACTCTAG AAGATCTTAT TGAAGAGATA GTTGGCGAAA TTCATGATGA GACTGATCTA AAGAGTCCTC ATTATCAAAA AATCAATAAT GATGTAATTC TTGCAAATGG AGAAATTGAA ATAGACGAGA TTAATGAAAT CTTCAAATCC AATCTTCCTA GAGGTGATGA TTATTCTACA TTAAATGGCT TGTTGCATGA GAAACTTCAT GATATTCCTC AAGTTGGAAA TGTCATAAAC ATTGATGCAT TAGAAATCAA GGTTGAAAAG GTTTCAAAAA ACAAACCTGT TTCCTTACGA ATTACTAAGA AAAAACCTCT TGAGGAGAAT CTAGATTGA
|
Protein sequence | MSLEYQLAAL AGLIGLSGFF SGLEVALVGT SQATIERLVK DNVKGAKSLQ KLKANPGWMM SSVNLGNNLV NIGSASLATI VAIEIFGDNG VGIAVGIMTF LVIIFGEVTP KTYCNANATK VALRCSRILL TFSYVFYPAV WILEKITRGI IKITGSDYQP PALTEDEIKG IIAQGHRDEA LEKSERDLLY GALKFDDTVI RSVMMPRTRM FSLHGDMELI TAADKIHKSG HSRIPIYGKD HDDILGILHV RDILKHLKDK ELQKMKLREF VREPIYVSQE KRMSELLKQM QAKNTHMAIV VDEFGGVEGL VTLEDLIEEI VGEIHDETDL KSPHYQKINN DVILANGEIE IDEINEIFKS NLPRGDDYST LNGLLHEKLH DIPQVGNVIN IDALEIKVEK VSKNKPVSLR ITKKKPLEEN LD
|
| |