Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0351 |
Symbol | |
ID | 5773254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 318160 |
End bp | 319821 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641315979 |
Product | TPR repeat-containing protein |
Protein accession | YP_001581685 |
Protein GI | 161527859 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATTAA TCTTAGGAAT CACCCTAGTG TTAACAGCTT TTACACCACT TGCATTCAAT GATGTTTTTG CTGATTCATT TAATGTGAGT TTTAATCAGG ATAGTTATCA AACTGGTGAC ACACTAGTAA TTTCTGGTCA AATAATTGAT TTTGGAATGC CTGTCATTGC TATGAGTATA TTTGATCCTG ATGAAAAAAT CCTCACTGCA AATAGTTTAG AACTTTCTTC TGATGGTAGT TTCTCAAAAA CCATTCCTTT AGAGTCTCCA TTTTATGAAA TGTCTGGAGA TTATCTCATA AAATTAGACT ATGGACAAGT CACTGAAGAA CATAACTTTG TAATTGGTGA CATTATTTCT GAATCTGAAA TTGCTCCTCA GGAGGTTGTT GCACTTGAAA TTATGTCTCT CTATACTGAA AAAGAGCATT ACTTTGATGG TGATACAGTC AACATTTTTG GAACTGTGTC ATCCATAGAT TCTCCTACTA TTCTCATAGG AGTTCATGAT CCATTTGGAA CTCCTGCTGG ATTTTATTTT GGAAATATAA ACAATAACTT GGAGTTTTCC ACAAACTTTC TAATCAAAGA TGGAGTGAAT TTTAAAGCTG AAGGAACTTA CTCGATTAAA GCTCATTATG CTGAAACTAG TGTAGAATAC TTCTTTGAAT ATTCTGCAGA ATCATCTACA GAAACACAAA CAACTGAAGA CACAACACAA ACAACTGAAG ACACAACACA AACAACTGAA GACACAACAC AAACAACTGA AGACACAACA CAAACAACTG AAGACACAAC ACAAACAACT GAAGACACAA CACAAACAAC TGAAGACACA ACACAAACAA CTGAAGACAC AACACAAACA ACTGAAGACA ATATACCTAG AGAATCTGTT GAAGAAAAAA TAGTTTCACA AAATGAAGCA ATAACTGAAA CAATAAATGA AAAAACAGAA ACTGAAACAA AAATTATTAC AAAAGAAACT CAAACAAAAA ATGATTCTGA AAATATTTTA GAAAATGAAG AACATGATAA TCTTTCTGTA GAAGATATTG AGCTTGGAAA ATTATTAAAT CAAATTAATC TCGATTGTGA TTCTAGCACT TTTGTTGATA CTATTTCTTA CTATGATGGT ATGGGTCCTG CACTTTACAG ATTATGTCAA TTTGATAGTT CTTTGAATTT TTTTAATGAG TCTTTGACTA GTGATCCTGA TAATGTTGAA ATTCTTGTCA ATAAAGGGTC TACACTAGGA AAAATTGGAT TTGTTTCTGA AGCAATAGCA TATTATGATC ATGCTATCTC TCTTGATCCC CAATACCTGC CTGCAAAAAA TAACAAAGCA AATGCCTTGG CTAATCTTGG AAATCTTGAT GATGCCATTT TACTCTACAA TGAAATCTTA GAGGAAAATA CGAATTACTA TACTGCAAGA ACAAATCTCA ATACTGCATT ATTACTAAAA TCTGAAACTT CTGAAACTTC TGATGTGATA AGTACTGTTA AATCTGAAAT TGAAACTTCA CCTGAAAATA CTTCACCTGA AAACATTTTC CCTGAAAAAA CAATCTCAAC AGAAAACAAA AATGAAACAC AATCAAACTT TTTTGAAGAA TTAACTCGTG TATTTTCATC TTTGTTTGGT TTTACTGAGT GA
|
Protein sequence | MRLILGITLV LTAFTPLAFN DVFADSFNVS FNQDSYQTGD TLVISGQIID FGMPVIAMSI FDPDEKILTA NSLELSSDGS FSKTIPLESP FYEMSGDYLI KLDYGQVTEE HNFVIGDIIS ESEIAPQEVV ALEIMSLYTE KEHYFDGDTV NIFGTVSSID SPTILIGVHD PFGTPAGFYF GNINNNLEFS TNFLIKDGVN FKAEGTYSIK AHYAETSVEY FFEYSAESST ETQTTEDTTQ TTEDTTQTTE DTTQTTEDTT QTTEDTTQTT EDTTQTTEDT TQTTEDTTQT TEDNIPRESV EEKIVSQNEA ITETINEKTE TETKIITKET QTKNDSENIL ENEEHDNLSV EDIELGKLLN QINLDCDSST FVDTISYYDG MGPALYRLCQ FDSSLNFFNE SLTSDPDNVE ILVNKGSTLG KIGFVSEAIA YYDHAISLDP QYLPAKNNKA NALANLGNLD DAILLYNEIL EENTNYYTAR TNLNTALLLK SETSETSDVI STVKSEIETS PENTSPENIF PEKTISTENK NETQSNFFEE LTRVFSSLFG FTE
|
| |