Gene Information |
Locus tag | Nmar_0107 |
Symbol | |
ID | 5774470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 97656 |
End bp | 98852 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641315727 |
Product | peptidase M50 |
Protein accession | YP_001581445 |
Protein GI | 161527619 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0000498394 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGGAACTTG ATTTTATCAC TCAAAATTCG ATTATCTATG TGTTGATGGC ATGGGTAGTA ATTGTTATTG TTGCAAAAGG ACTAAAATTA GAGAAACATG GTTTTGAAAT AAAAGCATAC AGTCTAACTT ACAAAAACAA ACAAGTTAAC TCAGTACTGT TAAAACTTCT TAGTCGGACA AGAAGAGGAA TTAGAGTTTT TGCAGATGTA AGTGTAATTT CAGGTTTTAT AATGATGGGT TTTGCATTTT GGTTTTTGCT AAATAATGTT GCAAACTTTT TTGTTGCACA AACAGAATTT TCAGAATTAA CTGTTCTTAT TCCAGGAGTT ACATTAACTT CAGCAGCATC AATTACATAT TTTTTACTAT CAATTCCAGT TGTACTAGTA ATTCATGAGG GAGCACACGG TATTGTTGCA GCATTAGAAA AGATAAAGAT CAAAACAGGA GGATTTGCAA TATTCATTGC AATGTTTGCA GGCTTTGTAG AACCTGATGA GGAAGAATTC AACAAAGCAA AAAAGATCTC AAAACTCAGA GTTATTGGAG CAGGAGCTAC ATCAAATGTA ATTTTTGCAT TTGCTTTAGG AGTAATTTTA CTTACAAATC CATTTTTTGC AATGGTATTA CCTGAACCAC TATTAAGTAC ATTTTACGAA TTACCAGAAG GAGTCCTAAT TCTTTCAATT ATTGAAAATT CAGGAGCAGA GCAAGCAGGA TTACTTGCAA ATGACATCAT AACATCAATC AATGACAAGT CCATTCTAAG TCCGGCGGAT TTCCCCAGTT TAAATCCAGG AGAGACAGCA AGTGTCTCTG TACTTAGAGA TGGACAACCA TTGGACTTTA GTCTTGAAGT AATGCCAGCA CCAGATGATC CAGAAAGAGG ATTGATTGGA ATTATGAGAG ATAATTCATT TGCATACAAG CCTATATTGA ATTTTATTGA ATGGAATGAT CCAAACGTCT CAATGTTCCT CTTATGGTTA TGGATGATTT CATTTTTCAT TGGAATAATC AATATGCTTC CATTACCAAT TTTAGATGGA GGTAAATTCA TTCATACGAT TATTGATCAA AGAATTTCAG AAAAAGCAGT AAATGGAGTA ATGTGGGGAA TCTATGCGTT TACTTTTGCT TTGTTTGGCC TAAACATTGC CCTCTCATAT GTAAAATCTG GTTGGTTTAC AATATAA
Protein sequence | MELDFITQNS IIYVLMAWVV IVIVAKGLKL EKHGFEIKAY SLTYKNKQVN SVLLKLLSRT RRGIRVFADV SVISGFIMMG FAFWFLLNNV ANFFVAQTEF SELTVLIPGV TLTSAASITY FLLSIPVVLV IHEGAHGIVA ALEKIKIKTG GFAIFIAMFA GFVEPDEEEF NKAKKISKLR VIGAGATSNV IFAFALGVIL LTNPFFAMVL PEPLLSTFYE LPEGVLILSI IENSGAEQAG LLANDIITSI NDKSILSPAD FPSLNPGETA SVSVLRDGQP LDFSLEVMPA PDDPERGLIG IMRDNSFAYK PILNFIEWND PNVSMFLLWL WMISFFIGII NMLPLPILDG GKFIHTIIDQ RISEKAVNGV MWGIYAFTFA LFGLNIALSY VKSGWFTI
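The coordinate, length, and GC fields in this record are related by simple arithmetic, so they can be cross-checked. The following is a minimal sketch, not part of the original record: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to end with a single stop codon that is not counted in the protein length.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces that split the sequence into 10-letter groups."""
    return "".join(grouped.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq.upper() if base in "GC")
    return 100.0 * gc / len(seq)


# Values taken from the Gene Information fields above.
length_bp = gene_length(97656, 98852)
print(length_bp)                           # 1197
print(expected_protein_length(length_bp))  # 398
```

To check the GC content field, pass the Gene sequence value through clean_sequence and then gc_percent; the result can be compared with the rounded 33% reported in the record.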