Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0532 |
Symbol | |
ID | 5773391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 473435 |
End bp | 474580 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 641316165 |
Product | pseudouridylate synthase-like protein |
Protein accession | YP_001581866 |
Protein GI | 161528040 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1258] Predicted pseudouridylate synthase |
TIGRFAM ID | [TIGR01213] conserved hypothetical protein TIGR01213 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACTT ATCAGAAAAT TATTCCCATT GCAAATCAGA TATTAAAAAA ATATGATTTA TGTGATCATT GCCTTGGAAG ACTTTTCACA AAACAGCTTT ATCTTTCATC AAATAAACTC CTTGGGAAAA AATTAAAGAA AAATTCAAAA TCTTCTCAAA GATGTTATAT CTGTAAAAAT CTATTCGATA ATTTGAATTA TTTTTTGAAT ATGATGATTG ATTCTGCATC TCATTACTCT TATTCGTCAT TTAGTGTTGG CGCAACAATA AAGCCTTCAA TAATTGACAG AGATGATGTA ATTCGTTCTA AATACAAACT AAAAGGAATT GATGGGATAA AGACTGATGT AACAAAAGAA CTGGGAAAAT TATTTTCTAA AAAAACAAAG AAATCCTTTG ATTCTTTAGA TCCTGAGATT GTTTTTACTG TAAATCTAAA AGATGAATTT TGTGACATTA GATCCAAATC ATTAACTCTT TCAGGCAGAT ATGTTAAACC CGTACGAGGT TTCTTACAAA AACAAAAATC TTGTTCAAAT TGTTCTGGTA AGGGGTGTCG AATTTGTGAT TTTCATGGCA TCAAAGAATT TGATAGTGTT GAAGGAGAAA TTTCTCAATT TCTTTTTAAA AAATTAGGTG GCACCACTGC TAAATTCACC TGGATTGGTG GTGAGGATAA ATCAAGCTTG ATTCTAGGCA CAGGAAGACC GTTTTTTGTC AAGATCCAGA ACCCTCACAA AAGAAAATTG AGAGCAAAAT CTGCAAATCT AGAACACATC AAAGTTAGTA ATTTCAAAAT TGTTGCAGAC TCTCCAAAAA AACCATTAAA ATTTAATTCC TCAGTTGAAG CTAGAATTTC TACATCTTCA ATCATTGACG CAAAACTTTT ACGAAAATTA AAAAATCTCA CAAAAAAACC AATTGCAGTT TATGAAAAGT CTGGAAAACG ATCTGAGAAA AGAATACTTT CCATAAAATA TAAGAAATCT GATGAAACTT CCTTTACATT ATTTTTCAAA TTTGAAGGTG GTTTACCTGT AAAACGTTTT GTTACTGGTG ATGATGTTTC TCCTGGAATA AGCCAAATTC TTGATATGTC GTGTAAATGT CTTGAATTTG ATTTTCATGA TGTTGAAGTT AAATGA
|
Protein sequence | MTTYQKIIPI ANQILKKYDL CDHCLGRLFT KQLYLSSNKL LGKKLKKNSK SSQRCYICKN LFDNLNYFLN MMIDSASHYS YSSFSVGATI KPSIIDRDDV IRSKYKLKGI DGIKTDVTKE LGKLFSKKTK KSFDSLDPEI VFTVNLKDEF CDIRSKSLTL SGRYVKPVRG FLQKQKSCSN CSGKGCRICD FHGIKEFDSV EGEISQFLFK KLGGTTAKFT WIGGEDKSSL ILGTGRPFFV KIQNPHKRKL RAKSANLEHI KVSNFKIVAD SPKKPLKFNS SVEARISTSS IIDAKLLRKL KNLTKKPIAV YEKSGKRSEK RILSIKYKKS DETSFTLFFK FEGGLPVKRF VTGDDVSPGI SQILDMSCKC LEFDFHDVEV K
|
| |