Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1553 |
Symbol | |
ID | 5774221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1421678 |
End bp | 1422388 |
Gene Length | 711 bp |
Protein Length | 236 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641317205 |
Product | 1-(5-phosphoribosyl)-5-((5- phosphoribosylamino)methylideneamino)imidazole-4- carboxamideisomerase |
Protein accession | YP_001582887 |
Protein GI | 161529061 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase |
TIGRFAM ID | [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase [TIGR00734] hisA/hisF family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAA TTCCTGCAAT TGACTTGATG GACGGACAAG TAGTCAGACT ATACAAAGGA GATCCTAAAC AGAAAACTGT ATACAGTGAT GATCCTGTAT CTGTTGCCAA AAAATGGCAA AAAGCAGGTG CGGATATGTT GCATATTGTT GATTTGGATG CTACTATTGG GACTGGCAGT AATTTGGATT TGATTGAAAA AATTTCTAAA GAACTATCAA TTCCTGTTGA GGTAGCTGGC GGTTTACGTA ATGAAGAAAT TATTGATAGA GCAATTTCTT TTTCAAATAG AGTTGTAATT GGTACTATGG CATTCAAAGA TAAAGAAATG TTACAACGAA TTGCCAAAAA ATATGATTTT TCAAAAATTG TAATTTCAGT TGATCATATA GATGGATTCA TTGTTACTCA TGGTTGGCAA GAGAGTACGA AGACTCCTTT GCTAGATGCA ATAAATGAAT TTGTGAGTAT GGGATTTACT GAATTCTTAC TCACAAATGT AAGTAAAGAT GGAACATTGG AAGGACCAGA CTTGGAATAT TTAGAAAAAG CATGTGCTGT ACAAAATGCA AATGTTATTG CAAGTGGTGG AATTTCAAAT ATTGATGATG TGTCTGATGT ACAAAGAAAA AATGCATTTG CTGTAATTCT AGGAAAAGCA TTGTATGAAA ATAAAATTTC CATTGAGGAG GCAAAACAAC TTGTTAACTA A
|
Protein sequence | MKIIPAIDLM DGQVVRLYKG DPKQKTVYSD DPVSVAKKWQ KAGADMLHIV DLDATIGTGS NLDLIEKISK ELSIPVEVAG GLRNEEIIDR AISFSNRVVI GTMAFKDKEM LQRIAKKYDF SKIVISVDHI DGFIVTHGWQ ESTKTPLLDA INEFVSMGFT EFLLTNVSKD GTLEGPDLEY LEKACAVQNA NVIASGGISN IDDVSDVQRK NAFAVILGKA LYENKISIEE AKQLVN
|
| |