Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0223 |
Symbol | |
ID | 5774582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 197758 |
End bp | 198627 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 641315843 |
Product | formyl transferase domain-containing protein |
Protein accession | YP_001581557 |
Protein GI | 161527731 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00639] phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAT CTTCAAAAAA CAAAGTCATG AAAAAGACAG TAGTTGGAAT AACAGTTGTT GGTAAAGATA GAGAAGGTAT TGTAGCTTCA TTTACAAATT TTGCATTCTC AAAAGGAGGA AACATTGAGA AAGTAAATCA GAATGTAATC AAGGGCCTTT TTGGAATGTA TCTAGAAGTT TCTTTTGCAA AAGCAGTTAA TGTAAAAAAA TTTGATGCAG AAATTCAAAC TTTGGCTAAA AAAGAAAAGA TGGATGTAAG TACTCATCAT GAAACAAATT CGCAAAAAAA TATTGCAGTT TTTGTGACAA AAGAACCATT ATGTCTACAA ACAATTCTTG CAAAATCAAA ATCACTAAAA GGAAAAATCT CAGTAATTAT AGGTACTGAA AAGACACTTG AATCATTAGC AAAGAAAGCA AAGATTCCAT TTGTTGCAGT TGAAGAGAAG AATCAACAAA AGGCAGAAGA AAAAATTATT CAGATTTGTA AAAAATACAA TATTGATTTG ATCTCACTTG CAAGATACAT GAGAATTCTT AGTCCTAACT TTGTTTGGAG ATATCCAAAT AGAATTATCA ACATACATCC ATCATTATTG CCAGCATTTC CTGGTGCACT AGCATATGCA CAAGCTTATG AAAGAGGTAC AAAGATTGTA GGAGTTACAT CCCATTACGT AACTGAAAAC TTGGATCAAG GACCAATAAT TTTCCAAGAT TCTTTCAAAG TAGATCCAAA TGATACTTTA GAGAAAATAA AATCAAAGGG GCAAAAATTA GAGGCAGATA CATTATTCAA AGCAATGAAA ATGCATTTAG AAAACAAACT AGATGTTCGT TGGAGAAAGG TTCACATCAA ATCAAAGTGA
|
Protein sequence | MRKSSKNKVM KKTVVGITVV GKDREGIVAS FTNFAFSKGG NIEKVNQNVI KGLFGMYLEV SFAKAVNVKK FDAEIQTLAK KEKMDVSTHH ETNSQKNIAV FVTKEPLCLQ TILAKSKSLK GKISVIIGTE KTLESLAKKA KIPFVAVEEK NQQKAEEKII QICKKYNIDL ISLARYMRIL SPNFVWRYPN RIINIHPSLL PAFPGALAYA QAYERGTKIV GVTSHYVTEN LDQGPIIFQD SFKVDPNDTL EKIKSKGQKL EADTLFKAMK MHLENKLDVR WRKVHIKSK
|
| |