Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0554 |
Symbol | |
ID | 5772944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 493936 |
End bp | 495054 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641316187 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001581888 |
Protein GI | 161528062 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.420102 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGTCAGA CAACAGAACA ATTAGAAAAA AGTGACATTA AAGATATTTT AGAAAATTCT CTTAATGGAA AAAGACCTGG TCCTGAAGAC TGTATGAGAT TGTTGGAGTC TGATGATGTT CATCTAATGG GACTTGTATC TGGCCATTTG ACAAGAAAGC AATTTGGAAA GAAAGCATCT TTTGTCAATA ATATTATTTT GAATTACACC AATGTCTGTA TTACTGACTG TAAGTTTTGT GCATTTTACA GATCACCTGG TGCTGATGAT TCTTACACTT TAACTTTGGA ACAAATTGAA TCACGTGTAA AAACCGCATG GGACATGTTT AAGATCCGAC AGGTCTTGAT TCAAGGTGGT CATAACCCAA ATCTGAAAAT TGAATACTAT GAAGATGCAT TTAGAATGAT TAGGGAGAAA TTCCCTAAAG TTGGTGTACA TGGATTGTCA ACATCAGAAA TTGACATGAT TGCAAGAGTT GAAAAATCCT CAACAAAAGA AATTTTATCA CGACTCAAAG ACGCAGGTTT ACAATCAATG CCTGGTGCAG GAGCTGAAAT CTTGACTGAC TCTGTTAAAG AAATCATTAG TCCAAAGAAA ATCTCTAGTG ATGCTTGGAT TAGAATCATG AATGAAGCTC ATTCACTTGG AATTCCATCT TCTGCAACAA TGATGTACGG ACATGTGGAA AACAAAAATG ACATTGTTGA ACACTTTTTC AAACTTGTAA AATTACAAGA AAAAACCAAA GGATTCATGG CATTTATCCC TTGGAACTTT GAGCCAAACA ATACTTTGAT GCATGAAGAG GGATTAGTTG AATATGGTAC TGGTGGAATT CAACTCTTGA AAATGATTGC AATCTCTAGA TTAATCTTTG ATGGACTTAT ACCTCACATA CAATCCTCAT GGCTGACAAA TGGTATCGGT ATGGCACAAC TAGCTTTACA GTATGGCGCT GATGACTTTG GTGGTACTCT AATTGGAGAA GAAGTAGTTT CATGTACTGG CGCACGCTCA ACTGAACTTA CTGATAAAAT AATCATGGAT GCAATTCATC AAATTGGTTA TTCAGTTGAA GAGAGAGATA ATTTCTATAA TCCTATTTCT GTATCATAG
|
Protein sequence | MSQTTEQLEK SDIKDILENS LNGKRPGPED CMRLLESDDV HLMGLVSGHL TRKQFGKKAS FVNNIILNYT NVCITDCKFC AFYRSPGADD SYTLTLEQIE SRVKTAWDMF KIRQVLIQGG HNPNLKIEYY EDAFRMIREK FPKVGVHGLS TSEIDMIARV EKSSTKEILS RLKDAGLQSM PGAGAEILTD SVKEIISPKK ISSDAWIRIM NEAHSLGIPS SATMMYGHVE NKNDIVEHFF KLVKLQEKTK GFMAFIPWNF EPNNTLMHEE GLVEYGTGGI QLLKMIAISR LIFDGLIPHI QSSWLTNGIG MAQLALQYGA DDFGGTLIGE EVVSCTGARS TELTDKIIMD AIHQIGYSVE ERDNFYNPIS VS
|
| |