Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0742 |
Symbol | |
ID | 5773024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 671572 |
End bp | 672669 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641316379 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001582076 |
Protein GI | 161528250 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.547138 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGAGC AATTAGTAGG TGACTCTAAG GCATTAGATC GAGCAATTGC AGGAGAAGAT TTGTCCTACA ATGACGGTAT GGATTTGATG AATTATGATA ATTTACATCT TCTTGGTGCA GTTGCTGATG CTCGAAGAAA AGAGTTGGTT GGTGATACTG TAACTTTTGC AGCATCTTAT TACATGAATT ACACAAATGT CTGTGCTGCA AGTTGTCAAA TGTGTGCCTT TTACCGAAAA GATGGTGCTG AAGATGCATA TACGCTAACT CCTCAAGAAA TTGAACAAAG AGTTGGAATT GCAAAACAAA TGGGAGCAAC TGAAGTACAC ATTGTTGGAG GATTTCATCC AAAACTTCCA TTAGAATACT ATGAAGACAT GATGAGAATA ATCAAAAAAC ATCATCCTCA ACTAAACATT AAAGCATTAA CTGCTGCAGA AATTTTCTAT TTATCAAAAC TGACAAAAAA TTCTACTAAA GAAATTTTAT CTCGTTTAAA AGATGCTGGA CTTGATTCAA TGCCTGGTGG TGGTGCTGAA CTATTTCATC CAGAAATTAG AAAACAAATT GTTCGAGGAA AATGTACTGG ACAAGAATGG TTAGATGTAA TTGAAGAAGC ACATACCATG GGAATTCAAA GTAATGTAAC TATGCTTTAT GGACATATAG AAAAACCTGA GCACATTGTT GATCATCTCA TTAAAATTCG TGACTTGCAA AAAAAGACAA ATGGATTTAT CACTCTTATT CCTCTCAAAT TTAGCCTTGA TAATACTGAA TTAGAACAAG AACATTTAGT AAATAATGAA TGTTCTTCTG TGTATGACTT GAGAGTGATT GCATTGTCTC GTTTAATGCT TGCAAATTAT TTGAACAACA TTTCTGTTTA TTGGGTAGCA TATGGTAAGA AACTTGCTCA AGTAGCATTA TCTAATGGTG GTAGTGATCT TGTTGGTACT GCCTTTTCTG AAGAAATTTA TCGCGCTGCA GGAAAAGCAA CTAACTCTTC AGTAGATGAA CTCGCAACTA TGGTAAAAGA GATAGGACGT GTTCCTGCAC AACGAAATAC TCATTTTGGA ATTTTAAAGA ATTTTTAA
|
Protein sequence | MLEQLVGDSK ALDRAIAGED LSYNDGMDLM NYDNLHLLGA VADARRKELV GDTVTFAASY YMNYTNVCAA SCQMCAFYRK DGAEDAYTLT PQEIEQRVGI AKQMGATEVH IVGGFHPKLP LEYYEDMMRI IKKHHPQLNI KALTAAEIFY LSKLTKNSTK EILSRLKDAG LDSMPGGGAE LFHPEIRKQI VRGKCTGQEW LDVIEEAHTM GIQSNVTMLY GHIEKPEHIV DHLIKIRDLQ KKTNGFITLI PLKFSLDNTE LEQEHLVNNE CSSVYDLRVI ALSRLMLANY LNNISVYWVA YGKKLAQVAL SNGGSDLVGT AFSEEIYRAA GKATNSSVDE LATMVKEIGR VPAQRNTHFG ILKNF
|
| |