Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0977 |
Symbol | |
ID | 5773801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 853513 |
End bp | 854892 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641316616 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001582311 |
Protein GI | 161528485 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0702] Predicted nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0520232 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAAG TGATTTCATC ATCTAAATCT TCTGATTACA AACCATTTTC AATCTTGGTT ACTGGTGCAA CGGGATTTAT TGGCTCAAGA TTAATTTCAT CTCTTGTTTC TTCAGGTTAC ACTGTAAAGG GTCTGAGTAG AAAACAATTA TCTGGTAATG ATAAAGTAAA ATATGTTAAA GCAGATGCAT TCAACTTTGA TGAATTAAAA AATGCAATGT TTGGTATTGA TACTGCATAT TATTTACTTC ACTCTATGGA AGGTGATAAA GGTGATTGGC AAGAATTTGC TACACGAGAA AATAATCAAG CTCAAAACTT TCTCAAAGCT GCCACAGAAT CTGGTGTAAA ACGAATTATT TACCTTGGTG GTTTAGTTAA TGATAGTCTT GGTCTCTCAC CTCATATGCG TAGTAGGAAA GAAGTAGGGG AAATTCTTGC ATCTGGAAAT ATTCCTGTTA CAGAATTTAG AGCTTCTATA ATTATTGGTG CAAAAAGTGG TTCTTATGCT ATGCTTCGTT ATCTTGTTGA AAGATTGAGT GTGATGGTAT GTCCTTCGTG GGTCAAGTCA TTGGCCCAAC CAATTGCAGT TGATGATGTG ATTAGTTATC TTGCTGAATC TCTATCAAAA CCTGAAACAA TGGGTAAAAT TTTTGAGATT GGAGGTCCTG ATAAAATGAC TTATGAGGAA TTAATGCGTG TGTATTCAGC ATATCTGAAC AAAAATCTGT TTGTCATTCA AATCCCATTT CTAACCACTA GACTATCTTC GTATTGGGTT GATCTTATCA CTCCTGTAAA GGCATCACTT GCAAGGCCTC TGATTGATAG CTTGGTTCAT GATACTGTTG TCGCCGACGA TTCTATAACA AAAATCATTC CTATGCATCT AAAATCCGTT CGTGAAGCAA TTGATATCGC AACAAAAGAA ATGAAATCTG ATCCTCCGCA AATGGAACAA AAAGAGGAGA AAACAGGGTT TAAAATCAAT CAAAAATTAA TTCAAGTTTC TCTATTTGCA TTAGCTATTA TTGGTTCAAG TTATTATTGG CTAGATGATA GAACTGATGT CTATGAACCT ATGTGGTTAA TTGGTTCTGC CTTTTGGTAT ATTGCTATTG GATTTGCAAT AATACTAATT CACAACAAAA CCCGTTTAGG TTATTTGATA GCAGGTGTAT TATCCTGGAT TACTTTGGTG TTCTGGTTGT TTGACAATTA CTATGTTATT TTTGAAAATT CATTGATAGC AACTACTCCA AATGAATTAA TGACAATAAG GAATTTTATT GGAATATTCA TAGTTGCATT AACTGTTATT GCATCTCATA ACTTGTTCCA CAAAGTAATT GATTATCAGT ACAAGGGTAA ACCCCTATGA
|
Protein sequence | MKQVISSSKS SDYKPFSILV TGATGFIGSR LISSLVSSGY TVKGLSRKQL SGNDKVKYVK ADAFNFDELK NAMFGIDTAY YLLHSMEGDK GDWQEFATRE NNQAQNFLKA ATESGVKRII YLGGLVNDSL GLSPHMRSRK EVGEILASGN IPVTEFRASI IIGAKSGSYA MLRYLVERLS VMVCPSWVKS LAQPIAVDDV ISYLAESLSK PETMGKIFEI GGPDKMTYEE LMRVYSAYLN KNLFVIQIPF LTTRLSSYWV DLITPVKASL ARPLIDSLVH DTVVADDSIT KIIPMHLKSV REAIDIATKE MKSDPPQMEQ KEEKTGFKIN QKLIQVSLFA LAIIGSSYYW LDDRTDVYEP MWLIGSAFWY IAIGFAIILI HNKTRLGYLI AGVLSWITLV FWLFDNYYVI FENSLIATTP NELMTIRNFI GIFIVALTVI ASHNLFHKVI DYQYKGKPL
|
| |