Gene Information |
Locus tag | Nmar_0121 |
Symbol | |
ID | 5773501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 110737 |
End bp | 111624 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 22% |
IMG OID | 641315741 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001581459 |
Protein GI | 161527633 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
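The coordinate and length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal sketch under two assumptions: Start bp and End bp are 1-based and inclusive, and the CDS ends in a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not translated into an amino acid.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinate range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(110737, 111624)
print(gene_len, protein_len)  # 888 295
```

The result matches the Gene Length (888 bp) and Protein Length (295 aa) fields. Strand does not affect the calculation, since it only determines which end of the range holds the start codon.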
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000000000196986 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAC AAATTAAAAT TTTGGTTACT GGTGCAAGTG GAATGTTAGG CAATAAAATC ATATCTGAAT TAAGTAATAG TGATTATCAA TCTTTAGGCA TTTCAAAAAA AAATACACAT ACAATAAACA ACACAATAAT AAAAAAATGT GACATAACAA ATTACAAACA ATTAAAAAAA ATTTTTGACG CATTCAAACC AAATATTATA ATTCATACAG CAAGTATAAC AGGCAACATT GAATGTGAAG AAAATCCAGA AAAAACTTTT TTGGTAAATT GTTTAGGAAC ATTTAATATT TTAAATCTAA TGAAAAAAAA TGGTGCCAAA ATAATTTTTT GTTCTTCAAG AGAAGTTTAT GGAAATTCAA AAAAGAAAGT TACTGAAAAA GATCTAGAGT TTCCAATTAA TCTAAATGGA ATTACAAAAA TTACTAGTGA AAATTTAATA AAAAAATTTC ATCAAACATA TAATGTTCAA TATGTAATCC TAAGATTTAC AAATTTTTAT GGAGATCTTA ATTCTAAAAG AGGAATATCA TTAATGATAA AAAATGCTAT AAAAAATAAA CAAGTTACAA TTTATGGGGG GAAGCAAATT CTTAATTTAT TACATATAGA TGACGCAGTT AAGGCAATAT TATTATCAAT TAAATATAAA AATTCGAATA CATTCAATAT TGGTTCAGAT GAAAAAACAA CATTACCAAA ATTGATTAAA ATTATTGAAA ATAACATTAA TCAAAAAATT AAAATCAACA AAAAAAATGC AAGAGTAATA GAACCACAAA AATTTGTAAT TAATATTAAA AAAGCAAAAA ATGAGCTTGG ATTTACTCCA AATTTTACTC TTGATTTAGG AATTAAAAAA CTGGTAAAAG AAATCTAA
|
Protein sequence | MKKQIKILVT GASGMLGNKI ISELSNSDYQ SLGISKKNTH TINNTIIKKC DITNYKQLKK IFDAFKPNII IHTASITGNI ECEENPEKTF LVNCLGTFNI LNLMKKNGAK IIFCSSREVY GNSKKKVTEK DLEFPINLNG ITKITSENLI KKFHQTYNVQ YVILRFTNFY GDLNSKRGIS LMIKNAIKNK QVTIYGGKQI LNLLHIDDAV KAILLSIKYK NSNTFNIGSD EKTTLPKLIK IIENNINQKI KINKKNARVI EPQKFVINIK KAKNELGFTP NFTLDLGIKK LVKEI