Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0117 |
Symbol | |
ID | 5773852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 105630 |
End bp | 106637 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 641315737 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001581455 |
Protein GI | 161527629 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000000592797 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCAATCAA TTTTAAATAA AAAAAATATT CTTGTTACTG GTGGAACCGG TTCTATAGGA CAAGCTTTGG TTCAAAGAGC TATATCTGAT GGCGCAAAAC ATATCAAAGT TTTCAGTAAT GATGAAAATG CCCTTTATGA AATGGAATTA GATTTTTCTA AACATAAAAA CATCGAATAC ATAATTGGCG ATATTAGAGA TTTTGATAAA ATCAATTCAA TCGTAAAAAA TTGTGATATT ATTTTCCATG CCGCTGCACT CAAACATGTA GATAGATGTG AATTATATCC ATTAGAAACA ATGACCGTAA ACATAATTGG AACAAATAAT GTTGCAAAAG CTGCAGTCAA TGCGAATGTA TCAAAAGTTA TTTCTATTAG TACTGATAAA GCTGTAAATC CTATAGGTGT GATGGGTGCA ACAAAACTTC TTGCAGAAAA ATTAATTGCC GCTGAAGCAT ATCATTCAAA ATCAAAGACA GTTTTTTCCT CTGTACGATT TGGAAATGTA TTTCATACTA GAGGTTCAAT ATTACCTAAG ATAGAAAAAC AAATTCAAAA TGGTGGTCCT TTAACATTAA CTGATGAAAG AATGAAACGA TTTTTTATGA CTAAAGAAGA TGCAGTTGAT TTAATTTTAA ACGCAGCTTA TACTGCTAAA GGTGGAGAAA CTTTTATTCT CAAAATGCCT ATGCTGAATT TAAAAGATCT TTTTGAAGCA ATGAAAATTG TAATTGGTCC AAAACATGGG TATTCATCAA CAAAAATAAA AACAAAAATT ACTGGAATTA GACCTGGTGA AAAATTAACC GAATATCTAT TAACAAATTT TGAAATGGAA CATTGTTTAG AAACAAAGAA TTTTTTTATA ATTCCTAAAA TGTTTGAATC TTTAGATCCC AAAAAATATC CTGGTTCAAA AAAACCAAAG AACACAACAA AGTATTTCGA AACTGTTAAA CCAATTTCAC AAGAACAGAT TGTCAAACTT TTAAAAAAAA TTTATTAA
|
Protein sequence | MQSILNKKNI LVTGGTGSIG QALVQRAISD GAKHIKVFSN DENALYEMEL DFSKHKNIEY IIGDIRDFDK INSIVKNCDI IFHAAALKHV DRCELYPLET MTVNIIGTNN VAKAAVNANV SKVISISTDK AVNPIGVMGA TKLLAEKLIA AEAYHSKSKT VFSSVRFGNV FHTRGSILPK IEKQIQNGGP LTLTDERMKR FFMTKEDAVD LILNAAYTAK GGETFILKMP MLNLKDLFEA MKIVIGPKHG YSSTKIKTKI TGIRPGEKLT EYLLTNFEME HCLETKNFFI IPKMFESLDP KKYPGSKKPK NTTKYFETVK PISQEQIVKL LKKIY
|
| |