Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1608 |
Symbol | |
ID | 5772948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1467443 |
End bp | 1468774 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641317261 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001582942 |
Protein GI | 161529116 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAAG ATCAGGTATT TGAATTAGTT AGAAAAGCAA AAAGAGCATT TCCTGAATGG AAAAAAGATT ATGAAAAACG TAGAAGTTAC ATTTACAATC TAGTTGAGCA TTTAAAGAAA AACAAAACAG AGTTGGCAAA AATTGCAACT AAGGAAATGG GAAAAGCACT AAAAGAATCA ATTGGTGAAG TTGAGAAATG TGCTTGGGCC TTAGAATTTT ATGCAGACCA TGGAGATAGT TTTCTTTCTG ACGAAGTACT AAACACAGAT GCAAGAAAGA GTTTTCTAAC ATTTGAACCA CTTGGAGTAA TTGGTTCTAT CATGCCATGG AACTTTCCAT ATTGGCAAGC TCTAAGATTT GCAGCTCCAT GTTTGATGGC AGGAAATGTC ATTGTGATGA AACCATCTAG AGTCACAATG CAATCAGGAA TTGAAATTGA AAAAGCATTT GCAGATGCAG GAATACCTGA CGGAGTATTC TCAACAGTAG TTGGCAGTGT AGATTCTGCA AATCACCTTA TTGATTCAGA AGTTAATGCT GTGACATTTA CTGGAAGTAC AAATGCAGGA GCAAAAGTAG GTGAGAGGGC TGCTATGAAT CTTAAAAAAT GTGTTTTAGA ATTGGGTGGA AGTGATCCTT TCATAGTTTT AGATGACGCT ATTATTGAAA AGGCAGCTGA TGGTGCAGCA AAAGGCAGAT TCATCAATTG TGGCCAAAGT TGTGTAGCCT CAAAAAGATT CTTTGTAGGC AAGAACATTG CGGAAGATTT CATTGAATTA TTCATCAAAA AGGCATCTGA GCTCAAAGTC GGAGACCCAA TGTCAATTGA GACAGATATT GGACCGCTAT CAAGCAAAGA TGGATTAGAG ACAATTTCTG GAATTGTAGA AGATGCAAAA GCAAAAGGTG CTGAAGTATT ACTAGGCGGA GAAGAGATGG ATGGAAATGG ATATTTCTAC AAACCAACAA TTCTCACAAA CATCACACCA GACATGAGAA TTGCAAAAGA AGAGACATTT GGACCAGTTG CACCAATAAC AATTGTTGAA AATGAAAGTG ATGCAATCAA GATGGCAAAT GACAGTGAGT TTGGATTAGG TGCAAGTATT TGGACAAAAG ATCTTGCAAA AGCAGATAAA ATGTCAAGAA GAATCGAATC AGGAATTGTT AGTGTAAACA ATGTAGTAAT TTCAGATCCA AGAATTCCAT TTGGTGGAAT AAAACACAGT GGATTTGGAA GAGAATTATC AAGATATGGA ATGTTAGAAT TTGTAAATCT AAAATCGGTT AGATTCTATG ATAACTTGAC ACATCATCAT TACGTAGAAT AA
|
Protein sequence | MDKDQVFELV RKAKRAFPEW KKDYEKRRSY IYNLVEHLKK NKTELAKIAT KEMGKALKES IGEVEKCAWA LEFYADHGDS FLSDEVLNTD ARKSFLTFEP LGVIGSIMPW NFPYWQALRF AAPCLMAGNV IVMKPSRVTM QSGIEIEKAF ADAGIPDGVF STVVGSVDSA NHLIDSEVNA VTFTGSTNAG AKVGERAAMN LKKCVLELGG SDPFIVLDDA IIEKAADGAA KGRFINCGQS CVASKRFFVG KNIAEDFIEL FIKKASELKV GDPMSIETDI GPLSSKDGLE TISGIVEDAK AKGAEVLLGG EEMDGNGYFY KPTILTNITP DMRIAKEETF GPVAPITIVE NESDAIKMAN DSEFGLGASI WTKDLAKADK MSRRIESGIV SVNNVVISDP RIPFGGIKHS GFGRELSRYG MLEFVNLKSV RFYDNLTHHH YVE
|
| |