Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1561 |
Symbol | |
ID | 5774697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1429128 |
End bp | 1430219 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641317213 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_001582895 |
Protein GI | 161529069 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAAA GAATCTCAAT TGTAGCTGTT GTTGTTGCAA TTGCTTTTTC TGTATACACC TTAACACTTC CATCAGATCC TATTCCATTA CCAGAACCTA ACTTTAATTC TAAAAATGAC TCCTTTGATA TTTTAGCAGA GAATCTAGAA AAGCCTCGTT CAATTGCAGT ATCTGATAAT CGAATTTTTG TAACAGAAAA AGATGGTTCT ATTGTAGTAA TTGAAAACGA TATTCAACTA GAATCCCCAC TTGCTACTTT TCGTTCTGCT AATGTTTTTG ATGGTGGATT GTTAGGAATT GATTTACATC CAAATTTTGC AGAGAACCAT TACCTCTATG TATTTTTGAC TTATGAAGAA GATGAAGCAT TGTGGAATAA AATACTACGA ATAACCGAAT CTGAAAATAA ATTACAAGAT GCAGAAACTA TTTTTGATAA AATTCCAGGG TCTTCTTTTA CAAATGGTGG TTTTATCAAA TTTGGACCTG ATGGAAAGTT GTATGTAGGT ACAGGTGCTA CATCTGATTC ATCTCATTTA CCGCAAGATC TTGATTCACT TTCAGGTAAA ATTTTACGAA TAAACGATGA TGGTTCAATT CCTGACGATA ATCCTTTTTC AAATTCTCCT GTATACTCTT TAGGACACAG GAATCCTCAA GGAATGACTT GGGATAATAA TGGTAACTTG TATGTATCAG AATTTGGACC TGAAAAAAAT GATGAAATCA ATATAATTTT GGCAGGTAAG AATTATGGTT GGCCAGAACA AGAATGCTCT GGCAATGAAA GTTTTGAAAA TGCTGTTCTT TGTTATGATC CAAGCATAGA GCCTGGAGGA ATCTTGTACT ATACTGGTGA CAAATTCGAT TTTGAATTCC CTTTCATTAT GGCTTCAATG AGGGCATCAA ATGTCTATCA AGTAGATTTT GATGAGGGAT TGAGTTCTCA AAAATCTATT CTTAGTGGAA TTGGACGTGT TCGTGACGTG GTTCAAGGTC CTGATGGATA TCTCTATGTG ATTACTTCTA ACACTGATGG AAAAGGTTTT CCAGCTGCTA ATGATGATAA ATTATTGAGG ATATTGAAAT AA
|
Protein sequence | MDKRISIVAV VVAIAFSVYT LTLPSDPIPL PEPNFNSKND SFDILAENLE KPRSIAVSDN RIFVTEKDGS IVVIENDIQL ESPLATFRSA NVFDGGLLGI DLHPNFAENH YLYVFLTYEE DEALWNKILR ITESENKLQD AETIFDKIPG SSFTNGGFIK FGPDGKLYVG TGATSDSSHL PQDLDSLSGK ILRINDDGSI PDDNPFSNSP VYSLGHRNPQ GMTWDNNGNL YVSEFGPEKN DEINIILAGK NYGWPEQECS GNESFENAVL CYDPSIEPGG ILYYTGDKFD FEFPFIMASM RASNVYQVDF DEGLSSQKSI LSGIGRVRDV VQGPDGYLYV ITSNTDGKGF PAANDDKLLR ILK
|
| |