Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0371 |
Symbol | |
ID | 5773357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 337790 |
End bp | 339022 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641316000 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_001581705 |
Protein GI | 161527879 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGATA AGATAGATAA GCACAGTATG AGAAACATCA TCTTTGTTGT ATCTATAATG ATTTTACTTG GAGGTACAAG TGCATATGCA GAATCTTTTC CAGAGATTGG GGTAAAAGTT GATGTGATTG CAGACAACCT CAAAATTCCA TGGGGAATTG ACTTTGCACA AGATGGACGA ATTTTCTTTA CAGAAAGACC AGGTACAGTA AATGTGATTG AAGATGGACA AGTAAGCCAG ATCATGTCTC GAGGTGTGGG AGGAGGAGAA GGCGGAATGC TAGGAATTGC ACTAGATCCA GAATTTGAGA AAAATCACTA CGTCTATGTG TACTATACGT ATAATGAACT ACTTGGAATC AAGAACAGAT TAGTACAATA TGTTGAATCA GACAACAAAC TAAATCATGA GAAAATTTTG CTTGAAGATA TTCCTGGTGC ACCGTATCAC GATGGTGGTC GAATAAAATT CGGACCAGAT GAAATGTTGT ATGTTACAAC AGGGGATGCA GTAGAACCAG AACTTTCACA AAACTTGAAT TCAGTTGCAG GAAAAATCTT GAGAATCAAG TCAGATGGAA CAATTCCTGA AGATAATCCG TTTGGTTCAG CAATCTACTC CATTGGACAT CGTAATCCGC AAGGAATTGC ATGGGACAAG TCTGGAAATT TAATTGCAAC AGAACATGGA CCTTCTGGAT GGCGTGGAGT TGCACATGAT GAAATCAATT GGATAGTATC AGGTGCAAAC TATGGATGGC CAGATGTTAT TGGTGATGAA ACATTAGAAG GTGCAACAAA TCCAATTTTG CATTCAGGTG ATGATACTTG GGCTCCTTCA GGTTCTACAT TTTACTATGG AGACGACATG CCAATGTTTG ATGGAAAATA TTTTGTTGCA GCACTTAAAG GACAACATAT TCACGTCATA GAATTTGATG AGAGTTACAA TGTGTTATTT CACGGAGAAT TATTTTCAGG AGAGTTTGGA AGAATTAGAG ATGTTGCAAA TGGTCCGGAT GGATTATACT TTATGACAAG TAATCAAGAT GGAAGAGGCA ATCCAAATCT CTACGATGAT AAAATTTTGA GAATTTCTCC ATTGTATAAC TATGAAAACA ATTCATGGGT ACAAAACATC TCAGAATGGT ACATGAAAGG AGAAATTTCA AAGGAAGAAT CAATTAATGC TCATTCATAT CTAATTGAAA GAGGAACAAT TTCTCAAAAT TAA
|
Protein sequence | MYDKIDKHSM RNIIFVVSIM ILLGGTSAYA ESFPEIGVKV DVIADNLKIP WGIDFAQDGR IFFTERPGTV NVIEDGQVSQ IMSRGVGGGE GGMLGIALDP EFEKNHYVYV YYTYNELLGI KNRLVQYVES DNKLNHEKIL LEDIPGAPYH DGGRIKFGPD EMLYVTTGDA VEPELSQNLN SVAGKILRIK SDGTIPEDNP FGSAIYSIGH RNPQGIAWDK SGNLIATEHG PSGWRGVAHD EINWIVSGAN YGWPDVIGDE TLEGATNPIL HSGDDTWAPS GSTFYYGDDM PMFDGKYFVA ALKGQHIHVI EFDESYNVLF HGELFSGEFG RIRDVANGPD GLYFMTSNQD GRGNPNLYDD KILRISPLYN YENNSWVQNI SEWYMKGEIS KEESINAHSY LIERGTISQN
|
| |