Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1676 |
Symbol | |
ID | 8447275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 1842043 |
End bp | 1842855 |
Gene Length | 813 bp |
Protein Length | 270 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645040799 |
Product | histidinol-phosphate phosphatase |
Protein accession | YP_003201055 |
Protein GI | 258651899 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00393257 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.332197 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGTGA CTGACGATGT GGACCGCGAT CTGGCCCTGG CCCTGGACCT GGCCGATCTG GCCGACGAGA TCAGCCTGCC CCGCTTCGGG GCGGCCGACC TGCAGGTCAC CGCCAAGCCG GACCTGACCC CGGTCTCGGA CGCGGATCTG GCGGTGGAGA CCGCGCTGCG CGAGCGGATC GCGCGGGAGC GGCCCGACGA CGTCGTCGTC GGCGAGGAGT TCGGCGGAGC CGACCGCCCG GCCACCGGGC GCCGCTGGAT CATCGACCCG ATCGACGGGA CCAAGAACTT CGTCCGCGGG GTGCCGGTGT GGGCGACGCT GATCGCCCTG ATCGACCCGG CCGTGCACGC GGACCGGCCG GCGGTGGGTG TCGTGTCCGC GCCGGCGCTG GCCCGGCGCT GGTGGGCGGC CGCCGGCGGG GGTGCGCACG CCCGGTTCGC CGGCGGGCCG GCCCGGCGCT GCCGGGTGTC CGGGGTGAGC CGTCTGGCGG ACGCGTCGCT GTCGTATTCC GAGCCGGGGG AGTGGCAGGC CGCGGGCCGG CTGCGGCCCT TCCAGACCCT GGTGCAACGC TGCTGGCGGA CCCGCGCCTA CGGCGATTTC TGGTCCTACC TGCTGGTCGC CGAGGGAGCG GTGGACATCG CCGCCGAACC CGACCTGAGC CTGTGGGACG TGGCCGCGCT GATCCCGATC GTGGTCGAGG CCGGGGGTCG ATTCACGGCA ATCGACGGTC AGCCATCCGG CGGCAGCGGC GGGAGCGCCT TGGCCACGAA TGGCCTCCTG CATCCGGAGG TCATCTCGCT GCTGACCGGG TGA
|
Protein sequence | MPVTDDVDRD LALALDLADL ADEISLPRFG AADLQVTAKP DLTPVSDADL AVETALRERI ARERPDDVVV GEEFGGADRP ATGRRWIIDP IDGTKNFVRG VPVWATLIAL IDPAVHADRP AVGVVSAPAL ARRWWAAAGG GAHARFAGGP ARRCRVSGVS RLADASLSYS EPGEWQAAGR LRPFQTLVQR CWRTRAYGDF WSYLLVAEGA VDIAAEPDLS LWDVAALIPI VVEAGGRFTA IDGQPSGGSG GSALATNGLL HPEVISLLTG
|
| |