Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0533 |
Symbol | |
ID | 4485116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | - |
Start bp | 567164 |
End bp | 567961 |
Gene Length | 798 bp |
Protein Length | 265 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639729300 |
Product | histidinol-phosphate phosphatase |
Protein accession | YP_872292 |
Protein GI | 117927741 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.389982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACC CAACAGCCGC GGACCTCGCG TTGGCTCTCC GGCTTGCCGA CCTCGCCGAT GAGATCAGCC TCAGCCGGTT CCAGGCAATG GATTTTCGGG TGGAGACCAA ACCGGACCTC ACCCCGGTGA GCGATGTCGA CCTGTCGGTC GAACGCGAGG TACGCCGCGT GCTGGCCGCC GATCGGCCCG GCGATGCGGT GCTCGGCGAG GAATTCGGCG GCGAGCCGGT GGACGGTCGG GTGTGGGTCA TCGATCCCAT CGACGCCACG AAGAATTTCG TCCGCGGCGT GCCGATTTGG GCGACGCTCA TTGCCCTGCT GGACGCCGGA GAACCGGTGA TCGGCGTCGT CAGCGCACCA GCCCTTGCGT CGCGGTGGTG GGCCGGCCGC GGTCTTGGCA GCTGGACGGC ACGACTGGGT GCGGCACCGC GGCGTAATCA GGTCTCTGCG GTCCGCAACC TCTCGGACGC GTCATTGTCG TACTCCGGGT TAGGCGGCTG GGGGACCCGA GTCTCTGACT TTCTCAACCT CACGAAGGCG GTCTGGCGCA CGCGTGCGTA CGGGGATTTC TTTTCGCACG TGTTGGTCGC CGAGGGCGCG GTCGATATTT CCGCCGAGCC GGAGGTGTCG CTCTGGGATA CCGCAGCACT CGTCGTCATC GTGACCGAAG CGGGTGGGCG GGTGACCGGC GTTGACGGGG GCCCGAGCCC GGCGGCGTCC AGCATCCTCT GCACCAACGC GTGGCTGCAC GACGCCGCCC TTTCTCACCT CGCGGGATCG GCGCGTCGCG GAGGCTGA
|
Protein sequence | MSDPTAADLA LALRLADLAD EISLSRFQAM DFRVETKPDL TPVSDVDLSV EREVRRVLAA DRPGDAVLGE EFGGEPVDGR VWVIDPIDAT KNFVRGVPIW ATLIALLDAG EPVIGVVSAP ALASRWWAGR GLGSWTARLG AAPRRNQVSA VRNLSDASLS YSGLGGWGTR VSDFLNLTKA VWRTRAYGDF FSHVLVAEGA VDISAEPEVS LWDTAALVVI VTEAGGRVTG VDGGPSPAAS SILCTNAWLH DAALSHLAGS ARRGG
|
| |