Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0724 |
Symbol | |
ID | 4810342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 879241 |
End bp | 880038 |
Gene Length | 798 bp |
Protein Length | 265 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640106141 |
Product | HisJ family histidinol phosphate phosphatase |
Protein accession | YP_001037152 |
Protein GI | 125973242 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1387] Histidinol phosphatase and related hydrolases of the PHP family |
TIGRFAM ID | [TIGR01856] histidinol phosphate phosphatase HisJ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGATT GTCATGTTCA CAGCAGCTTT TCAGGAGACA GCAACATGGA CCCGAAAATT GCCATTAATA CTGCCATGAA ATTGGGATTT GAAGGTATTT CGTTTACAGA TCATTTGGAT ATAGACTATC CGGATTATGA TGATGTATTC ATGATTGATT TTGACAAATA TTCTGAGGCC ATGGACAGGC TTAAGCGGGA TTATTCCGGC AAAATCAAAG TTCTCAAAGG TATTGAAGTA GGTATACAAC CCCATGTTAT AGAGGAATCA AATGATATTG TAAAAAAATA TGATTTTGAT ATTGTAATTG GCTCGATTCA CGTTGTTGAC AAAACTGACC TTCATAACGG CGATTTTTGC AGGAACAAAT CCAAAAATGA AGCCTATTTG AGATATCTTG AAGCAGTTTT GGAATGCATA AACCTTTTTG ATAATTTTGA TATTTTGGGC CATATTGATC TCATCAGAAG GTATGGATGT TATGATGACA AGACGTTAAA ATTGGACGAT TTCAAAGACA GAGTTGATGC CATACTCAAG GCTCTCGTTG AAAAAGGCAA AGGTTTGGAG GTAAATACCT CCGGCTTTCG ATACAATTTA GATTCGCCAA TGCCGGACTA TGAAATTATA AAAAGGTACA GAGAGCTAAA GGGTGAGATA ATTTGTACAA GCTCTGACGC CCATACCCCC GAGTATATTG GTTATAAATT CGACTATGTG AAAGAACTTG TTAAAAATGC GGGTTTTAAA TATACGGCGC ATTTTGAAAA CAGAAAACCT GTGTTTACAC CAATCTAA
|
Protein sequence | MYDCHVHSSF SGDSNMDPKI AINTAMKLGF EGISFTDHLD IDYPDYDDVF MIDFDKYSEA MDRLKRDYSG KIKVLKGIEV GIQPHVIEES NDIVKKYDFD IVIGSIHVVD KTDLHNGDFC RNKSKNEAYL RYLEAVLECI NLFDNFDILG HIDLIRRYGC YDDKTLKLDD FKDRVDAILK ALVEKGKGLE VNTSGFRYNL DSPMPDYEII KRYRELKGEI ICTSSDAHTP EYIGYKFDYV KELVKNAGFK YTAHFENRKP VFTPI
|
| |