Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2198 |
Symbol | |
ID | 7978371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2257438 |
End bp | 2258250 |
Gene Length | 813 bp |
Protein Length | 270 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644799013 |
Product | histidinol phosphate phosphatase HisJ family |
Protein accession | YP_002950173 |
Protein GI | 239827549 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1387] Histidinol phosphatase and related hydrolases of the PHP family |
TIGRFAM ID | [TIGR01856] histidinol phosphate phosphatase HisJ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00182011 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAACAG ACTACCATAA CCATCTAGAA CGCGGAACGC TGACATTGGA CTATTTGCGG CAATTTACCG ATGAAGCGGC GCGGAAAGGA ATTCAACATT TCGGTATATC CGAACATGCC TATCATTTTT ATCAAACGAA AAATATTTTA TCGAATCCAT GGGTAGACGA ACGTCGCTAT TATGATATGG CAGATTATGT CCGTCTTTTC CATGAGGCGT GGGATGCCGG AATTGATGTA AAAATGTCTA TCGAAATGGA TTATACGCCA GGAAAACATG AAGAGATGGC GGCGTTTATT CGATCATATG AGTTTGATTA TGTCATCGGT TCCATTCACT GGATCGATGA TTTCGGCATT GACTTAGCGG AATATCGAAA GGAATGGGAA CGCCGGGATT TGTACGATAC ATATCGCAAA TATTTTGACC AAGTCGTTAC TTTAGCCGAG TCGAACTTAT TTGATATTAT CGGTCATATA GATCTGGTCA AAATATTTAA ATATGTTCCG GAGGATGAGG AATTTTTATT AGAGCAATAT GATCGTGCCA CAACAGCGCT TGCGAATTCC AAAACATGTG TGGAAATCAG CACTGCTGGA CTCCGCAAGC CAGTCGGTGA GCTATATCCG GATAAGCGGC TGTTGCAAAT GTGCTATGAT AAGGGAATTC CGATTGTGCT TTCGTCTGAT GCGCACGTTC CAGAGCACGT GGGGGCGGAT TACGACAAGG CGATTGCGCT GGCGAAAAGC GTCGGCTATA CGGAGCTGAT GACGTTTCAA AAAGGGGAGC GTAAAGCGGT GCCGCTTGGA TAA
|
Protein sequence | MLTDYHNHLE RGTLTLDYLR QFTDEAARKG IQHFGISEHA YHFYQTKNIL SNPWVDERRY YDMADYVRLF HEAWDAGIDV KMSIEMDYTP GKHEEMAAFI RSYEFDYVIG SIHWIDDFGI DLAEYRKEWE RRDLYDTYRK YFDQVVTLAE SNLFDIIGHI DLVKIFKYVP EDEEFLLEQY DRATTALANS KTCVEISTAG LRKPVGELYP DKRLLQMCYD KGIPIVLSSD AHVPEHVGAD YDKAIALAKS VGYTELMTFQ KGERKAVPLG
|
| |