Gene Information |
Locus tag | RoseRS_3700 |
Symbol | |
ID | 5210679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4626519 |
End bp | 4627295 |
Gene Length | 777 bp |
Protein Length | 258 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640597294 |
Product | histidinol-phosphate phosphatase, putative |
Protein accession | YP_001278005 |
Protein GI | 148657800 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
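The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption inferred from the numbers, not something the record states. A minimal sketch of the arithmetic, with illustrative function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4626519, 4627295)
print(length, protein_length_aa(length))  # 777 258
```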
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000574664 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0324911 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCTG AAACGCTTGA CGCGCTGCGC GAATTCGCCG CCGACCTGGC GTGGCATGCC GGTCGGTTGA CGCTGCGCTA TTTTCAAACC GGTCTCACGC CGGATATCAA GGACGATCAG ACGCCGGTGA CTGTTGCCGA TCGCGAGGCG GAGCGATTGA TGCGTCGCAT GATCGAGGAT CGCTATCCAC ACCACAGTAT TCTCGGCGAG GAGGAGGGCG AAACGCGCCC CGGCGCGTCG CACCGCTGGA TCCTCGACCC GATTGACGGC ACGAAATCGT TCGTTCAGGG CGTGCCGCTT TATGGTGTCC TGGTCGGGCT GGAGCGCGAC GGCGAATCGG TGGTCGGTGC GGTATCCTTC CCCGCGCTCG GTGATTTTCT GACCGCAGCG AAAGGGCAGG GATGTCTGTG GAACGGCAGG CGTGCGCGCG TCTCACCGGT CAGCGAGTTG CGCCAGGCGA CCCTTCTCTC CAGCGACGCC GAGAGCATGG CGCCGCATGG GCGCGAATCC GCCTACCGTC GTCTGGCAGG ATCGGTGCGT CTGGTGCGCA CCTGGGGCGA TGCCTACGGC TACAGCCTGG TCGCAACCGG TCGCGCCGAG ATCATGATTG ACCCGGTGAT GAGCGTCTGG GATTGCGCGG CGCTTTTTCC GATCGTGACC GAGGCGGGTG GCACATTCAC CGACTGGAAT GGTATTCCAA CGATCCACGC TGGCGAAGCA ATCGGAACCA ATAGCGTCCT GTTGGAACAG GTGTTGCAGG CGATCCGGCA GGGTTGA
Protein sequence | MASETLDALR EFAADLAWHA GRLTLRYFQT GLTPDIKDDQ TPVTVADREA ERLMRRMIED RYPHHSILGE EEGETRPGAS HRWILDPIDG TKSFVQGVPL YGVLVGLERD GESVVGAVSF PALGDFLTAA KGQGCLWNGR RARVSPVSEL RQATLLSSDA ESMAPHGRES AYRRLAGSVR LVRTWGDAYG YSLVATGRAE IMIDPVMSVW DCAALFPIVT EAGGTFTDWN GIPTIHAGEA IGTNSVLLEQ VLQAIRQG
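The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for all non-start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Raises KeyError on characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGGCTTCTG AAACGCTTGA CGCGCTGCGC"))  # MASETLDALR
```

Applied to the full Gene sequence field, the result should match the Protein sequence field once its spaces are removed.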