Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1907 |
Symbol | |
ID | 4077404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2010784 |
End bp | 2011578 |
Gene Length | 795 bp |
Protein Length | 264 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638007223 |
Product | histidinol-phosphate phosphatase, putative, inositol monophosphatase |
Protein accession | YP_613902 |
Protein GI | 99081748 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACATC CTTCGACGCT GTCTGATCTG GACGTTGCCC GCCGCCTTGC CGAGGCTGCC CGTGCAGCAA TCCTTCCGCA TTTTCGCAAC AGCGCCACGG CAGCAGACAA CAAGCTGGAG GCAGGCTTCG ATCCGGTGAC GGTTGCAGAT CGCGCTGCCG AAGAAGCAAT GCGCGCGGTC CTGCGCAAGC TGCGCCCGGA CGACGGCGTG CTGGGCGAAG AGTTTGCGGC CACCACCGGC ACCAGCGGGC GGACATGGGT GCTCGATCCT ATTGATGGCA CACGTGGATT TATCAGTGGC ACCCCCACAT GGGGTGTTCT GATTGCGCTC TCGGACGCAA AAGGTCCAAT GCTGGGGATC ATTGATCAGC CCTATATCGG CGAGCGGTTC GAAGGCTGCC GCGGTCACGC GCAGGTGGTG GGGCCGCATG GCACCCGCCC GCTCCTGACA CGCAAGACCG AAGCGCTCTC CGAGGCGATT TTGTTCACCA CATTCCCCGA GGTTGGAACA GAGGCAGAGC GCACGGGCTT TGAACAGGTG GCACAGCATG TGAAACTCAC GCGCTATGGC ATGGATTGTT ACGCCTATGC GCTCTTGGCG GCGGGGACCG TGGATCTTGT GATCGAGGCG GGGCTCAATG CTTATGATAT CCAGGCACCG ATTGCCGTGA TCGAAGCCGC AGGCGGTCTG GTGACCAACT GGCAGGGCGG GGCGGCCCAT GAGGGTGGGC GCGCCTTGGC TGCGTCCAAT CCGACCATCC ACGCCGCGGC GCTGGAGCTG TTGCGCGAGG CTTGA
|
Protein sequence | MTHPSTLSDL DVARRLAEAA RAAILPHFRN SATAADNKLE AGFDPVTVAD RAAEEAMRAV LRKLRPDDGV LGEEFAATTG TSGRTWVLDP IDGTRGFISG TPTWGVLIAL SDAKGPMLGI IDQPYIGERF EGCRGHAQVV GPHGTRPLLT RKTEALSEAI LFTTFPEVGT EAERTGFEQV AQHVKLTRYG MDCYAYALLA AGTVDLVIEA GLNAYDIQAP IAVIEAAGGL VTNWQGGAAH EGGRALAASN PTIHAAALEL LREA
|
| |