Gene Information |
Locus tag | Mchl_5019 |
Symbol | |
ID | 7115017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 5365016 |
End bp | 5365807 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643527713 |
Product | histidinol-phosphate phosphatase |
Protein accession | YP_002423712 |
Protein GI | 218532896 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
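
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the stated 792 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(5365016, 5365807)
print(length)                  # 792
print(protein_length(length))  # 263
```

Both values agree with the Gene Length and Protein Length rows of the record.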
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.317763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.049005 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCG TCGATCTCGC CCAGTTCATG GAAGACCTCG CCACCCAGTC CGGAGCGGCG ATCCTGCCGT TCTTCCGGGC GCATTTCGGC CTGGACGACA AGTCCCACGG CACGGGCCAC GCCTTCGACC CGGTCACCGA GGCGGACCGC GCGGCGGAAG CGGTGATGCG GCGGATGATC AACGACCGGC TGCCGAACCA CGGCATCCTC GGCGAGGAAT TCGGCTCCGA GCGGGCGGAT GCGGAATGCG TCTGGGTGCT CGACCCGATC GACGGCACCC GCGCCTTCAT CAGCGGCCTG CCGACCTGGG GCACGCTGAT CGGGCTGACC CATCACGGCG CGGCGGTGCG CGGCCTGATG CACCAGCCCT ATCTCGGCGA GCGCTTCCTC GGCGACGGCA AGACCGCGAG CGTGCGCTCC GCGAAGGGCG AACGCCCCCT CCACACCCGC CGCAACGAGG CACTCGGTAA CGCCATCCTC GCCACCACCG ACCCGCGCCT GTTCGCGCAA GGGGAGGAGG CCGAGCGGTT CCGGACGATC GAGGGGCAGG TGAAGATGTC CCGCTACGGC ACCGATTGCT ACGCCTATTG CATGCTCGCC GCCGGCCAGA TCGACCTCGT GGTCGAAGCG GGGCTGAAGC CCTACGACAT CGTCGCGCTG ATCCCCATCG TCGAGGGCGC GGGCGGCCTC GTCACGAGTT GGGACGGCGG CCCGGCCACC GGCGGCGGCC GAATCGTGGC CGCCGGAGAC CGCCGGCTGC ACGAGGCGGC GCTGAAGGTG CTCAACCCGT AG
|
Protein sequence | MSVVDLAQFM EDLATQSGAA ILPFFRAHFG LDDKSHGTGH AFDPVTEADR AAEAVMRRMI NDRLPNHGIL GEEFGSERAD AECVWVLDPI DGTRAFISGL PTWGTLIGLT HHGAAVRGLM HQPYLGERFL GDGKTASVRS AKGERPLHTR RNEALGNAIL ATTDPRLFAQ GEEAERFRTI EGQVKMSRYG TDCYAYCMLA AGQIDLVVEA GLKPYDIVAL IPIVEGAGGL VTSWDGGPAT GGGRIVAAGD RRLHEAALKV LNP
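
The relationship between the two sequence rows can be illustrated with a short sketch. It assumes the gene sequence is listed in its coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand), and it uses the amino acid assignments of translation table 11, which match the standard genetic code. Handling of alternative start codons is omitted because this gene starts with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# NCBI ordering: codons enumerated over T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGAGCGTCG TCGATCTCGC CCAGTTCATG"))  # MSVVDLAQFM
```

Passing the full gene sequence to `translate` and `gc_content` in the same way gives values that can be compared with the Protein sequence and GC content rows.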