Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4559 |
Symbol | |
ID | 5835571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 5091012 |
End bp | 5091803 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641370353 |
Product | histidinol-phosphate phosphatase |
Protein accession | YP_001641998 |
Protein GI | 163853955 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0522743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCG TCGATCTCGC CCAGTTCATG GAAGACCTCG CCACCCAGTC CGGAGCGGCA ATCCTGCCGT TCTTCCGGGC GCATTTCGGC CTGGACGACA AGTCCCACGG GACGGGCCAC GCCTTCGACC CGGTCACCGA GGCGGACCGC GCGGCGGAAG CGGTGATGCG GCGGATGATC AACGACCGGC TGCCGAACCA CGGCATCCTC GGCGAGGAAT TCGGCTCCGA GCGGGCGGAT GCGGAATGCG TCTGGGTGCT CGACCCGATC GACGGCACCC GCGCCTTCAT CAGCGGCCTG CCGACCTGGG GCACGCTGAT CGGGCTGACC CATCACGGCG CGGCGGTGCG TGGCCTGATG CACCAGCCCT ATCTCGGCGA GCGCTTCCTC GGCGACGGCA AGACCGCGAG CGTACGCTCC GCGAAGGGCG AACGCCCCCT CCACACCCGC CGCAACGAGG CGCTCGGCAA CGCCATCCTC GCCACCACCG ACCCGCGCCT GTTCGCGCAA GGGGAGGAGG CCGAGCGATT CCGGACGATC GAGGGGCAGG TGAAGATGTC CCGCTACGGC ACCGATTGCT ACGCCTATTG CATGCTCGCC GCCGGCCAGA TCGACCTCGT GGTCGAAGCG GGGCTGAAGC CCTACGACAT CGTCGCGCTG ATCCCCATCG TCGAGGGCGC GGGCGGCCTC GTCACGAGTT GGGACGGCGG CCCGGCCACC GGCGGCGGCC GAATCGTGGC CGCCGGAGAC CGCCGGCTGC ACGAGGCGGC GCTGAAGGTG CTCAACCCGT AG
|
Protein sequence | MSVVDLAQFM EDLATQSGAA ILPFFRAHFG LDDKSHGTGH AFDPVTEADR AAEAVMRRMI NDRLPNHGIL GEEFGSERAD AECVWVLDPI DGTRAFISGL PTWGTLIGLT HHGAAVRGLM HQPYLGERFL GDGKTASVRS AKGERPLHTR RNEALGNAIL ATTDPRLFAQ GEEAERFRTI EGQVKMSRYG TDCYAYCMLA AGQIDLVVEA GLKPYDIVAL IPIVEGAGGL VTSWDGGPAT GGGRIVAAGD RRLHEAALKV LNP
|
| |