Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2972 |
Symbol | |
ID | 5835546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3318387 |
End bp | 3319382 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641368772 |
Product | metallophosphoesterase |
Protein accession | YP_001640432 |
Protein GI | 163852389 |
COG category | [R] General function prediction only |
COG ID | [COG1409] Predicted phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.992281 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGATA TTGACGCCGG CCCGCGTTGC CGCGATGCGG GGTCGCGTAT GTTCCGTCTT GCTCATCTCA CTGATCCCCA TGTCGGGCCG CTGCCGCGGC CGCGCCTGCG CCAGCTTCTC AGCAAGCGGG CGGCCGGCTA CGTGAACTGG CGCCGGGGCC GCAGCCGCCA CCACGACATG GACCTGCTCG GCGCCCTCAT CGCCGACCTG CACGGCCAGG GCGTCGACCA TGTGGCCTGC ACCGGTGACC TGTGCAATCT CGGCCTTCCC GACGAATGGG AGAGCGCGCG GGTGTTTCTC GAAGCGCTGG GTCCGGCGGA CCGCGTGAGC TTCGTGCCGG GCAACCACGA CGCCTATGTC CGTGGCTCGC TGGAAGGGCT GCTCGCCGCC TGCGGCGGCT GGACCGAGGC CGACGACGGG CAGATCCGCC TCTTTCCCTA TCTGCGGCGG CGCGGGCCGC TCGCCCTGGT GGGCCTGTCC TCGGCGATCC CGACCAAGCC GTTCGTCGCG AGCGGCCGGC TCGGGCCGGT GCAGATCGAG GCGGCCGAGC GCGTGCTGCG CGACCTCGCG ACAGCGCCGG ATCGGCCCTG CCGCGTGGTG ATGATCCACC ACCCGCCCCA TCCGGGCGGG GCGGCCTCGG GACGCGAATT GAAGGATGCG GCCGCCTTCG CCGCGATGAT CGGCCGGGCC GGGGCGGACC TGATCCTGCA CGGGCACAAT CATGTCGGCA CCGTGGCTCG GATCACCGGG CCCGACGGGC GTCCGGTGCC GGTCGTCGGC GCGCCCTCGG CTTCGGCGCG CACGCTGCTG ACCAACCGGC GTGCCTCCTA CTACCTCTAC ACGGTCACGC CGGGTGAGAA CGGCTTCCAG ATCGCGGTGA CCGAGCGCGG CCTCGACGAG GCCGGCGGCA TCGGCGAGCT GTCCGGCTTC GACATCGAGA CACCGCCCGC GGACCGGATC GGACTGGTCC ATCGGCGGCG CCAGCGCCAC ACCTGA
|
Protein sequence | MPDIDAGPRC RDAGSRMFRL AHLTDPHVGP LPRPRLRQLL SKRAAGYVNW RRGRSRHHDM DLLGALIADL HGQGVDHVAC TGDLCNLGLP DEWESARVFL EALGPADRVS FVPGNHDAYV RGSLEGLLAA CGGWTEADDG QIRLFPYLRR RGPLALVGLS SAIPTKPFVA SGRLGPVQIE AAERVLRDLA TAPDRPCRVV MIHHPPHPGG AASGRELKDA AAFAAMIGRA GADLILHGHN HVGTVARITG PDGRPVPVVG APSASARTLL TNRRASYYLY TVTPGENGFQ IAVTERGLDE AGGIGELSGF DIETPPADRI GLVHRRRQRH T
|
| |