Gene Information |
Locus tag | Mext_2536 |
Symbol | |
ID | 5833220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2846467 |
End bp | 2847936 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641368337 |
Product | sulfatase |
Protein accession | YP_001640001 |
Protein GI | 163851958 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
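The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(2846467, 2847936)
print(length_bp)                  # 1470
print(protein_length(length_bp))  # 489
```

Under those assumptions the results agree with the Gene Length (1470 bp) and Protein Length (489 aa) fields.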
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCTCG CGACCTATTT TGTGCCCCTC GCGGTCGCGC TGACCGGTGC CTTTGCGATC GAGGCGGCGG CCTCGCCGCA GCGCCCGAGC CTCCGGCCCG GCGATCTGGC GATCCGGGCC GCCGGCTACG CGCTGATCAC GCTGTTCTGG TTCCAGTTCT CGTGGCGACC CTGGCTCGCG GCCTCCTCCT GCCTGCTGAC GCTCGCCATC CTCTCGATGG TGGACCGGCT GAAGCGCCGG GTGATCGGTG AGCCGGTGGT GTTCAGCGAC GTCGCGCTCC TCGCCCAGGT GCCGCGCCAC CCGCAGCTCT ACTACACCCT GCCGCTCACG GACCTGCGGA TCGCCGGGCC TCTGCTGCTC GCCATCGCCA CCGTCGTCGC TTGGTACGTC CTGGAGCCTG CCGCTTTGCC GAGCGGGGCG GGCGCATCCC TCCTCGCGAT CCTCGGCCTG TCGGTGGCGC TCGCGGCTCT GGCGCTCGCG GGCTTGACGC CGCTCGGGCA GCGGGCGTTG CGCCCCATCT TCCCGCATCC GGACCTCGCC CGTGATGTCG CCCGCTACGG GCTCGTCGCG ACGATGATGG GCTACGCGCT GCGCCGCCTC GGCGAAGACG ATGCGCGGCC GGTGCCAGCG CGTGGCGAAG GCAGCTCCGA CGACGAGGTC GTCGTCGTGG TCCAGCTCGA ATCCTTCCTC GACCCCGCCC GCCTCGGCGG TCCGGATCTC CCGGTGATGG CGCGGATCAG AGCGCAAGCG GCGCAGTACG GCCGCCTCGC GGTCCCCGCG CACGGCGCCT ACACCATGCG CTCCGAGCAC GCCGTCCTTA CCGGCCTCGA CCCGGAAAGC CTCGGCTTTG GCCGCTACGA TCCCTATCTC GCGCGCAAGG GCGAGGAGCC GACCAGCCTC GCCCGGCTCG CCCGCGCCGC CGGGTTCGAG ACCGTGTTCG TGCACCCCTT TCACCGCGAC TTCTTCGACC GGGCTCGGGT GTTTCAGCGC CTCGGCTTCG AGCGCCTCGT CATGGAGGAG GATTTCGCCG GCGCGCCTCG GATCGGCCCC TATATCGGCG ACGTCGCCGT CGCCGAGCGC ATTCTGGCGG AGGTAGCAAA GGGCCGGGGG AGCGCAAAAG GCCGCACGTT CGTGTTCTCC GTCACGATGG AGAATCACGG CCCCTGGAAG CCGGGCCGGC TCACCGGCAT CGACGAGCCG CTGGCGCAAT ACCTCCACCA CGTCGCCCAT ACCGGCCAAG CGATCGAACG GCTGATCGAC GGACTCGCCG GCCGACGGGC GACGCTCTGC GTGTTCGGCG ATCATGCGCC CTCGCTGCCC GACCTCCGTC CGCCGGAATC CGGCCCGATG GCGACGGACT ACGCGCTATT CCGTTTCGGC CGGGACGGCA GCCCACCCCC CGCCCGGATC GATCTCACCG CGGCACAACT CGGCTGTGTC CTGCGGGACG CCGTCACCCC GAAACGTTGA
Protein sequence | MSLATYFVPL AVALTGAFAI EAAASPQRPS LRPGDLAIRA AGYALITLFW FQFSWRPWLA ASSCLLTLAI LSMVDRLKRR VIGEPVVFSD VALLAQVPRH PQLYYTLPLT DLRIAGPLLL AIATVVAWYV LEPAALPSGA GASLLAILGL SVALAALALA GLTPLGQRAL RPIFPHPDLA RDVARYGLVA TMMGYALRRL GEDDARPVPA RGEGSSDDEV VVVVQLESFL DPARLGGPDL PVMARIRAQA AQYGRLAVPA HGAYTMRSEH AVLTGLDPES LGFGRYDPYL ARKGEEPTSL ARLARAAGFE TVFVHPFHRD FFDRARVFQR LGFERLVMEE DFAGAPRIGP YIGDVAVAER ILAEVAKGRG SAKGRTFVFS VTMENHGPWK PGRLTGIDEP LAQYLHHVAH TGQAIERLID GLAGRRATLC VFGDHAPSLP DLRPPESGPM ATDYALFRFG RDGSPPPARI DLTAAQLGCV LRDAVTPKR
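The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for working with them is shown below: it joins the blocks into one string, computes the GC fraction, and checks for an ATG start codon and a terminal stop codon. The function names are hypothetical, and the demo input is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the whole-gene GC content field. The stop codon set used is the standard TAA/TAG/TGA, which also applies to translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if seq starts with ATG, ends with a stop codon, and is a multiple of 3 long."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Demo on a fragment only: the first three blocks of the gene sequence.
fragment = join_blocks("ATGAGCCTCG CGACCTATTT TGTGCCCCTC")
print(len(fragment))
print(round(gc_fraction(fragment), 3))
```

To check the full record, paste the complete Gene sequence value into `join_blocks` and compare `gc_fraction` with the GC content field and `len` with the Gene Length field.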