Gene Mext_2536 details

Gene Information

Locus tag: Mext_2536
Symbol:
ID: 5833220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2846467
End bp: 2847936
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 71%
IMG OID: 641368337
Product: sulfatase
Protein accession: YP_001640001
Protein GI: 163851958
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
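
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS length includes one terminal stop codon.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2846467
end_bp = 2847936
gene_length_bp = 1470
protein_length_aa = 489


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons check the record's stated lengths against its coordinates.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the stated gene length and protein length agree with the start and end positions.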


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTCG CGACCTATTT TGTGCCCCTC GCGGTCGCGC TGACCGGTGC CTTTGCGATC 
GAGGCGGCGG CCTCGCCGCA GCGCCCGAGC CTCCGGCCCG GCGATCTGGC GATCCGGGCC
GCCGGCTACG CGCTGATCAC GCTGTTCTGG TTCCAGTTCT CGTGGCGACC CTGGCTCGCG
GCCTCCTCCT GCCTGCTGAC GCTCGCCATC CTCTCGATGG TGGACCGGCT GAAGCGCCGG
GTGATCGGTG AGCCGGTGGT GTTCAGCGAC GTCGCGCTCC TCGCCCAGGT GCCGCGCCAC
CCGCAGCTCT ACTACACCCT GCCGCTCACG GACCTGCGGA TCGCCGGGCC TCTGCTGCTC
GCCATCGCCA CCGTCGTCGC TTGGTACGTC CTGGAGCCTG CCGCTTTGCC GAGCGGGGCG
GGCGCATCCC TCCTCGCGAT CCTCGGCCTG TCGGTGGCGC TCGCGGCTCT GGCGCTCGCG
GGCTTGACGC CGCTCGGGCA GCGGGCGTTG CGCCCCATCT TCCCGCATCC GGACCTCGCC
CGTGATGTCG CCCGCTACGG GCTCGTCGCG ACGATGATGG GCTACGCGCT GCGCCGCCTC
GGCGAAGACG ATGCGCGGCC GGTGCCAGCG CGTGGCGAAG GCAGCTCCGA CGACGAGGTC
GTCGTCGTGG TCCAGCTCGA ATCCTTCCTC GACCCCGCCC GCCTCGGCGG TCCGGATCTC
CCGGTGATGG CGCGGATCAG AGCGCAAGCG GCGCAGTACG GCCGCCTCGC GGTCCCCGCG
CACGGCGCCT ACACCATGCG CTCCGAGCAC GCCGTCCTTA CCGGCCTCGA CCCGGAAAGC
CTCGGCTTTG GCCGCTACGA TCCCTATCTC GCGCGCAAGG GCGAGGAGCC GACCAGCCTC
GCCCGGCTCG CCCGCGCCGC CGGGTTCGAG ACCGTGTTCG TGCACCCCTT TCACCGCGAC
TTCTTCGACC GGGCTCGGGT GTTTCAGCGC CTCGGCTTCG AGCGCCTCGT CATGGAGGAG
GATTTCGCCG GCGCGCCTCG GATCGGCCCC TATATCGGCG ACGTCGCCGT CGCCGAGCGC
ATTCTGGCGG AGGTAGCAAA GGGCCGGGGG AGCGCAAAAG GCCGCACGTT CGTGTTCTCC
GTCACGATGG AGAATCACGG CCCCTGGAAG CCGGGCCGGC TCACCGGCAT CGACGAGCCG
CTGGCGCAAT ACCTCCACCA CGTCGCCCAT ACCGGCCAAG CGATCGAACG GCTGATCGAC
GGACTCGCCG GCCGACGGGC GACGCTCTGC GTGTTCGGCG ATCATGCGCC CTCGCTGCCC
GACCTCCGTC CGCCGGAATC CGGCCCGATG GCGACGGACT ACGCGCTATT CCGTTTCGGC
CGGGACGGCA GCCCACCCCC CGCCCGGATC GATCTCACCG CGGCACAACT CGGCTGTGTC
CTGCGGGACG CCGTCACCCC GAAACGTTGA
 
Protein sequence
MSLATYFVPL AVALTGAFAI EAAASPQRPS LRPGDLAIRA AGYALITLFW FQFSWRPWLA 
ASSCLLTLAI LSMVDRLKRR VIGEPVVFSD VALLAQVPRH PQLYYTLPLT DLRIAGPLLL
AIATVVAWYV LEPAALPSGA GASLLAILGL SVALAALALA GLTPLGQRAL RPIFPHPDLA
RDVARYGLVA TMMGYALRRL GEDDARPVPA RGEGSSDDEV VVVVQLESFL DPARLGGPDL
PVMARIRAQA AQYGRLAVPA HGAYTMRSEH AVLTGLDPES LGFGRYDPYL ARKGEEPTSL
ARLARAAGFE TVFVHPFHRD FFDRARVFQR LGFERLVMEE DFAGAPRIGP YIGDVAVAER
ILAEVAKGRG SAKGRTFVFS VTMENHGPWK PGRLTGIDEP LAQYLHHVAH TGQAIERLID
GLAGRRATLC VFGDHAPSLP DLRPPESGPM ATDYALFRFG RDGSPPPARI DLTAAQLGCV
LRDAVTPKR
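
The sequences in this record are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so its GC fraction is not expected to match the whole-gene figure in the record.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGCCTCG CGACCTATTT TGTGCCCCTC GCGGTCGCGC TGACCGGTGC CTTTGCGATC"
sequence = clean_sequence(first_line)

print(len(sequence))
print(f"GC fraction of this line: {gc_fraction(sequence):.2f}")
```

Passing the full gene sequence through the same two functions would give values to compare against the Gene Length and GC content fields.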