Gene Smed_1567 details


Gene Information

Locus tag: Smed_1567
Symbol:
ID: 5322425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1662853
End bp: 1664790
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 58%
IMG OID: 640790511
Product: sulfatase
Protein accession: YP_001327243
Protein GI: 150396776
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
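
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself, but both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1662853
end_bp = 1664790
gene_length_bp = 1938
protein_length_aa = 645

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the final codon is
# the stop codon and is not counted in the protein length.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

All three checks pass for this record: the span is 1938 bp, which is 646 codons, giving 645 amino acids plus a stop.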


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.116076
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.327443
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTGG AGACGCCGTT GGCCCGTATC GATTCCGCCG TGAGCAGCTC TGACGTCTAC 
GCCAATGGGC AAGTCTCTTC CGCAAGATAC GCCAGAACCC TGTCGAGCGC CCGCAGCGCC
CTATTCACGC TTCTGCTCGC GATCTCGGTC GTCTTCACCA TCGAACTTAT CGTGCGCTGG
TCGTGGCCCG ACACCGTCGC CTATTTTACC GATCCCATGC GGCCGGCCTG GACCACGGTT
GCCGTCTTCT TCCTCGCAAT GCTCGGCGTC GATGCCCTGT TCGGCCGGGA ACACAAGGCA
GCACTGCTCG TTGCGCCGCT TGCCGTGGTA CCCGCCTTCA TCAGTCAGCA GAAGCAGGTC
TTCCTGTCCG ATCCACTCTA CCCGACCGAT TTCCTCTTTG GCCGGCAGAT CATGGAACTG
ATGCCGGTTC TCGTAAAGGA TAGGCCCTGG ACTGCCGTCG GCGTCGTGGC CGGAATCATA
ATCGCGATCG TCGTTTCCGT CCTCCTTCTG CGATTTGCCT GGCGGAACTT CCCCAAGCTG
ACCCGCCGTG AACGTATGGC GCGCATCGCA TTTGCCTTGC CGTTGCTGGT AGCGTTCTGG
AACATCATGG ACTACAACCA GTTCTCCTGG ATTCGTGACC GGCTGCGGGT CATCCCCATC
ATGTGGGACC AGACCGAGAA CTATCGCCAC AACGGCTTTG CCCTGGCTTT CGCCATCAAC
CTCCCCATGG CCAATGTAAG CGCGCCGGCT GGCTACATGG CGGATGCGAT CGAGCGGATT
CCGGTCAAGC CGCTTCCCGC CGGTACGAGC CATCGCGGCA AGCCGGACGT GATCGTGCTC
ATGAGCGAAT CCTTCTGGGA CCCCACCCGT CTTCCCAAGG TGAAGCTGAC ACCCGATCCC
ATGCCGACGA TCCGCGAACT GCAGGGCGGC AACGTATTTT CTCCGGAGTT CGGTGGCATG
ACAGCCAATG TCGAATTCGA GGCGGTGACG GGTTTTTCCA ACGCGTTCCT TCCCTATGGC
AGCATTCCCT ACCAGCAATA TATACGAAAT CCGATCCCCT CGCTTGCCAC CTTCTTCCGC
AGTGAAGGTT ACGTCTCACG CGCCATTCAT CCTTTTCAGG GATGGTTCTG GAACCGCAAT
GCCGTCTACA AAGCCTTCGG TTTCGATATG TTCAAGTCGG AGGAGAACAT GCCGCCGATG
GCCAAGCGTG GCATCTTCGC CTCTGACGAG TCGTTGACGA AGGAGATCAT CCGCCAGGCA
GACGAGCTGG AAGACCCTTT CTTCTTCTTC GCCGTAACCC TGCAGGGCCA TGGTCCCTAT
GAGGCCAACC GATACGCGAA GAACACGATC AAGGTCGAAG GCGAGCTCTC CGACGCCGAT
CGTCAGGTAC TTGCGACCTA TGCTCAAGGC GTGAAGGAAG CCGATGACAG CCTCAAGATG
CTGATGGACT GGGCGAAAGA ACGGGACCGG GAGACGATCA TCGTTCTCTT CGGCGATCAC
CTGCCGCCGC TGAACACCGT CTATTCCAGC ACCGGCTACA TGAAGGGAAT CACGGCCGAG
CGGAAGGGAC CGAAGGATCA GATGAAGGCC GAGCACGAAA CACCGCTCGT CGTCTGGTCG
AACAAGACAG GTCCGAAAAA GAAGATCGGC ACGATCAGCC CGGCCTTTCT TTCCTATCAG
ATTCTGAAGC AGGCCGGATA TGAGCACCCC TACTACACCG GTTTCCTTGG AAAGGTTTAT
GATCACTACC GCGTCCTCGA CCGTTACATG CTGATCCGCA AGAACGGCAA GGATGTCGCC
GACTGGCTCC GCCAACGGAA GATACCGGCA TCGTTGCGTG ACTACCGCTT CCTGCAGCAC
GACATGATGT TCGGCAAGCG CTACAGCACC GAGCGCTTCT TCCAGTCCCA CGCCGATCTC
TACAGCGCCG GTTTGTAA
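
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the helper name `gc_percent` is hypothetical, and it assumes the sequence contains only A, C, G and T once the spaces and line breaks used for layout are removed.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # The record prints the sequence in space-separated blocks of 10,
    # so strip all whitespace before counting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Short demonstration on the first line of the gene sequence above.
first_line = "ATGAAATTGG AGACGCCGTT GGCCCGTATC GATTCCGCCG TGAGCAGCTC TGACGTCTAC"
print(round(gc_percent(first_line), 1))
```

Passing the full 1938 bp sequence to the same function should give a value comparable to the reported 58%, assuming that figure was computed over the whole gene.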
 
Protein sequence
MKLETPLARI DSAVSSSDVY ANGQVSSARY ARTLSSARSA LFTLLLAISV VFTIELIVRW 
SWPDTVAYFT DPMRPAWTTV AVFFLAMLGV DALFGREHKA ALLVAPLAVV PAFISQQKQV
FLSDPLYPTD FLFGRQIMEL MPVLVKDRPW TAVGVVAGII IAIVVSVLLL RFAWRNFPKL
TRRERMARIA FALPLLVAFW NIMDYNQFSW IRDRLRVIPI MWDQTENYRH NGFALAFAIN
LPMANVSAPA GYMADAIERI PVKPLPAGTS HRGKPDVIVL MSESFWDPTR LPKVKLTPDP
MPTIRELQGG NVFSPEFGGM TANVEFEAVT GFSNAFLPYG SIPYQQYIRN PIPSLATFFR
SEGYVSRAIH PFQGWFWNRN AVYKAFGFDM FKSEENMPPM AKRGIFASDE SLTKEIIRQA
DELEDPFFFF AVTLQGHGPY EANRYAKNTI KVEGELSDAD RQVLATYAQG VKEADDSLKM
LMDWAKERDR ETIIVLFGDH LPPLNTVYSS TGYMKGITAE RKGPKDQMKA EHETPLVVWS
NKTGPKKKIG TISPAFLSYQ ILKQAGYEHP YYTGFLGKVY DHYRVLDRYM LIRKNGKDVA
DWLRQRKIPA SLRDYRFLQH DMMFGKRYST ERFFQSHADL YSAGL
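
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library; NCBI translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The function name `translate` is illustrative, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codon table in NCBI order: each position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence; stop codons are written as '*'."""
    bases = "".join(dna.split()).upper()  # drop layout whitespace
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))


# First six and last six codons of the gene sequence above.
print(translate("ATGAAATTGG AGACGCCG"))  # MKLETP
print(translate("TACAGCGCCG GTTTGTAA"))  # YSAGL*
```

The two outputs match the start (MKLETP) and end (YSAGL, followed by a stop) of the protein sequence listed above.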