Gene Smed_0194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0194 
Symbol 
ID5321024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp213236 
End bp214828 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content60% 
IMG OID640789127 
Productsulfatase 
Protein accessionYP_001325888 
Protein GI150395421 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGCTCA TTCCCTTCCA ACTGCATGAC TACCCCTTAG CCTTGACCCT TGGCTGCTAT 
CTGCTGGCCT GCGCGGTGAT CTTCCATACG GACCGTTTCG CTCTGCCGGC GCGCGAACGC
CGGAAGAACG CACCTCGATA TGGTCGCCAT CACGACAGGA TAGACCGCCT CGCCCGGCTG
CCGGTCATCG CACTCGTTTT CGCCGGCTTT TTCGCTATTT CGTGGAGGCC GCTCTATGCC
GCTGCCGGAA CGATGAGTTT CTTCATCATC TTCACCGGCA TTTCCCGCGC CAAGTACAAG
TTCATCCGCG AGCCATTGGT CTTTTCCGAC ATCGCCCTGG TCGCGGACGT ATTCAAATAC
AAGTCCATCT TCTATGCGAC CTCACTGAAC GTCGTCTTCT GGATCGTCGC CTTCCTGTAC
GTGTTTGGCG TGTCGGGGCT CTATATGTAT TTCGAGCCTG CTATCCTGCC CGAGAGAAAC
CGGCTCTTCT GGGTTCTCGT CTTGATCGGA ATTGCCGCCG GGCCCTGGGC CCTGCTGTTC
TACGGACCGG TCAACCGCCC GACAGCCGCT CTCGTGCAGA GGCTTGTGAA GGCGATCAAC
GTCAAGATCA ACACGGTGCG TTTCGGCACC TTCGCTTCCG TCGTCTTCCA CTTCATCATC
TGGCTCGGCG TCAAGCGCGA CAAGATCGTC GCCGAATTGT CGGGAATGCT GCGCGCCGCA
GTACACGACC TCATCGGTCA CGAGGAAGCC CCGCTCATCG TAGTATGGCA ATCGGAGTCC
TTCATCGACA TGCGGCACTT CGGCGTCGAT TCCATCAAGC TTCCGACGAT CGACCGGCTG
CGCAAGCAGG CGGTGCAATG GGGCCGATTG AGCAATGTCT TCGAAGGCGG ATATACGCTG
CGGACCGAGT TTGCGGTCCT CAGCGGCCTC GTTCCCGACG ATATTCACGT CGACGCAAGC
TATCCCTATC TCCGCGCCGC GCACTATGCC GACGTCGTCT GGCCGGGAAA GCTGAAGCGT
GCCGGTTGGC GCACGCATTT CATCCACCCC TACGACCGGA CATTCTTCCT GAGGCATAAG
GCAATGCCCC TTCTCGGATT CGAGAAGCTG ACCATGCTCG ATGCCTTCGA CCACAATCCG
GAGCGTGACG GACTCTATGT CTCCGACGCG ACGCTGGCGG CGCGCGTGCT GAGCGAGGTC
CAGAAGCTGC CGGAAGAGGA AAGCGGTTTC TTCTTCGTCG CATCAATGGC CAACCACGGC
CCCTGGGAGC CAGGACGTGT CGGAACGCTC ACCAACCCGG TCGACATCTA TCTGGCAATT
CTCGAGCAGT CGGACGCCGC GCTGAAGCAG TTGGTCGACG GCCTCAACAA GCTCGACCGG
CCGGTCTGGC TCGTCTTCTA TGGCGACCAT GCGCCCCTTC TGAAGTCTTT CGCGGACCCC
TTCCCGGATC CCCGCTCGGA TTATTTCATC GTGCCGCTCG CCAAGGCGCG CGCTTCGGCC
CATAGCTCGA AGCAAGCGAA AGACGAGGAT CCCTGGAACC TGCTCGGGTC CATGCTGAAG
CACGCCAATC TGCACAAGGA CGCGCTGCAA TAG
 
Protein sequence
MALIPFQLHD YPLALTLGCY LLACAVIFHT DRFALPARER RKNAPRYGRH HDRIDRLARL 
PVIALVFAGF FAISWRPLYA AAGTMSFFII FTGISRAKYK FIREPLVFSD IALVADVFKY
KSIFYATSLN VVFWIVAFLY VFGVSGLYMY FEPAILPERN RLFWVLVLIG IAAGPWALLF
YGPVNRPTAA LVQRLVKAIN VKINTVRFGT FASVVFHFII WLGVKRDKIV AELSGMLRAA
VHDLIGHEEA PLIVVWQSES FIDMRHFGVD SIKLPTIDRL RKQAVQWGRL SNVFEGGYTL
RTEFAVLSGL VPDDIHVDAS YPYLRAAHYA DVVWPGKLKR AGWRTHFIHP YDRTFFLRHK
AMPLLGFEKL TMLDAFDHNP ERDGLYVSDA TLAARVLSEV QKLPEEESGF FFVASMANHG
PWEPGRVGTL TNPVDIYLAI LEQSDAALKQ LVDGLNKLDR PVWLVFYGDH APLLKSFADP
FPDPRSDYFI VPLAKARASA HSSKQAKDED PWNLLGSMLK HANLHKDALQ