Gene Smed_2065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2065 
Symbol 
ID5322924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2116263 
End bp2117891 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content60% 
IMG OID640791002 
Productsulfatase 
Protein accessionYP_001327733 
Protein GI150397266 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0577602 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCAA CCGAGAGAGC CTTGATGAGC AAGCGCCCCA ATATCGTCCT CGTACTGGCT 
GATGACATGG GTTTTTCCGA CCTCGGCTGC TATGGCGGAG AAATCTCGAC GCCCAATCTG
GACAGCCTCG CGCGTCGCGG CGCGCGCTTC ACGCAGTTCT ATAACACGGC GCGATGCAGC
CCATCGCGTG CGTCGCTTCT GACCGGGCTT CATCCTCACC AGACCGGCAT CGGTATTCTG
ACCAACAACG ACTTGCCGCG AGGCTATCCG GGTAACCTGA ACCTGCGATG CGCCACGCTG
GCGGAAATGC TGAAAGCTGC CGGATATGCG ACATGCCTCT CGGGGAAATG GCACCTGGCG
AGCGAAATGC ACGAACCGAA CGATACCTGG CCGACGAGAC GCGGTTTCGA CCGGTTCTTC
GGCACGCTCA CCGGCTGCGG CAGCTTCTAT ACGCCCGGAA CGCTGACCCG CGGCGAATGC
GACGCCTCGG CCGAAGCACT CGACCCGGCA TTCTTCTATA CCGACGCCAT CGCCTCTCAT
GCCGCGGAAT TCGTCACCGA ACAGTCCGCG GCAGGCAATC CGTTCTTTCT CTACGCCGCC
TTCACCGCTC CCCATTGGCC GCTTCATGCA CATCCCGGCG ATATCGACCG TTATCGGGGG
CGCTTTGACG AAGGCTGGGA CGTTCTGCGC GAAAAGCGGA TGAAGCGGCT GGTCGAGGAA
GGAATTCTTA CGGCGAGCAC CGCAATCAGC GCGCGCGATC CCACGCAGCC CGCCTGGTCT
GATACGAAGG AAAAGGCTTG GCAAGTCAAA CGAATGCAGG CCTATGCTGC CCAAATCGAG
CGAATGGACC GCGGGATCGG CAAAATCATC GAGGCACTTA AGACCGGCGG CACCTTCGAA
AACACGGCTT TCATCTTCCT GTCCGACAAT GGAGCATCGC CGGAGGATCT GCCGCAATTC
GACGCCGAAA AATTCATGCG GCGAACGGAC ATTCTTCCAC GGGCGACGCG CGATGGACTA
CCGATGCGTG TCGGCAATAC TCCCGATATC TGCCCTGGTG CCGAAGACAC ATATTCCAGC
TATGGCCGTG CCTGGGCAAA CCTGTCCAAT ACGCCCTTCC GCTTCTACAA ACGGTGGGTG
CATGAAGGCG GCATCGCCAC GCCATTGATT GTCCATTGGC CCGCAGGAGG ACTCGATTGC
GGCGCGATCC TCGATCAACC CGCCCAGCTC GTCGATATCG CCCCAACCAT TCTGGAAGTG
ACCGGTGCAA GCTATCCGCT TCAGGCTATC GGCCGGGAAA TCGCTCCGCT GGAAGGCTGC
AGCCTGCTTC CTGCATTGAA GGGCGAAATA CTCTTCGAGA GGCCGCTCTA CTGGGAACAC
ACGGGTAATG CCGCAATCCG CCTCGGACGA TGGAAGCTTG TTCGCGAAGA GCCAAATGGC
TGGGAACTTT ACGACCTTGC AGCCGATCGC ACAGAGCTGA ACGACGTGGC GCCGGGCAAC
CCTGAGGTCG TCGCGGACCT CCGGGCAAAA TGGGAAGCCT GGGCAGAGCG CATCGGCGTC
ATCCCCTGGG AGGTAACGCT CGGCATTTAC GAGGAACGCG GTCTGCATCC GACCTGGGCA
GCCGGCTGA
 
Protein sequence
MQPTERALMS KRPNIVLVLA DDMGFSDLGC YGGEISTPNL DSLARRGARF TQFYNTARCS 
PSRASLLTGL HPHQTGIGIL TNNDLPRGYP GNLNLRCATL AEMLKAAGYA TCLSGKWHLA
SEMHEPNDTW PTRRGFDRFF GTLTGCGSFY TPGTLTRGEC DASAEALDPA FFYTDAIASH
AAEFVTEQSA AGNPFFLYAA FTAPHWPLHA HPGDIDRYRG RFDEGWDVLR EKRMKRLVEE
GILTASTAIS ARDPTQPAWS DTKEKAWQVK RMQAYAAQIE RMDRGIGKII EALKTGGTFE
NTAFIFLSDN GASPEDLPQF DAEKFMRRTD ILPRATRDGL PMRVGNTPDI CPGAEDTYSS
YGRAWANLSN TPFRFYKRWV HEGGIATPLI VHWPAGGLDC GAILDQPAQL VDIAPTILEV
TGASYPLQAI GREIAPLEGC SLLPALKGEI LFERPLYWEH TGNAAIRLGR WKLVREEPNG
WELYDLAADR TELNDVAPGN PEVVADLRAK WEAWAERIGV IPWEVTLGIY EERGLHPTWA
AG