Gene Smed_3791 details

Gene Information

Locus tag: Smed_3791
Symbol:
ID: 5317992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 239958
End bp: 241604
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 60%
IMG OID: 640775604
Product: sulfatase
Protein accession: YP_001312537
Protein GI: 150375941
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
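
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(239958, 241604)
    print(length, "bp")                  # compare with the Gene Length field
    print(protein_length(length), "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the listed Gene Length (1647 bp) and Protein Length (548 aa).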


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.425915
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAG AACCGATTGC GGACCAGCGT GATCCACGCG AGCGCGCCAA TATCACGCGG 
CGCAGCATTC TGCTTGGCGG GACTGCCATT GCCGCGGCAT CGACCGTACT GACAACCGGC
GGCGCGCAGA CCGCCCAGGC GCAGACGCAA ACGCAAGCGC CAGATGCCGG CGGGTCCGCC
AAGCAGCCGA ACATCCTGGT CATCTGGGGC GATGACATCG GCACCTGGAA TATCAGCCAT
AATAATCGTG GCATGATGGG CTATAAGACG CCGAACATCG ACCGGATCGC TCAGGAAGGC
TTGTCCTTCA CGGATTATTA CGGACAGCAG AGCTGCACGG CCGGGCGCGC CGCTTTCATT
GGCGGTAATG TACCGGTGCG CACCGGCATG ACCAAGGTCG GCCTCCCCGG CGCGAAGGAA
GGCTGGCAGG AGACGGATGT CACGATGGCA ACCGTTCTCA AGAGCCAGGG CTACGCGACG
GGCCAGTTCG GCAAGAACCA CCAGGGGGAC AGGGACGAGC ACCTGCCGAC CAATCACGGC
TTCGACGAGT TTTTCGGCAA CCTGTATCAC CTCAATGCGG AGGAGGAGCC GGAGAACCGC
GACTATCCGA AGGACCCGGA ATTCCGCCGT CGATTTGGCC CGCGCGGCGT GATCCATTCC
TTCGCCGACG GCAAGATCGA GGATACCGGC GCGCTCACCA AAAAGCGGAT GGAAACGATT
GACGAGGAGT CCCTTGCTGC GGCCAAGGAT TTCATTACGC GCCAGAACCA GGCAGGCACG
CCCTTCTTCG TCTGGTGGAA CGGCACGCGC ATGCATTTCA GAACACATGT CAAGGCGGAA
CATGCCGGCA TATCGGGGCC AAGCGGCGAC GAGTATCACG ATGGTATGGT CGAACATGAC
ATGCATGTCG GCGAGTTGCT CAAGCTCCTC GACGAACTCG GTATCGCCGA AAATACTGTG
GTCATGTATT CGACCGACAA TGGACCTCAT TTCAACACAT GGCCGGATGC CGCCACGACG
CCGTTCAGGA GCGAAAAGAA CTCGAACTGG GAAGGCGCCT ATCGCGTTCC GGCTTTCGTG
CGCTGGCCGG CCCGGTTCCC GGCAGGCAAG ACTCTCAACG GTATCGTTGC TCATGAGGAT
TGGCTGCCTA CGTTTGGCGC GATTGCCGGT GCACCGGACG TCAAGGAAAA GCTCCTGGAG
GGTGTCGAAC TCAACGGCCG CCGCTATCGC AACTATATCG ACGGCTACAA TCAGCTCGAA
TATCTGGAAG GCAAGACCGA CCAGTCGCCG CGTCACGAGT TCTGGTACGT GAATGACGAC
GGGCAGGTCG TCGCCGCGCG CTACGACGAC TGGAAAGTGG TATTCCTCGA GAATCGCGGC
GAAGCCTTCG GCGTCTGGCG CGAACCCTTT ACCGAATTGC GCGTGCCGCT CTTGTTCAAC
CTGCGACGCG ACCCCTTCGA GAAGGCGCAG CACAACGCCA ACACCTATGA TGACTGGTTC
CTTGAACGGG CCTTCGTCGT GGTGCCGATC CAGGGTCTGG CGGCGAAGTT CCTGCAGACG
ATGAAGGAAT ATCCGCCAAG CCAGTCACCC GGTTCCTTCA ATCTCACCAA AATCGAAGAG
AGCCTGAAGG CCGGGATGAG AAACTAA
 
Protein sequence
MKKEPIADQR DPRERANITR RSILLGGTAI AAASTVLTTG GAQTAQAQTQ TQAPDAGGSA 
KQPNILVIWG DDIGTWNISH NNRGMMGYKT PNIDRIAQEG LSFTDYYGQQ SCTAGRAAFI
GGNVPVRTGM TKVGLPGAKE GWQETDVTMA TVLKSQGYAT GQFGKNHQGD RDEHLPTNHG
FDEFFGNLYH LNAEEEPENR DYPKDPEFRR RFGPRGVIHS FADGKIEDTG ALTKKRMETI
DEESLAAAKD FITRQNQAGT PFFVWWNGTR MHFRTHVKAE HAGISGPSGD EYHDGMVEHD
MHVGELLKLL DELGIAENTV VMYSTDNGPH FNTWPDAATT PFRSEKNSNW EGAYRVPAFV
RWPARFPAGK TLNGIVAHED WLPTFGAIAG APDVKEKLLE GVELNGRRYR NYIDGYNQLE
YLEGKTDQSP RHEFWYVNDD GQVVAARYDD WKVVFLENRG EAFGVWREPF TELRVPLLFN
LRRDPFEKAQ HNANTYDDWF LERAFVVVPI QGLAAKFLQT MKEYPPSQSP GSFNLTKIEE
SLKAGMRN
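
The gene and protein sequences above are printed in space-separated blocks of 10. The sketch below shows one way to strip that layout, compute the GC fraction, and translate the gene sequence so the result can be compared with the listed protein sequence and GC content. The helper names are hypothetical. The codon table is written out by hand in TCAG order; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may serve as start codons, which does not matter here because the gene begins with ATG.

```python
import itertools

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG x TCAG x TCAG order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):  # ignores an incomplete trailing codon
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_block = "ATGAAAAAAG AACCGATTGC GGACCAGCGT"  # start of the gene sequence
    print(translate(first_block))  # compare with the start of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values to check against the Protein sequence block and the GC content field.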