Gene Smed_1239 details

Gene Information

Locus tag: Smed_1239
Symbol:
ID: 5322086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1327067
End bp: 1328731
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 60%
IMG OID: 640790180
Product: sulfatase
Protein accession: YP_001326924
Protein GI: 150396457
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
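
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1327067
end_bp = 1328731


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for a complete CDS: codons minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 1665 554
```

Under these assumptions the computed values agree with the reported gene length (1665 bp) and protein length (554 aa).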


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0342863
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.143694
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGTA GAATTATCTG CGGCGTTGGA GCATTGGCTG CTTCCACCGT CCTTTGGGGC 
GCGCTTGCCC CAGTGCAAGC TCAGGAGACG AGAAAGCCCA ACATCCTGTT CATCGTGTCC
GACGATACCG GCTACGGCGA TCTCGGCCCT TACGGCGGCG GCGAGGGCCG CGGCATGCCT
ACGCCGAACA TCGACAAGCT GGCGGATGAA GGCATGACCT TCTTTTCCTT CTACGCCCAA
CCAAGTTGCA CGCCCGGCCG CGCTGCCATG CAGACCGGGC GCATTCCGAA CCGCAGCGGC
ATGACCACGG TCGCCTTCCA GGGCGAGGGC GGTGGGCTGC CGGCAGCCGA ATGGACGTTG
GCGTCTGTGC TGAAACGCGG CGGGTACCAG ACCTATTTCA CCGGCAAGTG GCATCTCGGC
GAGGCAGACT ACGCGCTGCC GATCGCACAG GGCTACGATG AAATGAAATA TGTCGGCCTC
TATCATCTCA ATGCCTACAC TTATGCGGAC GCAACCTGGT TCCCGGACAT GGACCCGGAA
CTCAGGGCCA TGTTCCAGAA AGTGACGAAA GGCTCGCTTT CCGGCAAAGC CGGCGGGGAG
GTGACGGAGG ACTTCAAAAT CAACGGACAA TACGTCGACA CTCCCGTGAT CGACGGTAAG
CCCGGTGTCG TCGGGATCCC GTTCTTCGAC GGCTATGTCG AAAAGGCGGC AATCGAATTC
CTCGATGCCG CCGCCAAAAA GCCGGATCAG CCGTTCTACA TCAACGTGAA CTTCATGAAG
GTGCACCAGC CGAATCTTCC TGCCCCGGAG TTCCAGCATA AGTCCTTGTC AAAGACCAAA
TATGCGGACT CCGTCGTGGA GCTCGACACG CGCATCGGCA GGATCCTGGA CAAGCTGCGT
GAAACCGGCA TGGACAGGAA CACCCTGGTT TTCTACACGA CCGACAACGG GGCCTGGCAG
GACGTCTATC CCGACGCCGG CTATACGCCG TTTCGCGGTA CCAAGGGCAC CGTGCGCGAG
GGCGGCAATC GCGTTCCCGC CATCGCGGTC TGGCCCGGCA AGATCAAGCC GAGTTCGAAG
AATCACGACA TCGTCGGCGG CCTTGACCTG ATGGCGACGT TCGCGGCAGC CGGCGGCGTC
CCGCTTCCTG ACAAGGACCG CGAAGACAAG CCGATCGTCT TCGACAGCTA CGACATGTCG
CCGGTTCTTC TCGGCACCGG CAAGTCGGCG CGCAAGTCCT GGTTCTACTT CACCGAGAAC
GAGTTGACGC CGGGCGCCGC CCGTGTCGGC AACTACAAGG CGGTGTTCAA CCTGCGGGGT
GACAACGGGC AATCGACCGG AGGATTGGCA GTCGATTCCA ATCTCGGCTG GAAGGGGGCG
GAGAGCTATG TGGCGACAGT CCCGCAGGTC TTTGATCTCT GGCAGGATCC GCAGGAACGC
TACGACATTT TCATGAACAA CTATACGGAG CACACCTGGA CGATGGTGAG CATCAGCAAC
GCCATCGAGG AGTTGATGAA GACCTATGTG CAGCATCCGC CGCGCAAGCT GCAGAGCGAA
TCTTACAGCG GACCGCTTAC GATCACCAGT TACCAGCGCT TTGAGTGGGC GCGCCAGCAA
CTTGGGAAGG AAGGCGTGAA CATCCCGCTA CCGACAGGCA ACTGA
 
Protein sequence
MNRRIICGVG ALAASTVLWG ALAPVQAQET RKPNILFIVS DDTGYGDLGP YGGGEGRGMP 
TPNIDKLADE GMTFFSFYAQ PSCTPGRAAM QTGRIPNRSG MTTVAFQGEG GGLPAAEWTL
ASVLKRGGYQ TYFTGKWHLG EADYALPIAQ GYDEMKYVGL YHLNAYTYAD ATWFPDMDPE
LRAMFQKVTK GSLSGKAGGE VTEDFKINGQ YVDTPVIDGK PGVVGIPFFD GYVEKAAIEF
LDAAAKKPDQ PFYINVNFMK VHQPNLPAPE FQHKSLSKTK YADSVVELDT RIGRILDKLR
ETGMDRNTLV FYTTDNGAWQ DVYPDAGYTP FRGTKGTVRE GGNRVPAIAV WPGKIKPSSK
NHDIVGGLDL MATFAAAGGV PLPDKDREDK PIVFDSYDMS PVLLGTGKSA RKSWFYFTEN
ELTPGAARVG NYKAVFNLRG DNGQSTGGLA VDSNLGWKGA ESYVATVPQV FDLWQDPQER
YDIFMNNYTE HTWTMVSISN AIEELMKTYV QHPPRKLQSE SYSGPLTITS YQRFEWARQQ
LGKEGVNIPL PTGN
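
The protein sequence can be recovered from the gene sequence by codon-by-codon translation. The sketch below assumes the sense strand is shown 5' to 3' starting at the first codon. It relies on translation table 11 assigning the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons). The `translate` helper and the short test fragments are illustrative and were not taken from the source database.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAATCGTA GAATTATCTG CGGCGTTGGA"))  # MNRRIICGVG
print(translate("CCGACAGGCA ACTGA"))                  # PTGN
```

Both fragments translate to the corresponding ends of the protein sequence listed above (`MNRRIICGVG` and `PTGN`). Passing the full 1665 bp gene sequence to the same function should give the complete translation for comparison.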