Gene Smed_1130 details

Gene Information

Locus tag: Smed_1130
Symbol:
ID: 5321976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1197841
End bp: 1199454
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 58%
IMG OID: 640790071
Product: sulfatase
Protein accession: YP_001326816
Protein GI: 150396349
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
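
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1197841, 1199454)
print(length)                     # 1614
print(protein_length_aa(length))  # 537
```

Both results agree with the Gene Length and Protein Length fields of this record.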


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.216257
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCAAGC AGCCAAATAT TCTGTTCATA ATGTCGGACG ACCATGCCGC CCGGGCGATA 
TCGGCTTACG GCTCGGGCCT GAACAGCACG CCCAACATCG ACCGCATCGC CAATGAGGGG
ATGCGGCTCG ATCGGTGTTA TGTCACCAAT TCGATCTGCA CGCCCAGCCG CGCCGCTATT
TTGACCGGTA CCTATAACCA TGTGAACATG GTCACCACGC TCGACACCCA TATCGATAAT
CGGCTCCCAA ACGTCGCCAA GCACTTGCGC GCCGGTGGTT ATCAGACGGC GATCTTCGGC
AAGTGGCATC TGGGCGAGGG AAAGGCGCAC GAGCCTTCGG GTTTCGATGA ATGGTCCGTG
GTTCCCGGCC AGGGTGAGTA TTTCGATCCG GTGATGATCG ACCCGAGCGG CTCCCGCATG
GAGAAGGGCT ACGCCACCGA CATCATCACC GACAAATGCC TCGATTTTCT CAGCAGGCGC
GATATCGGAA GACCCTTCTT CCTGATGTGC CATCACAAGG CCCCACATCG GAGCTTCGAG
CCGCATCCAC GCTACAAACA ACTTTATGCA GACGGGAATC TGCCGGTTCC GGAAACATTT
TCCGACGACT ATTCCAACCG CGCCGCCGCT GCCGCCGCCG CGAAAATGCG CGTCCGCTCC
GACATGACGT ATAAGGACCT CGGTCTCGTC CAGCCCGAAG GCGGCGAGGA GACTGGCGAA
CTGCTCCTGC CGGGCTGGAC GCAGCGCAAG GTCCCCGATA TCGCGGAAGG TGGATCGTTG
CGTCTAATCG ACGGGGCGAC AGGTGAGAAC TACCTCTTCA CCGATCCGCA GAAGCTGGCT
CTTTTCAAAT ATCAGCGCTA CATGATGCGT TATCTGCAGA CCATCGCCGC CGTGGACGAC
AATGTCGGCC GATTGCTCGA CTATCTCGAC GCGGAAGGAC TGCGCGGCGA CACCATCGTC
ATCTACACGT CCGACCAGGG CTTCTTTCTC GGCGAACACG GCTGGTTCGA CAAGCGCTTC
ATGTACGAAG AATCAATGCA GATGCCTTTC CTGATCCGCT ATCCGCAAGG CATCGAGGCC
GGCGTGCAGG CCAGCCACAT CGCGACCAAT GTCGACTTTG CGCCGACCTT TCTCGATTAT
GCCGGCTTGC AGATCCCCAG TTACATGCAG GGACGGAGCA TGCGCCCGAT TTTCGACCGA
ACGGCAGATG ACGGCGACAC GGGCCTCGCC TATCACAGAT ATTGGATGCA CAAGGACGAG
TTCCACAACG CGTTCGCGCA TTACGGCGTT CGTGACGCGC GCTATAAGCT CATCTATTGG
TATAACGATC CGCTCGGGCA ACTCGGCGCC TTTTCCGGTG TTGAACCTCC GGAGTGGGAG
CTGTTCGATT GTGAGAAGGA CCCATTCGAA CTTCATAACC GCGCGAACGA CCCGGCCTAT
TCGGAAATTT TCGAAGAATT GCTTGCCAAG CTCGATGCGC GCATGGCCGA GATCGGCGAT
ACTCCGGAGC ATAGGAGCGC TGAGGTGCTG GCTGGTCTGC GAAGCAGGGG TCTGAGCCTC
GACATGCAAT CGGATCAAGC GAACGACAGG CGGTGGGCAA TAGCTGCCCC TTAA
 
Protein sequence
MTKQPNILFI MSDDHAARAI SAYGSGLNST PNIDRIANEG MRLDRCYVTN SICTPSRAAI 
LTGTYNHVNM VTTLDTHIDN RLPNVAKHLR AGGYQTAIFG KWHLGEGKAH EPSGFDEWSV
VPGQGEYFDP VMIDPSGSRM EKGYATDIIT DKCLDFLSRR DIGRPFFLMC HHKAPHRSFE
PHPRYKQLYA DGNLPVPETF SDDYSNRAAA AAAAKMRVRS DMTYKDLGLV QPEGGEETGE
LLLPGWTQRK VPDIAEGGSL RLIDGATGEN YLFTDPQKLA LFKYQRYMMR YLQTIAAVDD
NVGRLLDYLD AEGLRGDTIV IYTSDQGFFL GEHGWFDKRF MYEESMQMPF LIRYPQGIEA
GVQASHIATN VDFAPTFLDY AGLQIPSYMQ GRSMRPIFDR TADDGDTGLA YHRYWMHKDE
FHNAFAHYGV RDARYKLIYW YNDPLGQLGA FSGVEPPEWE LFDCEKDPFE LHNRANDPAY
SEIFEELLAK LDARMAEIGD TPEHRSAEVL AGLRSRGLSL DMQSDQANDR RWAIAAP
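
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, of how one might strip the block formatting, compute a GC fraction, and translate the coding sequence. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard code, and all function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering
# (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGACCAAGC AGCCAAATAT TCTGTTCATA"
print(translate(clean_sequence(first_blocks)))  # MTKQPNILFI
```

The translated fragment matches the first ten residues of the protein sequence listed in this record. Passing the full gene sequence to the same functions would allow the GC content and protein length fields to be checked as well.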