Gene Smed_0564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0564 
Symbol 
ID5321400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp612127 
End bp613665 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content62% 
IMG OID640789500 
Productsulfatase 
Protein accessionYP_001326255 
Protein GI150395788 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.449143 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCG GGAAGCCAAA CATTCTGATC ATCATGGTGG ACCAGTTGAA CGGAAAGCTC 
TTTCCGGACG GGCCTGCCGA CTTCCTGCAT GCGCCCAACC TGAAGGCGCT GGCCAAGCGA
TCGGCACGCT TTCGCAACAA CTACACCTCG TCGCCCCTGT GTGCCCCCGC CCGCGCGTCC
TTCATGGCGG GCCAGTTGCC GAGCCGCACG CGGGTTTACG ACAATGCGGC CGAGTACCAG
TCCTCGATCC CGACCTACGC GCATCACCTG CGCCGGGCCG GGTACTACAC GGCGCTTTCC
GGCAAGATGC ACTTCGTCGG CCCGGACCAG TTGCATGGTT TCGAGGAGCG GCTGACGACC
GATATCTATC CGGCCGATTT CGGCTGGACA CCGGACTACC GGAAGCCCGG CGAGCGAATC
GACTGGTGGT ATCACAATCT TGGCTCCGTA ACCGGAGCGG GTGTCGCCGA AATCACCAAC
CAGATGGAAT ATGACGACGA GGTCGCGTTC CTCGCAAACC AGAAGCTCTA CCAGCTTTCG
CGAGAAAACG ACGACGAAAG CCGGCGCCCC TGGTGCCTCA CCGTCTCCTT CACACACCCG
CACGACCCCT ATGTCGCGCG AAGGAAATTC TGGGACCTCT ACGAGGATTG CGAGCACCTC
ACGCCCGAGG TCGAAGCCAT CCCGCTCGAC AAGCAGGACT CGCATTCGCA ACGCATCATG
CTCTCCTGCG ACTACCGGAA TTTCGACGTG ACCGAAGAAA ATGTCCGCCG CTCGCGCCGC
GCCTATTTCG CCAACATCTC CTATCTCGAC GAGAAGGTCG GCGAATTGAT CGATACGCTC
ACACGGACGC GGATGCTGGA CGACACGCTC ATCCTCTTCT GTTCGGACCA CGGCGACATG
CTCGGCGAGC GCGGCCTCTG GTTCAAGATG AACTTTTTCG AAGGCTCAGC GCGCGTGCCG
CTGATGATTG CCGGACCCGG CATCGCGCCG GGCCTCCATC TGACACCGAC CTCCAACCTC
GACGTGACGC CGACGCTTGC GGATCTCGCC GGCATCTCGC TCGAGGAAGT GGGACCCTGG
ACCGATGGCG TCAGCCTCGT GCCGATGGTC AACGGCGTCG AGCGCACGCA GCCGGTGCTG
ATGGAATATG CCGCCGAAGC CTCCCATGCG CCGCTGGTCG CCATCCGCGA GGGCAAGTGG
AAATATGTTT ACTGCACGCT CGATCCGGAG CAGTTGTTCG ACCTGGAGGC AGACCCGCTG
GAACTCAGCA ACCTCGCTGA AAAGCCGCGC GGCCCGGTCG ACCAGGCGAC GCTCACGGCC
TTCCGAGACA TGCGCGCTGC GCATTGGGAC ATGGAGGCCT TCGATGCCTC CGTGCGCGAA
AGCCAGGCCC GGCGTTGGGT GGTCTACGAA GCGCTTCGAA ACGGCGCCTA CTATCCCTGG
GACCACCAGC CGCTGCAGAA AGCTTCGGAG CGCTACATGC GCAATCACAT GAACCTCGAC
AATCTCGAGG AATCCAAACG CTATCCGCGA GGAGAATGA
 
Protein sequence
MTTGKPNILI IMVDQLNGKL FPDGPADFLH APNLKALAKR SARFRNNYTS SPLCAPARAS 
FMAGQLPSRT RVYDNAAEYQ SSIPTYAHHL RRAGYYTALS GKMHFVGPDQ LHGFEERLTT
DIYPADFGWT PDYRKPGERI DWWYHNLGSV TGAGVAEITN QMEYDDEVAF LANQKLYQLS
RENDDESRRP WCLTVSFTHP HDPYVARRKF WDLYEDCEHL TPEVEAIPLD KQDSHSQRIM
LSCDYRNFDV TEENVRRSRR AYFANISYLD EKVGELIDTL TRTRMLDDTL ILFCSDHGDM
LGERGLWFKM NFFEGSARVP LMIAGPGIAP GLHLTPTSNL DVTPTLADLA GISLEEVGPW
TDGVSLVPMV NGVERTQPVL MEYAAEASHA PLVAIREGKW KYVYCTLDPE QLFDLEADPL
ELSNLAEKPR GPVDQATLTA FRDMRAAHWD MEAFDASVRE SQARRWVVYE ALRNGAYYPW
DHQPLQKASE RYMRNHMNLD NLEESKRYPR GE