Gene Smed_2068 details

Gene Information

Locus tag: Smed_2068
Symbol:
ID: 5322927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2119512
End bp: 2120987
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 57%
IMG OID: 640791005
Product: sulfatase
Protein accession: YP_001327736
Protein GI: 150397269
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
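
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2119512
end_bp = 2120987

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1  # the stop codon encodes no residue

print(gene_length, remainder, protein_length)  # 1476 0 491
```

Under these assumptions the result agrees with the listed gene length of 1476 bp and protein length of 491 aa.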


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00300653
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCGAGCCA TATTTGTATT GTTCGATTCA CTGAACCGCA CTGCCGTGGG CCGCTACGGT 
GCGAACGCGG TCAAAACGCC CAACTTCGAT CGGTTTGCAG AACGCGCGAC CACATTCGAC
AGTCACTTCG TCGGCAGTCT TCCCTGCATG CCGGCTCGTC GGGATTTGCA CACGGGCCGC
CTGAACTTCA TGCATCGAAG CTGGGGGCCG CTGGAGCCGT TCGACAATTC CTTCCCGGAG
CTGCTGGGCA AGTGCGGCGT TCACTCGCAC CTGATCACCG ACCACCTTCA TTATTTCGAG
GATGGCGGCT CGACCTATCA TACCCGGTTC CGCACATGGG ATTTCATCCG CGGACAGGAA
GACGACCCCT GGAAAGCGAT GGTGCAGCCG CCACTCGAGC GCTTCAAGGA AATGTATTCG
GAGAAGCATT ATGACTTTGA TGATCCGTGG AAGCGCATGC AGAGCGCGGT CAATCGCGAA
TTCGTTCGTG GCGAGCACGA GTATCCGGGT CCCCGCTGCT TTAAGTCCGC TTTGGAATTC
CTGGATCTGA ACCGAGCAGC AGACGACTGG TTCCTGATGG TCGAATGCTT CGATCCCCAT
GAGCCATTCG CCGCGCCGGA GCGGTTCAAG GAGCAATACG CTACGGGATG GGAGGGCGGT
GTTCTCGACT GGCCGAAATA TGAGAAAGTC GTCGACAGCC CGGAGGAAAT TGCGGAAATA
CGCGCCAACT ATGCCGCCTT GGTAACAATG TGCGACGAAT ACTTCGGGCG CCTGCTCGAC
TATTTTGACG AGCACGACCT TTGGAAAGAC ACAGCGATCA TCCTGTCCAC CGATCACGGA
TTCCTTCTTG CCGAGCACGA CTGGTGGGGC AAGAATCGGA TGCCTTACTA TGCCGAAATT
TCCCAGATTC CACTCATCAT TTACCATCCG GAGCATGCCG GAGGAGGCGG GACGCGACGT
TCGGCGCTTA CGCAGACCAT CGATCTGATG CCGACCTTCC TCGATCTCTT CGGCATCGAT
GTGCCGCAGG AAGTGCAGGG ACATTCCCTC CTACCCCTGT TGAAGGAGGA TAGATCGATG
CGGGACGTTG CCATTTTCGG CGTATTCGGC GGCCCCATCG GATCAACCGA CGGCAGGTAC
ACTTATTACC TGTATCCCGA AGACCTCTAT GGTCCCGACC TCCACGAGTA CACTCTCATG
CCAATGCATA TGACTTCATT GTTCACCCCG GAGGAACTGA AGACGTCGGC ACTTACGGCT
GGTTTCAATT TCACCAAGAA TATGCCAGTC CTTCGGATCG ATGCGCTGCG AGATGCGCGA
CGAATCCCCA ACAATGATCG GGTCGGGTGG TCGGTGGACC TTGGAACGAA CCTTGTACGA
TCTTCATCTG GACCGAACGC AGATGCGGCC CTTCCGGGAT TCGGAGATAG AGCTCCGCCT
GTCCGAGGGA ATCCGGAGTG TGCTTATCGC CCATGA
 
Protein sequence
MRAIFVLFDS LNRTAVGRYG ANAVKTPNFD RFAERATTFD SHFVGSLPCM PARRDLHTGR 
LNFMHRSWGP LEPFDNSFPE LLGKCGVHSH LITDHLHYFE DGGSTYHTRF RTWDFIRGQE
DDPWKAMVQP PLERFKEMYS EKHYDFDDPW KRMQSAVNRE FVRGEHEYPG PRCFKSALEF
LDLNRAADDW FLMVECFDPH EPFAAPERFK EQYATGWEGG VLDWPKYEKV VDSPEEIAEI
RANYAALVTM CDEYFGRLLD YFDEHDLWKD TAIILSTDHG FLLAEHDWWG KNRMPYYAEI
SQIPLIIYHP EHAGGGGTRR SALTQTIDLM PTFLDLFGID VPQEVQGHSL LPLLKEDRSM
RDVAIFGVFG GPIGSTDGRY TYYLYPEDLY GPDLHEYTLM PMHMTSLFTP EELKTSALTA
GFNFTKNMPV LRIDALRDAR RIPNNDRVGW SVDLGTNLVR SSSGPNADAA LPGFGDRAPP
VRGNPECAYR P
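
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11 (the NCBI bacterial, archaeal and plant plastid code), in which TTG is one of several alternative initiation codons and is read as methionine when it starts a CDS. The following is a minimal sketch of that rule, not the pipeline that produced this record; the function name `translate_cds` and its `initiator` parameter are hypothetical, and only the first and last sequence lines of the record are translated.

```python
# Standard codon-to-residue assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiation codons listed for NCBI translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, initiator=True):
    """Translate an in-frame DNA fragment, stopping at the first stop codon.

    If initiator is True, a table 11 start codon in the first position
    is translated as M, as happens at the start of a CDS.
    """
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(dna) - len(dna) % 3
    residues = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and initiator and codon in START_CODONS_TABLE_11:
            residues.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence: the initial TTG is read as M,
# while the in-frame TTG at codon 7 is read as L.
first_residues = translate_cds("TTGCGAGCCA TATTTGTATT GTTCGATTCA")
print(first_residues)  # MRAIFVLFDS

# Last line of the gene sequence (in frame, since 1440 bases precede it).
last_residues = translate_cds(
    "GTCCGAGGGA ATCCGGAGTG TGCTTATCGC CCATGA", initiator=False
)
print(last_residues)  # VRGNPECAYRP
```

Both fragments match the corresponding ends of the protein sequence listed above, and the final TGA codon accounts for the one-codon difference between 1476 / 3 = 492 codons and the 491-residue protein.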