Gene Smed_4544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4544 
Symbol 
ID5319045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1031566 
End bp1032591 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content59% 
IMG OID640776345 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_001313277 
Protein GI150376681 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.284119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCTA ATAGACTTGC CGGAATAGCG AAATTAGCGC TCGTCGTCGG AAGCCTGCAG 
CTTGGCTCGG TTGGCCTCGT GCATGCGGAC ACGACGATCC TGAACGTGTC CTACGACCCG
ACTCGGGAAC TTTATAAAGA GTTCAATGCA GCCTTTGCCG AGAAGTGGCA GGCTGATACC
GGCGAAACCG TGACGATCCA AACATCGCAT GGCGGCTCCG GCAAGCAGGC GCGTTCGGTT
ATCGACGGGC TGGAGGCGGA CGTGGTGACC CTGGCACTAG AAGCCGACAT CGATGCGATC
GCCAGAGAGA GCGGAAAAAT TCCCGCCGAT TGGAAGACCC GTTTGGAAAA CAACAGTTCT
CCCTACACCT CGACGATCGT CTTCCTCGTC CGCAAGGGAA ACCCGAAGGG GATCAAGGAT
TGGGGCGATC TTGTCCGGGA GGACGTGCAG GTGATCACGC CCAATCCGAA GACCTCGGGC
GGCGCTCGCT GGAACTTCCT CGCTGCCTGG GCCTGGGCCC GCGCTGCCAA TAACGGCGAC
GACACAAAGG CGCAGGAATA CGTGACGCAA CTCTTCAAGC ACGTTCCGGT TCTCGACACC
GGCGCGCGCG GTGCGACGAC CACCTTCGTT CAGCGCGGGT TGGGAGACGT GTTGCTCGCC
TGGGAAAACG AAGCCTATCT GTCGCTGGAA GAGCTCGGCC CGGACAATTT TGACATAGTA
ACCCCGTCTA TTTCGATCAA GGCGGAACCA CCCGTGGCGC TCGTCGATGG CAATGTCGAT
CGCAAGGGCA CGCGTAAGGT GGCGGAAGCC TATCTCGACT ATCTCTACAG CGATGCCGGC
CAGAAGATCG CGGCCAAGCA CTATTACCGG CCGTTCAAGC CGGAAGCGGC AGACGCTGAG
GACACGGCCC GCTTCAAGGA ATTGAAGCTA GTCACGATCA ACGACTTCGG CGGCTGGAAG
GAAGCTCAAC CGAAATTCTT CGGCGATGGC GGAATTTTCG ACCAGATTTA CAGACCGGGG
CAATGA
 
Protein sequence
MSSNRLAGIA KLALVVGSLQ LGSVGLVHAD TTILNVSYDP TRELYKEFNA AFAEKWQADT 
GETVTIQTSH GGSGKQARSV IDGLEADVVT LALEADIDAI ARESGKIPAD WKTRLENNSS
PYTSTIVFLV RKGNPKGIKD WGDLVREDVQ VITPNPKTSG GARWNFLAAW AWARAANNGD
DTKAQEYVTQ LFKHVPVLDT GARGATTTFV QRGLGDVLLA WENEAYLSLE ELGPDNFDIV
TPSISIKAEP PVALVDGNVD RKGTRKVAEA YLDYLYSDAG QKIAAKHYYR PFKPEAADAE
DTARFKELKL VTINDFGGWK EAQPKFFGDG GIFDQIYRPG Q