Gene Smed_5896 details


Gene Information

Locus tag: Smed_5896
Symbol:
ID: 5320198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 861207
End bp: 862232
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 61%
IMG OID: 640777591
Product: sulfate ABC transporter, periplasmic sulfate-binding protein
Protein accession: YP_001314523
Protein GI: 150377928
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
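
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record, although both agree with the reported values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 861207
end_bp = 862232
reported_gene_length = 1026    # bp
reported_protein_length = 341  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1026 0 341
```

Under these assumptions the computed lengths match the Gene Length and Protein Length fields, and the gene length is a whole number of codons.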


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.183484
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.734335
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAGCGC GTGTATTTGT CGGGATCATG TTCGGCGTGA TTTCGGTCGG AGCCCTGCAA 
GTCGGCTTTC TCGGTGCCGC TGTCGCCGGC ACGACGATCG TGAACGTTTC CTATGACTCG
ACGCGAAAAC TCTACAAGGA GTTCAATGCT GCGTTCGCTG AAAAATGGGA GACGGAGACC
GGCGAGAAGG TGACGATCGA CATGTCGCAC GGCGGTTCGG GCAAGCAAGC GCGATTGGTG
ATCGATGGCC TCGAAGCGGA TGTGGTGACA CTGGCGCTCG AAGGCGATAT CGATGCCATT
GCTCAGGCGA CCGGCAAGCT TCCCCCGGAT TGGAGAACAC GCCTCGAGAA CAATAGTGCG
CCCTACACAT CGACGGTCGT TTTCCTGGTT CGCAAGGGCA ATCCGAGAGG CATCCGGGAT
TGGGGCGATC TGACGAAGGA GGGCATCCAG GTCGTCACGC CGAACCCGAA GACCTCGGGT
GGCGCGCGCT GGAACTTCCT TGCAGCCTGG GCCTGGGCGC GGGATGCAAA CAACGGCGAC
GAGGCCAAGG CGCAGGAATA TGCGGCGGCG CTTTTCAAGC AGGTTCTCGT TCTCGACACC
GGCGCGTGGG GAGCGATGAC CACTTTCGTC CATCGCGGGC TCGGCGACGT GCTGCTCGCC
TGGGAGAATG AGGCCTATCT CGCGCTCGAT GAACTCGGCC CCGACAAGTT CGAGATCGTG
ACACCGTCCA TATCGATCAG GGCCGAGCCC TCCGTGGCGC TCTTGGACGG GAATGTCGAC
AGCAAAGGCA CCCGCAATGT TGCCGAAGCC TATCTCGGCT ACCTCTACAG CGACGTCGGC
CAGAAGATCG TCGCCAAGCA CTACTATCGG CCGTTCAAGC CCGAGCTGGC CGACCCCGCG
GACTCGGCAC GCTTTGCCGA TCTCAAACTG GTCACCATTG GCGACTTCGG CGGTTGGCAG
GAAGCCCAGC CGAAGTTCTT CGACGATGGG GGGATTTTCG ACCAGATCTA TAAGCCGGGC
CGATAG
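
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, not the method used by the source database: the helper name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G and T plus the whitespace used to group bases into blocks of ten.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence above (20 bases, 10 of them G or C).
print(gc_content("ATGGGAGCGC GTGTATTTGT"))  # prints: 0.5
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported 61%. How the source rounds its figure is not stated.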
 
Protein sequence
MGARVFVGIM FGVISVGALQ VGFLGAAVAG TTIVNVSYDS TRKLYKEFNA AFAEKWETET 
GEKVTIDMSH GGSGKQARLV IDGLEADVVT LALEGDIDAI AQATGKLPPD WRTRLENNSA
PYTSTVVFLV RKGNPRGIRD WGDLTKEGIQ VVTPNPKTSG GARWNFLAAW AWARDANNGD
EAKAQEYAAA LFKQVLVLDT GAWGAMTTFV HRGLGDVLLA WENEAYLALD ELGPDKFEIV
TPSISIRAEP SVALLDGNVD SKGTRNVAEA YLGYLYSDVG QKIVAKHYYR PFKPELADPA
DSARFADLKL VTIGDFGGWQ EAQPKFFDDG GIFDQIYKPG R
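
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI layout, in which each base position cycles through T, C, A, G. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. Because this gene starts with ATG, the sketch applies no special start-codon handling. The names `CODON_TABLE` and `translate` are illustrative, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, third base varying fastest (NCBI ordering).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases such as N are not handled
    and raise KeyError.
    """
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGGGAGCGC GTGTATTTGT CGGGATCATG"))  # prints: MGARVFVGIM
# The last 6 bases encode the final arginine followed by the TAG stop codon.
print(translate("CGATAG"))  # prints: R
```

Running `translate` on the complete gene sequence and comparing the result with the protein sequence (after removing spaces) is a simple consistency check for the record.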