Gene Smed_4067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4067 
Symbol 
ID5317896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp529696 
End bp531204 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content62% 
IMG OID640775874 
Productsulphate transporter 
Protein accessionYP_001312807 
Protein GI150376211 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.124032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGAAGAA AATCAATGAA GCTGACATTT ACGAGTTATA AACGCGAATG GTTCTCCAAC 
ATCCGCGGTG ACGTCCTTTC AGGCATCGTC GTGGCGCTTG CGCTGATCCC GGAGGCCATC
GGCTTTTCTG TCATTGCCGG CGTCGATCCG AAGGTCGGCC TGTTCGCCTC CTTCGCCATT
GCATGCGTTT CCGCCTTCAC CGGCGGCCGT CCGGGCATGA TTTCAGCGGC GACCGCCGCC
ACCGCCGTGC TGATGGTGAC GCTCGTCAAG GAATACGGCC TCGAATACCT GTTTGCCGCC
ACCCTGCTGA TGGGGCTCAT CCAGATAGCC GCGGGCTTTC TGAAGCTCGG GCGGGTCATG
CGCTTCGTGT CGCGTTCTGT CATCACCGGA TTCGTCAATG CGCTTGCGAT CATGATTTTC
ATGGCGCAGC TTCCCGAACT CATCGGCGTG CCGCACGTAA CCTATGCGAT GATCGGGGCG
GGGCTCGCGA TCATCTATCT CTTCCCCTAT GTCACCAAGG CGGTTCCGTC GCCGCTCGTC
GCAATCGCCG CCCTGACAGC GATCGCCGTC TGGACCGGCA TGGACATCCG CACCGTAGGC
GACCTCGGCG AATTGCCGTC AAGCCTGCCG ATCTTCGCCC TTCCGCAGGT GCCGCTCACC
TTCGAGACGC TGCAGATCGT CTTCCCCTAC TCGGTGGGAC TTGCCGCGGT CGGACTGCTC
GAGTCCCTTT TGACTGCGCA AATCGTCGAC GACATGACGG ATACGACGAG CAACAAGAGC
CAGGAATGCA TCGGCCAGGG CGCAAGCAAC ATCGCATCCG GCCTCATCGG CGGAATGGGC
GGCTGTGCGA TGATCGGCCA GTCGGTCATC AACGTGACCT CCGGGGGACG CGGGCGGCTA
TCAACCTTCG TGGCCGGTGC CCTCCTTCTC TTTCTCATCC TCGTCCTCGA CGACATCGTG
CGCATCATTC CGATGGCGGC ACTGGTGGCG GTGATGATCA TGGTCTCAAT CGGCACCTTC
TCCTGGCGTT CGATCCTGGA CCTGCGCCGC AATCCCCTGC CCTCCTCCGT CGTAATGCTG
GCAACGGTCG TTACCACCGT CGGAACCCAT GATCTCGCAA AAGGCGTTCT GGTCGGCGTG
CTCTTGTCCG GGATATTCTT CGCCGGCAAG GTCGCGCGCC TCTTCCATGT CCGCTCGATA
CTTGACCAAA GCGGCCGGGA GCGCACCTAT TACGTCGACG GCCAGATCTT TTTTGCCTCG
ACGGAGGGTT TCGTCGGCGC CTTCGATTTC GCCGAACCGC TGGACAAGGT CGTCATCGAC
GTGAGCGGGG CGCATCTATG GGACATCACC GCAGTCGGCG CGCTCGACAA GGTGGTGCTG
AAATATCGTC GCCACGGCGT GGCGGTGGAG GTGATCGGCC TCAACGAGGC AAGCGCCCAT
ATGCTCGACC GCTTCGCCGT GCACGACAAG AGCGAAGAGG CGGGCGCGGC CCTCACTCCA
GCCCATTGA
 
Protein sequence
MRRKSMKLTF TSYKREWFSN IRGDVLSGIV VALALIPEAI GFSVIAGVDP KVGLFASFAI 
ACVSAFTGGR PGMISAATAA TAVLMVTLVK EYGLEYLFAA TLLMGLIQIA AGFLKLGRVM
RFVSRSVITG FVNALAIMIF MAQLPELIGV PHVTYAMIGA GLAIIYLFPY VTKAVPSPLV
AIAALTAIAV WTGMDIRTVG DLGELPSSLP IFALPQVPLT FETLQIVFPY SVGLAAVGLL
ESLLTAQIVD DMTDTTSNKS QECIGQGASN IASGLIGGMG GCAMIGQSVI NVTSGGRGRL
STFVAGALLL FLILVLDDIV RIIPMAALVA VMIMVSIGTF SWRSILDLRR NPLPSSVVML
ATVVTTVGTH DLAKGVLVGV LLSGIFFAGK VARLFHVRSI LDQSGRERTY YVDGQIFFAS
TEGFVGAFDF AEPLDKVVID VSGAHLWDIT AVGALDKVVL KYRRHGVAVE VIGLNEASAH
MLDRFAVHDK SEEAGAALTP AH