Gene SeD_A1540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1540 
Symbol 
ID6873303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1488055 
End bp1489740 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content56% 
IMG OID642784692 
Productputative sulfate transporter YchM 
Protein accessionYP_002215362 
Protein GI198244029 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0985805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.161381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAAAT TATTTTCCTC ACATGTGATG CCTTTCCGCG CTCTCATCGA TGCTTGCTGG 
AAAGAAAAAT ATACCGCCTC CCGGTTCACC CGCGATGTGA TAGCCGGGAT CACCGTCGGG
ATTATTGCTA TCCCGCTGGC GATGGCGCTG GCAATTGGCA GTGGCGTTGC GCCGCAGTAT
GGCCTCTATA CCTCCGCTGT CGCCGGGATC GTGATCGCGC TAACCGGCGG CTCGCGCTTT
AGCGTTTCCG GCCCTACCGC CGCGTTTGTG GTGATTTTGT ATCCGGTATC GCAACAGTTT
GGTCTGGCGG GCCTACTGGT CGCCACGCTG ATGTCGGGCT TCTTCCTGAT CCTTTTCGGC
CTGGCGAGAC TGGGGCGATT GATTGAATAT ATCCCGGTGT CGGTCACGTT GGGTTTTACC
TCAGGGATTG GTATTACCAT CGGTACCATG CAGATTAAAG ATTTTCTTGG TCTGCAGATG
GCCCATGTGC CAGAGCACTA TTTGCAGAAA GTCGGCGCGC TGTTTATGGC GTTGCCCACC
GTCAATATTG GCGATGCCGC CATTGGCGTG GTAACGCTGG GAACGTTGAT TTTCTGGCCG
CGTCTCGGTA TTCGTCTGCC AGGACATCTT CCCGCGCTGC TGGCCGGTTG CGCCGTAATG
GGGATTGTTA ATCTGCTGGG CGGCAATGTG GCGACTATCG GCTCACAGTT CCATTACGTT
CTGGCTGACG GCACTCAGGG CAACGGCATC CCGCAGCTCC TGCCGCAACT GATGCTGCCG
TGGAGTCTTC CTAGCTCCGA TTTCACGCTA AGCTGGGATT CACTGCGCGC GCTGCTGCCA
GCGGCCTTCT CGATTGCAAT GCTGGGGGCA ATCGAATCAT TGCTCTGCGC CGTCGTGCTG
GACGGCATGA CCGGCACCAA ACATAAAGCT AACAGCGAAC TTATCGGCCA GGGGCTGGGG
AATATGGTCG CGCCGTTCTT TGGCGGCATC ACCGCCACCG CCGCGATTGC CCGCTCTGCC
GCCAACGTCC GCGCTGGCGC GACCTCTCCC ATCTCGGCGG TAATTCACGC TATCCTGGTC
ATTCTGGCGC TACTGGTCTT GGCCCCGCTA CTCTCCTGGC TGCCGCTTTC CGCGATGGCG
GCGCTACTGC TGATGGTGGC ATGGAATATG AGTGAAGCCC ATAAAGTGGT GGATCTGTTA
CGCCATGCGC CGAAAGACGA CATTATCGTT ATGCTGCTGT GCATGTCATT AACGGTTCTG
TTTGATATGG TCATCGCCAT CAGCGTGGGG ATTGTCCTTG CTTCCCTGCT GTTTATGCGC
CGTATTGCGC GAATGACTCG ACTTGCGCCG GTCAATGTTG ATGTACCTGA AGATGTGCTG
GTGCTGCGTG TTATCGGTCC GCTCTTTTTC GCCGCGGCGG AAGGGCTGTT TACCGACCTT
GAGTCACGTA TTAAGGGCAA ACGTATCGTC GTTCTGAAAT GGGACGCAGT ACCAGTGCTG
GATGCAGGCG GGCTTGATGC TTTTCAGCGT TTTGTGAAGC GTCTGCCGGA GGGTTGCGAA
TTGCGTATCA GTAATCTGGA GTTCCAACCG CTGCGCACAA TGGCGCGTGC CGGTATCAAA
CCTATTCCTG GGCGTCTGAC CTTCTTCCCA AACAGGACGG AGGCGTTAGC GGATTTACTA
AGTTAA
 
Protein sequence
MNKLFSSHVM PFRALIDACW KEKYTASRFT RDVIAGITVG IIAIPLAMAL AIGSGVAPQY 
GLYTSAVAGI VIALTGGSRF SVSGPTAAFV VILYPVSQQF GLAGLLVATL MSGFFLILFG
LARLGRLIEY IPVSVTLGFT SGIGITIGTM QIKDFLGLQM AHVPEHYLQK VGALFMALPT
VNIGDAAIGV VTLGTLIFWP RLGIRLPGHL PALLAGCAVM GIVNLLGGNV ATIGSQFHYV
LADGTQGNGI PQLLPQLMLP WSLPSSDFTL SWDSLRALLP AAFSIAMLGA IESLLCAVVL
DGMTGTKHKA NSELIGQGLG NMVAPFFGGI TATAAIARSA ANVRAGATSP ISAVIHAILV
ILALLVLAPL LSWLPLSAMA ALLLMVAWNM SEAHKVVDLL RHAPKDDIIV MLLCMSLTVL
FDMVIAISVG IVLASLLFMR RIARMTRLAP VNVDVPEDVL VLRVIGPLFF AAAEGLFTDL
ESRIKGKRIV VLKWDAVPVL DAGGLDAFQR FVKRLPEGCE LRISNLEFQP LRTMARAGIK
PIPGRLTFFP NRTEALADLL S