Gene SNSL254_A1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1914 
SymbolychM 
ID6483374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1875012 
End bp1876697 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content57% 
IMG OID642737283 
Productputative sulfate transporter YchM 
Protein accessionYP_002041033 
Protein GI194442347 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.200297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAAAT TATTTTCCTC ACATGTGATG CCTTTCCGCG CTCTCATCGA TGCTTGCTGG 
AAAGAAAAAT ATACCGCCTC CCGGTTCACC CGTGATGTGA TAGCCGGGAT CACCGTCGGG
ATTATTGCTA TCCCGCTGGC GATGGCGCTG GCAATTGGCA GTGGCGTTGC GCCGCAGTAT
GGCCTCTATA CCTCCGCTGT CGCCGGGATC GTGATCGCGC TAACCGGCGG CTCGCGCTTT
AGCGTTTCCG GCCCTACCGC CGCGTTTGTG GTGATTTTGT ATCCGGTATC GCAACAGTTT
GGTCTGGCGG GCCTACTGGT CGCCACGCTG ATGTCGGGCT TCTTCCTGAT CCTTTTCGGC
CTGGCGAGAC TGGGGCGATT GATTGAATAT ATCCCGGTGT CGGTCACGTT AGGTTTTACC
TCAGGGATTG GCATTACCAT CGGTACCATG CAGATTAAAG ATTTTCTTGG TCTGCAGATG
GCCCATGTGC CAGAGCACTA TTTGCAGAAA GTCGGCGCGC TGTTTATGGC GTTGCCCACC
GTCAATATTG GCGATGCCGC CATTGGCGTG GTAACGCTGG GAACGTTGAT TTTCTGGCCG
CGTCTCGGTA TTCGTCTGCC AGGACATCTT CCCGCACTAC TGGCCGGTTG CGCCGTGATG
GGGATCGTTA ATCTGCTGGG CGGCAATGTG GCGACTATCG GCTCACAGTT CCATTATGTT
CTGGCTGACG GCACTCAGGG CAACGGCATC CCGCAGCTCC TGCCGCAACT GATGCTGCCG
TGGAGTCTTC CTGGCTCCGA TTTCACGCTA AGCTGGGATT CACTGCGCGC GCTGCTGCCA
GCGGCCTTCT CGATGGCAAT GCTGGGGGCA ATCGAATCAT TGCTCTGCGC CGTCGTGCTG
GACGGCATGA CCGGCACCAA ACATAAAGCT AACAGCGAAC TTATCGGCCA GGGGCTGGGG
AATATGGTCG CGCCGTTCTT TGGCGGCATC ACCGCCACCG CCGCGATTGC CCGCTCTGCC
GCCAACGTCC GCGCTGGCGC AACCTCTCCC GTCTCAGCGG TAATTCACGC TATCCTGGTT
ATTCTGGCGC TACTGGTCTT AGCCCCGCTA CTCTCCTGGC TGCCGCTTTC CGCGATGGCG
GCGCTACTGC TGATGGTGGC ATGGAATATG AGTGAAGCCC ATAAAGTGGT GGATCTGTTA
CGCCATGCGC CGAAAGACGA CATCATCGTT ATGCTGCTGT GCATGTCATT AACGGTTCTG
TTTGATATGG TCATCGCCAT CAGCGTGGGG ATTGTCCTTG CTTCCCTGCT GTTTATGCGC
CGTATTGCGC GAATGACTCG ACTTGCGCCG GTCAATGTTG ATGTGCCTGA AGATGTGCTG
GTGCTGCGTG TTATCGGTCC GCTCTTTTTC GCCGCGGCGG AAGGGCTGTT TACCGACCTT
GAGTCACGTA TTAAGGGCAA ACGTATCGTC GTTCTGAAAT GGGACGCAGT ACCAGTGCTG
GATGCAGGCG GGCTTGATGC TTTTCAGCGT TTTGTGAAGC GTCTGCCGGA GGGTTGCGAA
TTGCGTATCA GTAATCTGGA GTTCCAACCG CTGCGCACAA TGGCGCGTGC CGGTATCAAA
CCTATTCCTG GGCGTCTGAC CTTCTTCCCG AACAGGACGG AGGCGTTAGC GGATTTACTC
AGTTAA
 
Protein sequence
MNKLFSSHVM PFRALIDACW KEKYTASRFT RDVIAGITVG IIAIPLAMAL AIGSGVAPQY 
GLYTSAVAGI VIALTGGSRF SVSGPTAAFV VILYPVSQQF GLAGLLVATL MSGFFLILFG
LARLGRLIEY IPVSVTLGFT SGIGITIGTM QIKDFLGLQM AHVPEHYLQK VGALFMALPT
VNIGDAAIGV VTLGTLIFWP RLGIRLPGHL PALLAGCAVM GIVNLLGGNV ATIGSQFHYV
LADGTQGNGI PQLLPQLMLP WSLPGSDFTL SWDSLRALLP AAFSMAMLGA IESLLCAVVL
DGMTGTKHKA NSELIGQGLG NMVAPFFGGI TATAAIARSA ANVRAGATSP VSAVIHAILV
ILALLVLAPL LSWLPLSAMA ALLLMVAWNM SEAHKVVDLL RHAPKDDIIV MLLCMSLTVL
FDMVIAISVG IVLASLLFMR RIARMTRLAP VNVDVPEDVL VLRVIGPLFF AAAEGLFTDL
ESRIKGKRIV VLKWDAVPVL DAGGLDAFQR FVKRLPEGCE LRISNLEFQP LRTMARAGIK
PIPGRLTFFP NRTEALADLL S