Gene SNSL254_A1689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1689 
Symbol 
ID6484494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1653144 
End bp1654631 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content58% 
IMG OID642737069 
Productmethyl viologen resistance protein SmvA 
Protein accessionYP_002040821 
Protein GI194443839 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value0.815414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCGTC AGTGGTTAAC GTTAGTCATT ATTGTGCTGG TCTATATTCC TGTCGCCATT 
GATGCCACGG TCTTGCATGT CGCCGCGCCG ACACTGAGTA TGACACTGGG GGCCAGCGGC
AACGAGCTGC TGTGGATCAT TGATATTTAT TCTTTGGTCA TGGCTGGCAT GGTGTTGCCG
ATGGGCGCGC TTGGCGATCG TATCGGTTTT AAACGCCTGC TGATGCTGGG CGGGACGCTC
TTTGGCCTGG CATCATTGGC GGCGGCGTTT TCGCATACCG CCAGTTGGCT TATCGCCACC
AGGGTATTAC TGGCTATTGG CGCGGCGATG ATTGTACCGG CGACGCTGGC CGGGATACGC
GCCACCTTTT GTGAAGAGAA GCATCGCAAC ATGGCGCTGG GCGTCTGGGC AGCGGTAGGT
TCGGGCGGAG CGGCGTTTGG GCCGCTCATC GGCGGCATAT TATTAGAGCA TTTTTACTGG
GGATCGGTTT TCCTGATCAA CGTGCCGATT GTGCTGGTCG TCATGGGCTT AACCGCCCGT
TATGTTCCTC GCCAGGCGGG CCGTCGCGAT CAACCGCTCA ATCTTGGCCA TGCGGTGATG
CTGATTATTG CCATTTTGCT GTTGGTCTAT AGCGCTAAAA CCGCGCTGAA AGGGCATCTG
TCGCTGTGGG TCATCTCGCT TACCCTGCTT ACCGGCGCGT TGCTACTGGG ACTCTTTATC
CGCACACAGC TTGCGACATC GCGTCCGATG ATTGATATGC GACTATTTAC CCATCGCATT
ATCCTGAGCG GCGTCGTGAT GGCAATGACC GCGATGATCA CGCTGGTGGG TTTTGAGCTG
CTGATGGCGC AAGAGCTGCA GTTTGTTCAC GGACTATCGC CTTATGAGGC CGGGGTATTT
ATGCTGCCGG TGATGGTCGC CAGTGGATTC AGCGGGCCGA TTGCGGGCGT GCTGGTCTCG
CGTCTGGGAC TACGGCTGGT CGCGACGGGC GGCATGGCGT TAAGCGCGCT GAGTTTTTAT
GGCCTGGCGA TGACGGATTT CAGCACCCAA CAATGGCAGG CTTGGGGGCT GATGGCGCTG
CTGGGATTTA GCGCCGCCAG CGCATTGCTG GCTTCCACGT CGGCAATTAT GGCCGCTGCG
CCGGCAGAAA AAGCGGCGGC GGCCGGCGCG ATAGAAACGA TGGCTTATGA ACTGGGCGCG
GGACTGGGCA TCGCCATTTT CGGTCTGTTG TTAAGCCGTA GCTTCTCCGC GTCTATCCGT
CTGCCTGCCG GGCTTGAGGC GCAAGAGATT GCCAGAGCGT CATCTTCAAT GGGAGAAGCC
GTGCAGTTGG CGAATAGCCT ACCGCCCACG CAGGGGCAGG CAATACTGGA CGCCGCCAGA
CATGCCTTTA TCTGGTCGCA TAGCGTGGCG TTAAGCAGCG CCGGGAGTAT GCTTCTTTTG
CTGGCGGTAG GGATGTGGTT CAGCCTGGCA AAAGCCCAAC GCCGATAA
 
Protein sequence
MFRQWLTLVI IVLVYIPVAI DATVLHVAAP TLSMTLGASG NELLWIIDIY SLVMAGMVLP 
MGALGDRIGF KRLLMLGGTL FGLASLAAAF SHTASWLIAT RVLLAIGAAM IVPATLAGIR
ATFCEEKHRN MALGVWAAVG SGGAAFGPLI GGILLEHFYW GSVFLINVPI VLVVMGLTAR
YVPRQAGRRD QPLNLGHAVM LIIAILLLVY SAKTALKGHL SLWVISLTLL TGALLLGLFI
RTQLATSRPM IDMRLFTHRI ILSGVVMAMT AMITLVGFEL LMAQELQFVH GLSPYEAGVF
MLPVMVASGF SGPIAGVLVS RLGLRLVATG GMALSALSFY GLAMTDFSTQ QWQAWGLMAL
LGFSAASALL ASTSAIMAAA PAEKAAAAGA IETMAYELGA GLGIAIFGLL LSRSFSASIR
LPAGLEAQEI ARASSSMGEA VQLANSLPPT QGQAILDAAR HAFIWSHSVA LSSAGSMLLL
LAVGMWFSLA KAQRR