Gene SNSL254_A2186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2186 
SymbolarsB 
ID6486747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2100108 
End bp2101157 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content57% 
IMG OID642737535 
Productarsenical-resistance protein 
Protein accessionYP_002041277 
Protein GI194445401 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.368686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value0.934587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCAC AAACCCAGGC CGCTCCGGCC ATGAACCTGT TTGAACGTTA CCTGAGCGTC 
TGGGTTGCGC TCTGCATCGC CATCGGTATT CTGCTCGGGC AGGTAATGCC GGGCGTCTTT
CGCGTTATCG GCGGTCTGGA GATTGCCCGC GTCAATTTGC CGGTCGGTTT ACTTATCTGG
GTAATGATTA TTCCTATGCT GCTGCGCATT GATTTCGGAG CGCTGGGTCA GGTCAAAGCG
CACTGGCGCG GGATCGGCGT AACGTTGTTT ATCAACTGGC TGGTTAAACC ATTCTCAATG
GCGCTGCTCG GCTGGCTGTT TATCCGTTAT TTATTCGCGC CGTGGCTGCC AACCGATCAG
CTTGACAGCT ACATCGCTGG GCTTATTCTG CTGGCCGCCG CGCCCTGTAC GGCAATGGTG
TTTGTCTGGA GCCGACTCAC CAACGGCGAC CCTTATTTCA CGCTGTCGCA GGTGGCGCTG
AACGATGCCA TCATGATTTT CGCCTTTGCG CCGATTGTTG GGCTGCTACT CGGCTTGTCC
TCAATCATCG TGCCGTGGGC CACCCTGTTA ACCTCCGTGG TACTGTACAT CGTTGTGCCG
GTTATTCTGG CCCAGCTCTG GCGTAAGGCC CTGCTACGCA AGGGGCAAGC GGCATTCGAT
AATGCGCTGA CGAAAATCGG CCCGTGGTCA ATGGCGGCAT TACTGGCAAC GCTGGTGTTA
CTGTTTGCGT TTCAGGGCGA AGCCATTCTG CAACAGCCGC TGGTCATTGC CCTGCTGGCG
GTGCCGATTC TGATTCAGGT GCTGTTTAAC TCGGCGCTGG CCTATGGGTT GAATCGACTG
GTGGGCGAAA AGCACAATAT TGCCTGTCCA TCCAGCCTGA TCGGCGCATC GAACTTCTTT
GAACTGGCTG TCGCAGCGGC AATCAGCCTG TTTGGTTTTC ATTCCGGTGC GGCGCTGGCG
ACCGTGGTTG GGGTGCTCAT TGAGGTGCCG GTGATGCTGC TGGTTGTGCG AGTGGTGAAT
CGTTCGAAAG GGTGGTATGA GAGAGCTTAG
 
Protein sequence
MSAQTQAAPA MNLFERYLSV WVALCIAIGI LLGQVMPGVF RVIGGLEIAR VNLPVGLLIW 
VMIIPMLLRI DFGALGQVKA HWRGIGVTLF INWLVKPFSM ALLGWLFIRY LFAPWLPTDQ
LDSYIAGLIL LAAAPCTAMV FVWSRLTNGD PYFTLSQVAL NDAIMIFAFA PIVGLLLGLS
SIIVPWATLL TSVVLYIVVP VILAQLWRKA LLRKGQAAFD NALTKIGPWS MAALLATLVL
LFAFQGEAIL QQPLVIALLA VPILIQVLFN SALAYGLNRL VGEKHNIACP SSLIGASNFF
ELAVAAAISL FGFHSGAALA TVVGVLIEVP VMLLVVRVVN RSKGWYERA