Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2186 |
Symbol | arsB |
ID | 6486747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2100108 |
End bp | 2101157 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642737535 |
Product | arsenical-resistance protein |
Protein accession | YP_002041277 |
Protein GI | 194445401 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.368686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 0.934587 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCAC AAACCCAGGC CGCTCCGGCC ATGAACCTGT TTGAACGTTA CCTGAGCGTC TGGGTTGCGC TCTGCATCGC CATCGGTATT CTGCTCGGGC AGGTAATGCC GGGCGTCTTT CGCGTTATCG GCGGTCTGGA GATTGCCCGC GTCAATTTGC CGGTCGGTTT ACTTATCTGG GTAATGATTA TTCCTATGCT GCTGCGCATT GATTTCGGAG CGCTGGGTCA GGTCAAAGCG CACTGGCGCG GGATCGGCGT AACGTTGTTT ATCAACTGGC TGGTTAAACC ATTCTCAATG GCGCTGCTCG GCTGGCTGTT TATCCGTTAT TTATTCGCGC CGTGGCTGCC AACCGATCAG CTTGACAGCT ACATCGCTGG GCTTATTCTG CTGGCCGCCG CGCCCTGTAC GGCAATGGTG TTTGTCTGGA GCCGACTCAC CAACGGCGAC CCTTATTTCA CGCTGTCGCA GGTGGCGCTG AACGATGCCA TCATGATTTT CGCCTTTGCG CCGATTGTTG GGCTGCTACT CGGCTTGTCC TCAATCATCG TGCCGTGGGC CACCCTGTTA ACCTCCGTGG TACTGTACAT CGTTGTGCCG GTTATTCTGG CCCAGCTCTG GCGTAAGGCC CTGCTACGCA AGGGGCAAGC GGCATTCGAT AATGCGCTGA CGAAAATCGG CCCGTGGTCA ATGGCGGCAT TACTGGCAAC GCTGGTGTTA CTGTTTGCGT TTCAGGGCGA AGCCATTCTG CAACAGCCGC TGGTCATTGC CCTGCTGGCG GTGCCGATTC TGATTCAGGT GCTGTTTAAC TCGGCGCTGG CCTATGGGTT GAATCGACTG GTGGGCGAAA AGCACAATAT TGCCTGTCCA TCCAGCCTGA TCGGCGCATC GAACTTCTTT GAACTGGCTG TCGCAGCGGC AATCAGCCTG TTTGGTTTTC ATTCCGGTGC GGCGCTGGCG ACCGTGGTTG GGGTGCTCAT TGAGGTGCCG GTGATGCTGC TGGTTGTGCG AGTGGTGAAT CGTTCGAAAG GGTGGTATGA GAGAGCTTAG
|
Protein sequence | MSAQTQAAPA MNLFERYLSV WVALCIAIGI LLGQVMPGVF RVIGGLEIAR VNLPVGLLIW VMIIPMLLRI DFGALGQVKA HWRGIGVTLF INWLVKPFSM ALLGWLFIRY LFAPWLPTDQ LDSYIAGLIL LAAAPCTAMV FVWSRLTNGD PYFTLSQVAL NDAIMIFAFA PIVGLLLGLS SIIVPWATLL TSVVLYIVVP VILAQLWRKA LLRKGQAAFD NALTKIGPWS MAALLATLVL LFAFQGEAIL QQPLVIALLA VPILIQVLFN SALAYGLNRL VGEKHNIACP SSLIGASNFF ELAVAAAISL FGFHSGAALA TVVGVLIEVP VMLLVVRVVN RSKGWYERA
|
| |