Gene SNSL254_A3014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3014 
SymbolemrA 
ID6485332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2938419 
End bp2939591 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content54% 
IMG OID642738330 
Productmultidrug resistance protein A 
Protein accessionYP_002042059 
Protein GI194445068 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR00998] efflux pump membrane protein (multidrug resistance protein A) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.00193784 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCTCAA ATGCGGAGAT CCAAACCCCG CAGCAACCGG CTAAGAAGAA AGGCAAACGC 
AAAACAGCGC TGCTACTTCT TACCTTGCTC TTTGTTATTA TTGCCGTGGC ATATGGAATT
TATTGGTTTT TAGTATTGCG TCATATTGAA GAGACAGATG ATGCTTACGT GGCAGGGAAC
CAGGTTCAAA TCATGGCGCA GGTGTCAGGC AGCGTGACGA AAGTCTGGGC TGATAACACC
GACTTTGTAA AAGAGGGCGA TGTTCTGGTC ACGCTCGATC AGACTGACGC CAAACAAGCG
TTTGAAAGAG CCAAAACGGC GCTGGCCTCC AGCGTGCGCC AGACGCACCA GTTGATGATT
AACAGCAAGC AGTTGCAGGC GAATATCGAC GTGCAAAAAA CCGCCCTGGC GCAAGCGCAA
AGCGACCTTA ACCGTCGTGT GCCGCTGGGT AATGCCAATC TTATTGGCCG TGAAGAGCTG
CAACACGCCC GCGATGCCGT CGCCAGCGCG CAGGCACAGC TGGATGTCGC CATTCAACAG
TACAATGCCA ACCAGGCAAT GATACTCAAC AGTAATCTGG AAGATCAGCC TGCGGTTCAA
CAAGCGGCGA CCGAAGTGCG TAACGCCTGG CTGGCGCTGG AGCGTACCCG CATCGTCAGC
CCAATGACTG GTTATGTCTC CCGCCGCGCC GTCCAGCCTG GCGCGCAAAT CAGCCCCACC
ACGCCGCTGA TGGCCGTGGT GCCTGCAACC GATCTGTGGG TGGACGCTAA CTTTAAAGAA
ACCCAATTAG CGAATATGCG CATTGGGCAG CCAGTGACGG TGATTACTGA TATTTATGGC
GACGACGTAA AATACACCGG TAAAGTCGTC GGTCTGGATA TGGGAACAGG CAGCGCCTTC
TCCCTGCTGC CCGCGCAAAA TGCGACGGGT AACTGGATTA AAGTGGTTCA ACGTCTGCCG
GTACGCGTCG AACTGGACGC CCGCCAGTTA GAACAACATC CGCTGCGTAT TGGTTTATCG
ACGCTGGTCA CCGTGGATAC CGCTAATCGC GACGGTCAGG TACTGGCCAG CCAGGTACGA
ACGACGCCGG TTGCCGAAAG TAACGCACGC GAAATTAATC TCGCGCCGGT CAATAAACTG
ATCGACGACA TCGTACAGGC TAACGCGGGT TAA
 
Protein sequence
MSSNAEIQTP QQPAKKKGKR KTALLLLTLL FVIIAVAYGI YWFLVLRHIE ETDDAYVAGN 
QVQIMAQVSG SVTKVWADNT DFVKEGDVLV TLDQTDAKQA FERAKTALAS SVRQTHQLMI
NSKQLQANID VQKTALAQAQ SDLNRRVPLG NANLIGREEL QHARDAVASA QAQLDVAIQQ
YNANQAMILN SNLEDQPAVQ QAATEVRNAW LALERTRIVS PMTGYVSRRA VQPGAQISPT
TPLMAVVPAT DLWVDANFKE TQLANMRIGQ PVTVITDIYG DDVKYTGKVV GLDMGTGSAF
SLLPAQNATG NWIKVVQRLP VRVELDARQL EQHPLRIGLS TLVTVDTANR DGQVLASQVR
TTPVAESNAR EINLAPVNKL IDDIVQANAG