Gene SNSL254_A4079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4079 
SymbolemrD 
ID6485374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3970273 
End bp3971457 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content58% 
IMG OID642739336 
Productmultidrug resistance protein D 
Protein accessionYP_002043045 
Protein GI194445701 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGGC AGAGAAACGT CAATTTGTTG TTGATGTTGG TGTTACTGGT GGCCGTAGGG 
CAGATGGCGC AAACCATTTA TATTCCCGCC ATCGCCGATA TGGCACAAGC GCTAAACGTC
CGGGAAGGCG CCGTCCAGAG CGTAATGGCT GCTTACCTCC TGACCTACGG CGTCTCGCAA
CTGTTTTACG GCCCGCTTTC CGACCGGGTT GGGCGCCGCC CCGTAATCCT CGTCGGCATG
TCTATTTTTA TGGTAGCGAC CCTGTTCGCC ATGACCACGC ATAGTTTGAC GGTGTTGATT
GCCGCCAGCG CCATGCAAGG GATGGGAACC GGCGTTGGCG GAGTAATGGC GAGAACGCTC
CCGCGCGATC TGTATGAAGG AACGCAACTT CGTCACGCCA ATAGCCTGTT AAATATGGGG
ATTCTGGTCA GCCCGCTGTT AGCGCCGCTG ATTGGCGGTC TGCTGGACAC CCTGTGGAAC
TGGCGCGCGT GTTACGCTTT CCTGCTGGTG CTTTGCGCTG GCGTCACCTT CAGCATGGCG
CGCTGGATGC CGGAAACCCG CCCCGCCGGC GCGCCGCGCA CACGGCTGAT CGCCAGCTAT
AAAACGCTGT TTGGCAACGG CGCATTTAAC TGTTATCTGC TGATGCTAAT CGGCGGGCTG
GCTGGCATTG CGGTCTTTGA AGCCTGTTCC GGCGTGCTGA TGGGGGCAGT ATTAGGTCTC
AGCAGTATGG TGGTAAGCAT TCTGTTTATT CTGCCGATTC CGGCGGCGTT CTTCGGCGCC
TGGTTTGCCG GACGCCCGAA TAAACGCTTC TCAACCCTGA TGTGGCAGTC AGTTATTTGC
TGTCTGCTGG CAGGCCTTAT GATGTGGATT CCCGGCTGGT TTGGCGTGAT GAACGTCTGG
ACGCTACTCA TCCCCGCTGC GCTGTTTTTC TTCGGCGCCG GGATGTTATT TCCACTGGCC
ACCAGCGGCG CGATGGAGCC GTTTCCGTTC CTCGCAGGCA CCGCTGGCGC GCTGGTCGGC
GGGCTGCAAA ATATTGGTTC CGGCGTACTG GCGTGGCTTT CGGCAATGCT GCCGCAAACC
GGTCAGGGCA GTCTGGGATT GCTGATGACC CTTATGGGAT TGCTGATCCT GGCGTGCTGG
CTGCCGCTGG CGTCACGGAT ATCGCATCAG GGGCAGACGG TTTAA
 
Protein sequence
MKRQRNVNLL LMLVLLVAVG QMAQTIYIPA IADMAQALNV REGAVQSVMA AYLLTYGVSQ 
LFYGPLSDRV GRRPVILVGM SIFMVATLFA MTTHSLTVLI AASAMQGMGT GVGGVMARTL
PRDLYEGTQL RHANSLLNMG ILVSPLLAPL IGGLLDTLWN WRACYAFLLV LCAGVTFSMA
RWMPETRPAG APRTRLIASY KTLFGNGAFN CYLLMLIGGL AGIAVFEACS GVLMGAVLGL
SSMVVSILFI LPIPAAFFGA WFAGRPNKRF STLMWQSVIC CLLAGLMMWI PGWFGVMNVW
TLLIPAALFF FGAGMLFPLA TSGAMEPFPF LAGTAGALVG GLQNIGSGVL AWLSAMLPQT
GQGSLGLLMT LMGLLILACW LPLASRISHQ GQTV