Gene SeD_A4920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4920 
Symbol 
ID6874102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4756929 
End bp4758170 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content57% 
IMG OID642787796 
Productmultidrug resistance protein MdtM 
Protein accessionYP_002218389 
Protein GI198244192 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.998541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.506347 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGGA TTATTCAGTT TTTCTCCCAA CGCGCCACAA CGTTATTTTT CCCGATGGCG 
CTGATTTTGT ACGATTTTGC CGCCTATCTG ACGACGGATC TGATTCAGCC TGGCATCATT
AACGTCGTGC GCGATTTTAA TGCTGACGTC AGTCTTGCGC CGGCCTCGGT CAGTCTCTAT
CTGGCGGGAG GAATGGCGTT GCAATGGCTT CTGGGGCCGC TATCCGACAG GATTGGCCGC
CGGCCAGTGC TGATTGCAGG CGCATTAATT TTTACTCTCG CCTGCGCGGC AACGCTGTTG
ACGACCTCAA TGACGCAGTT TCTGGTTGCC CGTTTTGTGC AGGGCACCAG CATTTGCTTT
ATCGCAACGG TCGGCTACGT CACGGTACAG GAGGCCTTCG GTCAAACCAA AGCCATTAAG
CTGATGGCGA TTATTACCTC CATTGTGCTG GTCGCTCCGG TTATCGGCCC GCTCTCCGGC
GCCGCGCTGA TGCACTTCGT TCACTGGAAG GTACTGTTCG GTATTATTGC GGTGATGGGA
CTGTTGGCAT TGTGCGGCCT GCTGCTGGCG ATGCCAGAAA CGGTGCAACG CGGCGCGGTG
CCGTTCAGCG CGGTGAGCGT ACTGCGCGAT TTTCGTAATG TGTTTCGCAA CCCGATTTTT
CTCACCGGGG CGGCGACGCT GTCGTTGAGC TACATCCCGA TGATGAGCTG GGTGGCGGTG
TCGCCGGTGA TCCTGATCGA TGCCGGCGGG ATGAGTACTT CGCAATTCGC CTGGGCGCAG
GTGCCCGTAT TCGGCGCGGT GATCGTGGCC AATATGATCG TTGTGCGCCT GGTGAAAGAT
CCGACCCGAC CGCGTTTTAT CTGGCGCGCC GTACCAATCC AGTTAAGCGG GCTGGCGACA
TTGCTCCTCG GTAATCTTCT TCTGCCGCAC GTCTGGCTTT GGTCTGTGCT GGGCACCAGC
CTGTACGCGT TCGGCATTGG CATGATTTTC CCGACGCTGT TTCGCTTCAC GCTTTTTTCC
AACAACTTGC CAAAAGGGAC AGTTTCAGCC TCGCTGAACA TGGTGATTCT GACGGTGATG
GCGGTGTCGG TCGAAGTTGG CCGCTGGCTG TGGTTTCACG GCGGGAGACT GCCGTTTCAC
CTGCTGGCAG CGGTGGCCGG GGTGATTGTG GTCTTCACTC TGGCGACACT ATTACAGCGC
GTGCGCCAGC ATGAGGCGGC AGAACTGGCC GCTGAGAAGT AA
 
Protein sequence
MQRIIQFFSQ RATTLFFPMA LILYDFAAYL TTDLIQPGII NVVRDFNADV SLAPASVSLY 
LAGGMALQWL LGPLSDRIGR RPVLIAGALI FTLACAATLL TTSMTQFLVA RFVQGTSICF
IATVGYVTVQ EAFGQTKAIK LMAIITSIVL VAPVIGPLSG AALMHFVHWK VLFGIIAVMG
LLALCGLLLA MPETVQRGAV PFSAVSVLRD FRNVFRNPIF LTGAATLSLS YIPMMSWVAV
SPVILIDAGG MSTSQFAWAQ VPVFGAVIVA NMIVVRLVKD PTRPRFIWRA VPIQLSGLAT
LLLGNLLLPH VWLWSVLGTS LYAFGIGMIF PTLFRFTLFS NNLPKGTVSA SLNMVILTVM
AVSVEVGRWL WFHGGRLPFH LLAAVAGVIV VFTLATLLQR VRQHEAAELA AEK