Gene SeAg_B2022 details


Gene Information

Locus tag: SeAg_B2022
Symbol:
ID: 6793404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 1961857
End bp: 1963065
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 56%
IMG OID: 642776246
Product: multidrug resistance protein MdtH
Protein accession: YP_002146877
Protein GI: 197251455
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
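
The start, end, gene length, and protein length fields above agree with one another if the coordinates are read as 1-based and inclusive and the CDS ends in a single stop codon. A minimal sketch of that consistency check follows; both conventions are assumptions rather than something the record states, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1961857, 1963065)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the check gives 1209 bp and 402 aa, matching the Gene Length and Protein Length fields.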


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCGCG TCTCGCAGGC GAGGAACCTG GGTAAATATT TTCTTCTCAT CGATAACATG 
TTGGTGGTGC TGGGTTTTTT CGTCGTCTTC CCGCTCATCT CTATTCGCTT TGTCGATCAA
ATGGGGTGGG CTGCCGTAAT GGTAGGGATC GCGCTCGGCC TGCGCCAGTT TATTCAACAA
GGTCTGGGCA TTTTTGGCGG CGCCATCGCC GATCGCTTTG GCGCGAAACC GATGATTGTC
ACCGGTATGC TGATGCGCGC CGCAGGCTTT GCCACCATGG GTATCGCGCA TGAGCCCTGG
CTCTTGTGGT TTTCCTGCTT TCTTTCCGGT CTCGGCGGTA CGCTTTTCGA CCCGCCGCGT
TCAGCGCTGG TGGTCAAATT AATTCGTCCG GAGCAACGGG GCCGCTTCTT CTCTCTGTTG
ATGATGCAGG ACAGCGCGGG CGCGGTGATT GGCGCGCTGC TGGGAAGCTG GTTGCTACAA
TACGATTTTC GCCTGGTCTG CGCGACGGGC GCTATTTTGT TCATATTATG CGCCCTTTTC
AATGCATGGC TGCTTCCGGC CTGGAAGCTA TCAACGGTCA GAACGCCGGT GCGTGAAGGA
ATGCGCCGCG TCATGAGCGA TAAAAGGTTT GTCACCTACG TGCTGACGCT GGCGGGCTAC
TATATGCTGG CGGTACAGGT CATGTTAATG CTGCCGATTA TGGTAAACGA TATCGCCGGT
TCGCCTGCTG CCGTGAAATG GATGTACGCT ATTGAGGCGT GTCTCTCGCT GACGTTGCTC
TACCCGATTG CCCGCTGGAG CGAAAAGCGT TTTCGGCTGG AGCATCGGCT GATGGCCGGT
TTGCTCGTCA TGTCGCTGAG CATGCTCCCC ATCGGGATGG TGGGCAATTT ACAGCAGCTT
TTTACGCTTA TTTGCGCTTT CTACATCGGC TCGGTTATCG CCGAACCGGC GCGCGAAACG
CTCAGCGCGT CGCTCGCAGA CGCGAGGGCG CGGGGAAGCT ATATGGGCTT TAGCCGTCTG
GGATTAGCCA TTGGCGGCGC GATTGGTTAT ATCGGCGGCG GCTGGTTGTT TGATATGGGT
AAAGCGCTTG CGCAGCCTGA ACTACCGTGG ATGATGCTCG GTATTATCGG CTTTATCACC
TTTTTGGCTT TAGGCTGGCA ATTTAGTCAT AAACGCACGC CGCGCCGGAT GCTGGAACCC
GGCGCCTGA
 
Protein sequence
MSRVSQARNL GKYFLLIDNM LVVLGFFVVF PLISIRFVDQ MGWAAVMVGI ALGLRQFIQQ 
GLGIFGGAIA DRFGAKPMIV TGMLMRAAGF ATMGIAHEPW LLWFSCFLSG LGGTLFDPPR
SALVVKLIRP EQRGRFFSLL MMQDSAGAVI GALLGSWLLQ YDFRLVCATG AILFILCALF
NAWLLPAWKL STVRTPVREG MRRVMSDKRF VTYVLTLAGY YMLAVQVMLM LPIMVNDIAG
SPAAVKWMYA IEACLSLTLL YPIARWSEKR FRLEHRLMAG LLVMSLSMLP IGMVGNLQQL
FTLICAFYIG SVIAEPARET LSASLADARA RGSYMGFSRL GLAIGGAIGY IGGGWLFDMG
KALAQPELPW MMLGIIGFIT FLALGWQFSH KRTPRRMLEP GA
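
The gene and protein sequences above can be cross-checked by translating the DNA and comparing the result with the listed protein. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs only in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon; stop codons are emitted as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTCGCGCG TCTCGCAGGC GAGGAACCTG"))
```

Applied to the first three blocks of the gene sequence, `translate` yields MSRVSQARNL, the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.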