Gene SeD_A3953 details


Gene Information

Locus tag: SeD_A3953
Symbol:
ID: 6873586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3788001
End bp: 3789218
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 59%
IMG OID: 642786912
Product: major facilitator superfamily transporter
Protein accession: YP_002217540
Protein GI: 198243187
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
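
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3788001, 3789218)
print(length)                     # 1218
print(protein_length_aa(length))  # 405
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.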


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.515945
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 96
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGAAC CCGTCGCTGA ACCGGCGCTA AACGGATTGC GCCTCAATCT GCGTATTGTC 
TCCATTGTGA TGTTTAACTT TGCCAGCTAC CTGACCATCG GCCTGCCGCT CGCCGTCTTG
CCCGGCTATG TGCATGATGC GATGGGATTC AGCGCGTTCT GGGCGGGGCT TATTATCAGC
CTGCAATACT TCGCCACTCT GTTAAGCCGT CCCCATGCCG GGCGGTATGC GGATGTATTA
GGGCCGAAAA AAATCGTTGT CTTTGGCTTA TGCGGCTGTT TTTTAAGCGG ACTCGGCTAC
CTGCTGGCGG ATATCGCCAG CGCCTGGCCG ATGATCAGTT TGTTGCTACT GGGGCTGGGT
CGCGTGATTT TGGGGATTGG GCAAAGTTTT GCCGGCACCG GTTCGACACT ATGGGGCGTC
GGCGTCGTCG GGTCGTTGCA TATTGGTCGG GTCATCTCCT GGAACGGTAT CGTCACCTAC
GGCGCAATGG CGATGGGCGC GCCGCTGGGC GTGCTGTGTT ATGCCTGGGG CGGGTTACAG
GGACTGGCGC TAACGGTGAT GGGCGTGGCG CTGTTGGCGG TACTGTTAGC CCTTCCACGT
CCGTCGGTGA AGGCGAACAA AGGCAAGCCG CTGCCGTTTC GCGCGGTGCT GGGGCGTGTC
TGGCTGTATG GTATGGCGTT GGCGCTGGCC TCGGCAGGGT TTGGCGTCAT CGCGACGTTT
ATTACCTTAT TTTATGATGC TAAAGGCTGG GATGGCGCCG CCTTTGCGCT CACGTTATTT
AGCGTCGCGT TTGTCGGCAC GCGTTTGCTG TTCCCTAACG GCATCAATCG TTTAGGCGGG
TTGAATGTCG CCATGATCTG CTTTGGCGTG GAGATTATTG GTCTGTTACT TGTGGGGACG
GCAGCCATGC CGTGGATGGC AAAAATCGGC GTTTTACTCA CGGGGATGGG GTTTTCGCTG
GTCTTTCCGG CGCTGGGCGT GGTGGCCGTC AAAGCCGTGC CGCCGCAGAA CCAGGGCGCG
GCGCTGGCGA CCTATACCGT CTTTATGGAT ATGTCTTTGG GGATTACCGG GCCGCTGGCG
GGGCTGGTGA TGACCTGGGC GGGCGTGCCG GTGATTTATC TGGCGGCAGC CGGGCTGGTA
ACGATGGCGC TATTGCTGAC TTGGCGCTTA AAAAAACGGC CTCCGTCTGC ACTGCCGGAG
GCCGCATCAT CATCGTAA
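
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed. The helper names are hypothetical, and only the first line of the sequence is used here for brevity, so the printed percentage applies to that line and not to the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGCCCGAAC CCGTCGCTGA ACCGGCGCTA AACGGATTGC GCCTCAATCT GCGTATTGTC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this 60-base line only
```

Passing the full 1218 bp sequence to the same functions would give a figure that could be compared with the GC content field of the record.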
 
Protein sequence
MPEPVAEPAL NGLRLNLRIV SIVMFNFASY LTIGLPLAVL PGYVHDAMGF SAFWAGLIIS 
LQYFATLLSR PHAGRYADVL GPKKIVVFGL CGCFLSGLGY LLADIASAWP MISLLLLGLG
RVILGIGQSF AGTGSTLWGV GVVGSLHIGR VISWNGIVTY GAMAMGAPLG VLCYAWGGLQ
GLALTVMGVA LLAVLLALPR PSVKANKGKP LPFRAVLGRV WLYGMALALA SAGFGVIATF
ITLFYDAKGW DGAAFALTLF SVAFVGTRLL FPNGINRLGG LNVAMICFGV EIIGLLLVGT
AAMPWMAKIG VLLTGMGFSL VFPALGVVAV KAVPPQNQGA ALATYTVFMD MSLGITGPLA
GLVMTWAGVP VIYLAAAGLV TMALLLTWRL KKRPPSALPE AASSS
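
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below shows one way such a translation could be reproduced without external libraries. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code. Table 11 differs mainly in which codons may act as start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases and last 18 bases of the gene sequence in this record.
print(translate_cds("ATGCCCGAAC CCGTCGCTGA A"))  # MPEPVAE
print(translate_cds("GCCGCATCAT CATCGTAA"))      # AASSS
```

Both fragments match the start and end of the protein sequence listed above. The second fragment is in frame because every preceding line of the gene sequence holds 60 bases.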