Gene SeD_A2526 details

Gene Information

Locus tag: SeD_A2526
Symbol:
ID: 6872822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2405943
End bp: 2407301
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 59%
IMG OID: 642785606
Product: 4-hydroxybenzoate transporter
Protein accession: YP_002216264
Protein GI: 198245347
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00895] benzoate transport
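
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2405943
end_bp = 2407301
reported_gene_length = 1359      # bp
reported_protein_length = 452    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons.
# Assumption: the last codon is the stop codon, which adds no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1359 0 452
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.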


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCAAC GACGTGATCT ACAAGCCCTT ATTGATGCCG CGCCCGTCGG CAAAATGCAG 
TGGCGCGTTA TCATCTGCTG TTTTCTGGTG GTTATGCTCG ACGGTTTCGA CACCGCCGCG
ATTGGCTTCA TCGCCCCGGA TATTCGTACC CACTGGCAGC TAAGCGCCAG CGAACTTGCG
CCGCTGTTTG GCGCAGGGCT GCTGGGGCTT ACGGCCGGCG CGCTGCTATG CGGGCCGCTG
GCGGATCGCT TTGGCCGCAA GCGGGTCATT GAGCTTTGCG TGGCGCTATT CGGCGCATTG
AGTCTGCTTT CCGCTTTCTC GCCGGATATA GAAACCTTGG TGTTGCTGCG CTTCTTAACC
GGTCTGGGAC TGGGCGGAGC GATGCCGAAT ACCATCACCA TGACGTCGGA ATACCTTCCC
GCTCGTCGAC GCGGAGCGCT GGTCACGCTG ATGTTCTGCG GTTTTACCCT GGGGTCGGCG
ATGGGCGGGA TTGTGAGCGC GCAACTGGTG CCGCTGATTG GCTGGCACGG AATTCTGGCG
TTAGGCGGCA TCTTGCCTTT GCTGCTGTTT TTCGGCCTGC TGTTCGCGCT GCCGGAATCT
CCCCGCTGGC AGGTACGTCG CCAACTACCG CAAGCCGTTG TCGCCCGGAC GGTCAGCGCC
ATTACCGGCG AGCGCTATCA CGATACGCAA TTCTTTCTGC ATGAGACGGC AGCCGTCGCC
AAAGGCAGTA TTCGCCAGCT TTTTGCCGGG CGACAGCTTG TCATTACCCT GATGTTATGG
GTGGTGTTCT TTATGAGCCT GCTCATTATC TATCTGCTTT CCAGCTGGAT GCCGACGTTA
CTTAACCATC GCGGTATTAA TCTGCAACAG GCGTCGTGGG TGACTGCCGC ATTCCAGGTT
GGCGGCACGC TTGGCGCGCT GTTACTCGGC GTGTTGATGG ATCGGCTTAA CCCGTTCCGG
GTACTGGCGG TGAGCTATGC GCTGGGCGCG GTTTGCATTG TCATGATAGG CCTGAGCGAA
AACGGCCTTT GGCTGATGGC GCTGGCGATT TTTGGTACCG GCATCGGTAT TAGCGGTTCA
CAGGTCGGGC TGAATGCTCT GACGGCGACG CTGTACCCCA CCCAAAGCCG GGCGACGGGC
GTGAGCTGGT CGAACGCCAT TGGACGCTGC GGGGCGATTG TCGGTTCGCT CTCCGGCGGC
ATGATGATGG CCCTCAATTT CTCTTTCGAT ACGTTGTTTT TTGTCATTGC TATTCCGGCG
GCTATCAGCG CGGTAATGCT TACCCTGCTG ACGGTGGTTG TCCGCCTTTC GATTTCTGTA
CCTGACGACC TGCCGCGTGC CAGCGTCGTA AACGAATAA
 
Protein sequence
MTQRRDLQAL IDAAPVGKMQ WRVIICCFLV VMLDGFDTAA IGFIAPDIRT HWQLSASELA 
PLFGAGLLGL TAGALLCGPL ADRFGRKRVI ELCVALFGAL SLLSAFSPDI ETLVLLRFLT
GLGLGGAMPN TITMTSEYLP ARRRGALVTL MFCGFTLGSA MGGIVSAQLV PLIGWHGILA
LGGILPLLLF FGLLFALPES PRWQVRRQLP QAVVARTVSA ITGERYHDTQ FFLHETAAVA
KGSIRQLFAG RQLVITLMLW VVFFMSLLII YLLSSWMPTL LNHRGINLQQ ASWVTAAFQV
GGTLGALLLG VLMDRLNPFR VLAVSYALGA VCIVMIGLSE NGLWLMALAI FGTGIGISGS
QVGLNALTAT LYPTQSRATG VSWSNAIGRC GAIVGSLSGG MMMALNFSFD TLFFVIAIPA
AISAVMLTLL TVVVRLSISV PDDLPRASVV NE
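
As a quick consistency check between the two sequences, the first line of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the database's own method: it assumes the gene sequence is shown in coding orientation, and it builds the codon table from the standard amino acid assignments, which translation table 11 shares for all 64 codons. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First line of the gene sequence, as printed in blocks of ten bases.
first_line = "ATGACTCAAC GACGTGATCT ACAAGCCCTT ATTGATGCCG CGCCCGTCGG CAAAATGCAG"

print(translate(first_line))  # MTQRRDLQALIDAAPVGKMQ
```

The result matches the first twenty residues of the protein sequence listed above.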