Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2526 |
Symbol | |
ID | 6872822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2405943 |
End bp | 2407301 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642785606 |
Product | 4-hydroxybenzoate transporter |
Protein accession | YP_002216264 |
Protein GI | 198245347 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00895] benzoate transport |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAAC GACGTGATCT ACAAGCCCTT ATTGATGCCG CGCCCGTCGG CAAAATGCAG TGGCGCGTTA TCATCTGCTG TTTTCTGGTG GTTATGCTCG ACGGTTTCGA CACCGCCGCG ATTGGCTTCA TCGCCCCGGA TATTCGTACC CACTGGCAGC TAAGCGCCAG CGAACTTGCG CCGCTGTTTG GCGCAGGGCT GCTGGGGCTT ACGGCCGGCG CGCTGCTATG CGGGCCGCTG GCGGATCGCT TTGGCCGCAA GCGGGTCATT GAGCTTTGCG TGGCGCTATT CGGCGCATTG AGTCTGCTTT CCGCTTTCTC GCCGGATATA GAAACCTTGG TGTTGCTGCG CTTCTTAACC GGTCTGGGAC TGGGCGGAGC GATGCCGAAT ACCATCACCA TGACGTCGGA ATACCTTCCC GCTCGTCGAC GCGGAGCGCT GGTCACGCTG ATGTTCTGCG GTTTTACCCT GGGGTCGGCG ATGGGCGGGA TTGTGAGCGC GCAACTGGTG CCGCTGATTG GCTGGCACGG AATTCTGGCG TTAGGCGGCA TCTTGCCTTT GCTGCTGTTT TTCGGCCTGC TGTTCGCGCT GCCGGAATCT CCCCGCTGGC AGGTACGTCG CCAACTACCG CAAGCCGTTG TCGCCCGGAC GGTCAGCGCC ATTACCGGCG AGCGCTATCA CGATACGCAA TTCTTTCTGC ATGAGACGGC AGCCGTCGCC AAAGGCAGTA TTCGCCAGCT TTTTGCCGGG CGACAGCTTG TCATTACCCT GATGTTATGG GTGGTGTTCT TTATGAGCCT GCTCATTATC TATCTGCTTT CCAGCTGGAT GCCGACGTTA CTTAACCATC GCGGTATTAA TCTGCAACAG GCGTCGTGGG TGACTGCCGC ATTCCAGGTT GGCGGCACGC TTGGCGCGCT GTTACTCGGC GTGTTGATGG ATCGGCTTAA CCCGTTCCGG GTACTGGCGG TGAGCTATGC GCTGGGCGCG GTTTGCATTG TCATGATAGG CCTGAGCGAA AACGGCCTTT GGCTGATGGC GCTGGCGATT TTTGGTACCG GCATCGGTAT TAGCGGTTCA CAGGTCGGGC TGAATGCTCT GACGGCGACG CTGTACCCCA CCCAAAGCCG GGCGACGGGC GTGAGCTGGT CGAACGCCAT TGGACGCTGC GGGGCGATTG TCGGTTCGCT CTCCGGCGGC ATGATGATGG CCCTCAATTT CTCTTTCGAT ACGTTGTTTT TTGTCATTGC TATTCCGGCG GCTATCAGCG CGGTAATGCT TACCCTGCTG ACGGTGGTTG TCCGCCTTTC GATTTCTGTA CCTGACGACC TGCCGCGTGC CAGCGTCGTA AACGAATAA
|
Protein sequence | MTQRRDLQAL IDAAPVGKMQ WRVIICCFLV VMLDGFDTAA IGFIAPDIRT HWQLSASELA PLFGAGLLGL TAGALLCGPL ADRFGRKRVI ELCVALFGAL SLLSAFSPDI ETLVLLRFLT GLGLGGAMPN TITMTSEYLP ARRRGALVTL MFCGFTLGSA MGGIVSAQLV PLIGWHGILA LGGILPLLLF FGLLFALPES PRWQVRRQLP QAVVARTVSA ITGERYHDTQ FFLHETAAVA KGSIRQLFAG RQLVITLMLW VVFFMSLLII YLLSSWMPTL LNHRGINLQQ ASWVTAAFQV GGTLGALLLG VLMDRLNPFR VLAVSYALGA VCIVMIGLSE NGLWLMALAI FGTGIGISGS QVGLNALTAT LYPTQSRATG VSWSNAIGRC GAIVGSLSGG MMMALNFSFD TLFFVIAIPA AISAVMLTLL TVVVRLSISV PDDLPRASVV NE
|
| |