Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2369 |
Symbol | |
ID | 6484292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2286622 |
End bp | 2287980 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642737709 |
Product | 4-hydroxybenzoate transporter |
Protein accession | YP_002041451 |
Protein GI | 194443907 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00895] benzoate transport |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.708273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAAC GACGTGATCT ACAAGCCCTT ATTGATGCCG CGCCCGTCGG CAAAATGCAG TGGCGCGTTA TCATCTGCTG TTTTCTGGTG GTTATGCTCG ACGGTTTCGA CACCGCCGCG ATTGGCTTCA TCGCCCCGGA TATTCGTACT CACTGGCAGC TAAGCGCCAG CGAACTTGCG CCGCTGTTTG GCGCAGGGCT GCTGGGGCTT ACGGCCGGCG CGCTGCTATG CGGGCCGCTG GCGGATCGCT TTGGCCGCAA GCGGGTCATT GAGCTTTGCG TGGCGCTATT CGGCGCATTG AGCCTGCTTT CCGCTTTCTC GCCGGATATA GAAACCCTGG TGTTGCTGCG CTTCTTAACC GGTCTGGGAC TGGGCGGAGC GATGCCGAAT ACCATCACCA TGACGTCGGA ATACCTTCCC GCTCGTCGAC GCGGAGCGCT GGTCACGCTG ATGTTCTGCG GTTTTACCCT GGGGTCGGCG ATGGGCGGGA TTGTGAGCGC GCAACTGGTG CCGCTGATTG GCTGGCACGG AATTCTGGCG CTAGGCGGCA TCTTGCCTTT GATGCTGTTT TTCGGCCTGC TGTTCGCGCT GCCGGAATCT CCCCGCTGGC AGGTACGCCG CCAACTACCG CAAGCCGTTG TCGCCCGGAC GGTCAGCGCC ATTACCGGCG AGCGCTATCA CGATACGCAA TTCTTTCTGC ATGAGACGGC AGCCATCGCC AAAGGCAGTA TTCGCCAGCT TTTTGCCGGG CGACAGCTTG TCATTACCCT GATGTTATGG GTGGTGTTCT TTATGAGCCT GCTCATTATC TATCTGCTTT CCAGCTGGAT GCCGACGTTA CTTAACCATC GCGGTATTGA TCTGCAACAG GCGTCGTGGG TGACTGCCGC ATTCCAGGTT GGCGGCACGC TTGGCGCGCT GTTACTCGGC GTGTTGATGG ACCGGCTTAA CCCGTTCCGG GTACTGGCGG TGAGCTATGC GCTGGGCGCA GTTTGCATTG TCATGATAGG CCTGAGCGAA AACGGCCTTT GGCTGATGGC GCTGGCGATT TTTGGTACCG GCATCGGTAT TAGCGGTTCC CAGGTAGGGC TTAATGCTCT GACGGCGACG CTGTACCCCA CCCAAAGCCG GGCGACGGGC GTGAGCTGGT CGAACGCCAT TGGACGCTGC GGGGCGATTG TCGGTTCGCT CTCCGGCGGC ATGATGATGG CCCTCAATTT CTCTTTCGAT ACGTTGTTTT TTGTCATTGC TATTCCGGCG GCTATCAGCG CGGTAATGCT TACCCTGCTG ACGGTGGTTG TCCGCCTTTC GATTTCTGTA CCTGACGACC TGCCGCGTGC CAGCGTCGTA AACGAATAA
|
Protein sequence | MTQRRDLQAL IDAAPVGKMQ WRVIICCFLV VMLDGFDTAA IGFIAPDIRT HWQLSASELA PLFGAGLLGL TAGALLCGPL ADRFGRKRVI ELCVALFGAL SLLSAFSPDI ETLVLLRFLT GLGLGGAMPN TITMTSEYLP ARRRGALVTL MFCGFTLGSA MGGIVSAQLV PLIGWHGILA LGGILPLMLF FGLLFALPES PRWQVRRQLP QAVVARTVSA ITGERYHDTQ FFLHETAAIA KGSIRQLFAG RQLVITLMLW VVFFMSLLII YLLSSWMPTL LNHRGIDLQQ ASWVTAAFQV GGTLGALLLG VLMDRLNPFR VLAVSYALGA VCIVMIGLSE NGLWLMALAI FGTGIGISGS QVGLNALTAT LYPTQSRATG VSWSNAIGRC GAIVGSLSGG MMMALNFSFD TLFFVIAIPA AISAVMLTLL TVVVRLSISV PDDLPRASVV NE
|
| |