Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1001 |
Symbol | |
ID | 6485079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1010079 |
End bp | 1011227 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642736407 |
Product | putative MFS family transporter protein |
Protein accession | YP_002040166 |
Protein GI | 194444300 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0738] Fucose permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCT ATACCCGTCC CGTCATGCTT TTGCTGTGCG GGCTACTTTT GTTGACTCTG GCCATTGCGG TACTGAATAC GCTTGTGCCG CTGTGGCTTG CTCAGGCAAA CCTTCCGACC TGGCAGGTGG GGATGGTCAG CTCGTCTTAT TTTACCGGCA ATCTGGTCGG GACGTTATTT ACCGGGTATT TAATTAAACG CATTGGGTTT AACCGTAGCT ATTATCTTGC CTCGCTGATC TTCGCCGCGG GTTGTGTCGG ATTGGGGGTG ATGGTGGGGT TCTGGAGCTG GATGAGCTGG CGTTTTATTG CCGGTATCGG CTGCGCCATG ATTTGGGTGG TTGTCGAGAG CGCGTTGATG TGCAGCGGAA CCTCGCATAA TCGCGGGCGC CTGCTGGCTG CCTATATGAT GGTCTATTAT ATGGGGACCT TCCTTGGACA ATTATTGGTC AGTAAAGTAT CTGGTGAATT GCTGCACGTT CTTCCCTGGG TGACCGGAAT GATTCTGGCG GGAATTCTGC CGCTACTCTT TACCCGAATT GTAAATCAGC AAACGCAGGC ACGTCATTCC TCTTCTATTA GCGCCATGCT GAAGCTACGC CAGGCGCGTC TTGGCGTGAA TGGTTGTATT ATTTCCGGCA TTGTTCTTGG TTCATTATAT GGCCTGATGC CGTTATATCT GAAGCATCAG GGGATGGCTA ACGCCAGCAT CGGTTTCTGG ATGGCGGTGC TGGTGAGCGC CGGCATTTTG GGGCAATGGC CAATGGGACG TCTGGCGGAC AAATTTGGTC GCTTGCTGGT GTTACGCGTA CAGGTATTCG TTGTCATACT CGGTAGTATT GCCATGTTAA CCCAGGCGGC GATGGCGCCA GCTCTGTTTA TTCTGGGGGC GGCGGGTTTT ACGCTTTATC CCGTTGCAAT GGCCTGGGCC TGTGAAAAAG TCGAACACCA CCAGCTTGTG GCAATGAACC AGGCGCTGTT GTTAAGTTAT ACGGTAGGGA GCCTGTTGGG GCCGTCTTTT GCTGCGATGT TAATGCAGAA TTATTCAGAT AATCTGCTGT TTATTATGAT CGCCAGCGTA TCGTTTATTT ATCTGCTGAT GCTGTTACGT AACGCCGGCC AGACGCCTAA TCCTGTCGCC CACATCTAA
|
Protein sequence | MSTYTRPVML LLCGLLLLTL AIAVLNTLVP LWLAQANLPT WQVGMVSSSY FTGNLVGTLF TGYLIKRIGF NRSYYLASLI FAAGCVGLGV MVGFWSWMSW RFIAGIGCAM IWVVVESALM CSGTSHNRGR LLAAYMMVYY MGTFLGQLLV SKVSGELLHV LPWVTGMILA GILPLLFTRI VNQQTQARHS SSISAMLKLR QARLGVNGCI ISGIVLGSLY GLMPLYLKHQ GMANASIGFW MAVLVSAGIL GQWPMGRLAD KFGRLLVLRV QVFVVILGSI AMLTQAAMAP ALFILGAAGF TLYPVAMAWA CEKVEHHQLV AMNQALLLSY TVGSLLGPSF AAMLMQNYSD NLLFIMIASV SFIYLLMLLR NAGQTPNPVA HI
|
| |