Gene Information |
Locus tag | SNSL254_A4170 |
Symbol | |
ID | 6483183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4063792 |
End bp | 4065219 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642739426 |
Product | inner membrane transport protein YieO |
Protein accession | YP_002043135 |
Protein GI | 194442388 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
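The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the stop codon. Neither convention is stated in the record, though both are consistent with the reported 1428 bp and 475 aa. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Protein length, assuming the CDS length includes one stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 4063792, 4065219
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 1428 475
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.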
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0309418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGCA AAAAAGCGCG CAGTATGGCT GGCCTGCCGT GGATCGCGGC GATGGCCTTC TTTATGCAGG CGCTAGACGC CACCATTTTG AATACCGCTT TACCGGCTAT TGCGCATAGT CTTAACCGTT CCCCACTCGC CATGCAGTCC GCTATTATCA GTTATACCCT GACAGTAGCC ATGTTAATTC CGGTAAGCGG CTGGCTGGCT GACCGCTTCG GTACGCGTCG CGTCTTTATG GTTGCCGTAA GTCTGTTTAC GTTAGGCTCG CTGGCCTGCG CGCTCTCCAG TTCTCTATCT GAATTGGTTA TTTTTCGCGT CGTACAAGGC GTCGGCGGAG CGATGATGAT GCCAGTGGCG CGGCTGGCGT TATTGCGAGC CTATCCGCGT AGTGAGCTGC TGCCCGTTCT TAACTTTGTC ACTATGCCGG GGCTGGTAGG TCCGATTCTG GGGCCGGTAT TAGGCGGCGT ACTGGTGACC TGGGCAAGCT GGCACTGGAT CTTCCTGATT AATATTCCCA TTGGTGTTGC AGGCATTCTG TATGCCCGCA AATATATGCC CAACTTCACC ACGCCGCGTC GCAAGTTTGA CATGATCGGC TTCTTTCTTT TTGGGTTAAG TCTGGTTTTA TTTTCCAGCG GAATGGAACT GTTTGGCGAA AAGATTGTGG CGACATGGAT CGCATCCGCC ATTATTTTTT GCAGTATCGT TCTACTATTG GCCTATATCC GCCACGCCCG CCGTCATCCG ACACCGTTAA TATCACTATC GCTGTTTAAG ACACGCACAT TTTCCGTTGG CATTGCCGGC AACCTCGCCA CGCGTCTGGG GACAGGCTGC GTACCTTTTT TGATTCCGTT AATGCTACAG GTTGGTTTTG GCTACCCGGC TCTGATCGCC GGCTGCATGA TGGCGCCGAC AGCTTTGGGT TCTATTATCG CGAAATCGAC CGTAACGCAG GTTTTGCGAC GTTTGGGATA CCGGAAGACT CTGGTCAGCA TTACGGTATT TATCGGTCTG ATGATTGCCC AGTTTTCGTT TCAATCTCCC GCCATGCCGA TCTGGATGCT CGTACTGCCG CTATTCATTC TTGGAATGGC GATGTCCACT CAATTTACGG CAATGAATAC GATTACTCTT GCGGATCTGA CAGACGATAA CGCCAGCAGC GGCAACAGCG TTCTGGCGGT TACGCAGCAA TTGTCTATCA GTTTAGGCGT GGCGATCAGC GCGGCGGTGT TACGTATTTA TGAAGGTTTT GCGGGCACCA GTACCGTGGA ACAGTTCCAC TATACCTTTA TCACGATGGG GGCGATCACT ATCGTGTCCG CGCTGATGTT TATGCTGCTA AGAGCCAAAG ACGGCAACAA TCTGATTAAA GAACGGCATA AATCTAAACC GACCCACGCA CCGTCAAAAT CGGAGTAA
|
Protein sequence | MTSKKARSMA GLPWIAAMAF FMQALDATIL NTALPAIAHS LNRSPLAMQS AIISYTLTVA MLIPVSGWLA DRFGTRRVFM VAVSLFTLGS LACALSSSLS ELVIFRVVQG VGGAMMMPVA RLALLRAYPR SELLPVLNFV TMPGLVGPIL GPVLGGVLVT WASWHWIFLI NIPIGVAGIL YARKYMPNFT TPRRKFDMIG FFLFGLSLVL FSSGMELFGE KIVATWIASA IIFCSIVLLL AYIRHARRHP TPLISLSLFK TRTFSVGIAG NLATRLGTGC VPFLIPLMLQ VGFGYPALIA GCMMAPTALG SIIAKSTVTQ VLRRLGYRKT LVSITVFIGL MIAQFSFQSP AMPIWMLVLP LFILGMAMST QFTAMNTITL ADLTDDNASS GNSVLAVTQQ LSISLGVAIS AAVLRIYEGF AGTSTVEQFH YTFITMGAIT IVSALMFMLL RAKDGNNLIK ERHKSKPTHA PSKSE
|
| |
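The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used to produce the record: it assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons; raises ValueError on non-ACGT characters."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACCAGCA AAAAAGCGCG CAGTATGGCT"))  # MTSKKARSMA
```

The printed residues match the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.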