Gene Information |
Locus tag | SNSL254_A3012 |
Symbol | |
ID | 6483714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2936081 |
End bp | 2937265 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642738328 |
Product | major facilitator family transporter |
Protein accession | YP_002042057 |
Protein GI | 194443474 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
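The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check in Python. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Coordinates taken from this record (locus tag SNSL254_A3012).
gene_len, protein_len = cds_lengths(2936081, 2937265)
print(gene_len, protein_len)  # prints: 1185 394
```

The result agrees with the Gene Length (1185 bp) and Protein Length (394 aa) fields.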
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.00472083 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGACGAAAC CCACTCATGG GCTTAGCCCG GCGCTGATCG TTTTAATGTC TGTGGCCACG GGGCTGGCGG TCGCCAGCAA CTACTACGCC CAGCCGCTGC TTGATACCAT CGCGCATCAC TTTTCGCTTT CCGCCAGCTC CGCAGGGTTT ATCGTTACCG CCGCGCAGTT GGGCTATGCC GCTGGCCTGT TGTTTCTGGT GCCGCTCGGC GACATGTTTG AACGCCGAAC GCTGATTGTC TCCATGACGT TGCTGGCGGC TGGCGGAATG CTGATCACCG CCAGCAGTCA GTCGCTTAGC ATGATGATAC TCGGAACGGC CTTAACCGGA CTGTTCTCCG TGGTGGCGCA GATTCTGGTT CCGCTGGCCG CCACACTTGC GACGCCCGCC ACCCGCGGTA AAGTGGTCGG CACCATTATG AGCGGCCTGT TGCTGGGGAT CCTGCTGGCG CGAACGGTCG CCGGACTGCT GGCAAACCTC GGCGGTTGGC GCACCGTATT TTGGGTAGCG TCGGCGCTGA TGGCGCTGAT GGCCGTCGCG TTATGGCGCG GACTGCCAAA GCTCAAATCC GACACCCATC TTAACTACCC GCAACTGTTG GGTTCTGTAT TCAGCCTGTT TATTCACGAT AAGCTGCTGC GTACCCGCGC TCTGCTGGGC TGTCTGACCT TTGCTAATTT CAGCATCCTC TGGACATCAA TGGCCTTTTT GCTCGCCGCG CCGCCGTTTA GCTACTCCGA GGGGATGATT GGCCTGTTTG GCCTGGCGGG GGCCGCCGGC GCTTTAGGCG CGCGTCCGGC TGGCGGATTT GCCGATAAAG GTAAATCTCA CCTCACCACC ACGTTCGGCT TACTGCTGCT GTTACTCTCC TGGCTGGCTA TCTGGCTTGG GCACACCTCG GTACTGGCGC TGATTATTGG CATTCTGGTA CTGGACCTCA CCGTTCAGGG GGTACATATC ACCAATCAGA CGGTCATCTA TCGTTTGCAT CCGGATGCGC GTAACCGGCT CACCGCCGGC TATATGACCA GCTACTTTAT CGGTGGCGCC GCGGGGTCGC TGATTTCCGC CTCCGCCTGG CAACATGCCG GCTGGGCCGG CGTTTGTCTG GCGGGTGTCA CGGTAGCCTT ACTTAATTTA CTGGTCTGGT GGCGAGGTTT TCACCGACAG GAAGCCGTAA ATTAA
Protein sequence | MTKPTHGLSP ALIVLMSVAT GLAVASNYYA QPLLDTIAHH FSLSASSAGF IVTAAQLGYA AGLLFLVPLG DMFERRTLIV SMTLLAAGGM LITASSQSLS MMILGTALTG LFSVVAQILV PLAATLATPA TRGKVVGTIM SGLLLGILLA RTVAGLLANL GGWRTVFWVA SALMALMAVA LWRGLPKLKS DTHLNYPQLL GSVFSLFIHD KLLRTRALLG CLTFANFSIL WTSMAFLLAA PPFSYSEGMI GLFGLAGAAG ALGARPAGGF ADKGKSHLTT TFGLLLLLLS WLAIWLGHTS VLALIIGILV LDLTVQGVHI TNQTVIYRLH PDARNRLTAG YMTSYFIGGA AGSLISASAW QHAGWAGVCL AGVTVALLNL LVWWRGFHRQ EAVN
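The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch, in Python, of removing that display grouping and computing a GC fraction. The helper names `ungroup` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence, not the full gene.

```python
def ungroup(seq):
    """Strip the whitespace used for display grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = ungroup(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the Gene sequence field, for illustration only.
prefix = "ATGACGAAAC CCACTCATGG"
print(len(ungroup(prefix)), gc_fraction(prefix))  # prints: 20 0.5
```

Passing the full Gene sequence value to `gc_fraction` gives a figure that can be compared with the GC content field, assuming that field is computed over the gene itself.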