Gene SNSL254_A3012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3012 
Symbol 
ID6483714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2936081 
End bp2937265 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content59% 
IMG OID642738328 
Productmajor facilitator family transporter 
Protein accessionYP_002042057 
Protein GI194443474 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.00472083 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGAAAC CCACTCATGG GCTTAGCCCG GCGCTGATCG TTTTAATGTC TGTGGCCACG 
GGGCTGGCGG TCGCCAGCAA CTACTACGCC CAGCCGCTGC TTGATACCAT CGCGCATCAC
TTTTCGCTTT CCGCCAGCTC CGCAGGGTTT ATCGTTACCG CCGCGCAGTT GGGCTATGCC
GCTGGCCTGT TGTTTCTGGT GCCGCTCGGC GACATGTTTG AACGCCGAAC GCTGATTGTC
TCCATGACGT TGCTGGCGGC TGGCGGAATG CTGATCACCG CCAGCAGTCA GTCGCTTAGC
ATGATGATAC TCGGAACGGC CTTAACCGGA CTGTTCTCCG TGGTGGCGCA GATTCTGGTT
CCGCTGGCCG CCACACTTGC GACGCCCGCC ACCCGCGGTA AAGTGGTCGG CACCATTATG
AGCGGCCTGT TGCTGGGGAT CCTGCTGGCG CGAACGGTCG CCGGACTGCT GGCAAACCTC
GGCGGTTGGC GCACCGTATT TTGGGTAGCG TCGGCGCTGA TGGCGCTGAT GGCCGTCGCG
TTATGGCGCG GACTGCCAAA GCTCAAATCC GACACCCATC TTAACTACCC GCAACTGTTG
GGTTCTGTAT TCAGCCTGTT TATTCACGAT AAGCTGCTGC GTACCCGCGC TCTGCTGGGC
TGTCTGACCT TTGCTAATTT CAGCATCCTC TGGACATCAA TGGCCTTTTT GCTCGCCGCG
CCGCCGTTTA GCTACTCCGA GGGGATGATT GGCCTGTTTG GCCTGGCGGG GGCCGCCGGC
GCTTTAGGCG CGCGTCCGGC TGGCGGATTT GCCGATAAAG GTAAATCTCA CCTCACCACC
ACGTTCGGCT TACTGCTGCT GTTACTCTCC TGGCTGGCTA TCTGGCTTGG GCACACCTCG
GTACTGGCGC TGATTATTGG CATTCTGGTA CTGGACCTCA CCGTTCAGGG GGTACATATC
ACCAATCAGA CGGTCATCTA TCGTTTGCAT CCGGATGCGC GTAACCGGCT CACCGCCGGC
TATATGACCA GCTACTTTAT CGGTGGCGCC GCGGGGTCGC TGATTTCCGC CTCCGCCTGG
CAACATGCCG GCTGGGCCGG CGTTTGTCTG GCGGGTGTCA CGGTAGCCTT ACTTAATTTA
CTGGTCTGGT GGCGAGGTTT TCACCGACAG GAAGCCGTAA ATTAA
 
Protein sequence
MTKPTHGLSP ALIVLMSVAT GLAVASNYYA QPLLDTIAHH FSLSASSAGF IVTAAQLGYA 
AGLLFLVPLG DMFERRTLIV SMTLLAAGGM LITASSQSLS MMILGTALTG LFSVVAQILV
PLAATLATPA TRGKVVGTIM SGLLLGILLA RTVAGLLANL GGWRTVFWVA SALMALMAVA
LWRGLPKLKS DTHLNYPQLL GSVFSLFIHD KLLRTRALLG CLTFANFSIL WTSMAFLLAA
PPFSYSEGMI GLFGLAGAAG ALGARPAGGF ADKGKSHLTT TFGLLLLLLS WLAIWLGHTS
VLALIIGILV LDLTVQGVHI TNQTVIYRLH PDARNRLTAG YMTSYFIGGA AGSLISASAW
QHAGWAGVCL AGVTVALLNL LVWWRGFHRQ EAVN