Gene SNSL254_A4636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4636 
Symbol 
ID6484794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4522071 
End bp4523573 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content53% 
IMG OID642739858 
Productproline/glycine betaine transporter 
Protein accessionYP_002043540 
Protein GI194445267 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAAA GGAAAAAAAT AAAACCGATT ACACTGGGCG ATGTGACCAT CATTGATGAC 
GGTAAACTTC GCAAAGCGAT TACCGCCGCC TCGCTGGGCA ACGCGATGGA GTGGTTTGAT
TTTGGTGTTT ATGGATTTGT TGCCTACGCG TTGGGTAAAG TCTTTTTCCC CGGCGCCGAT
CCCAGCGTTC AAATGATTGC CGCGCTGGCC ACGTTTTCCG TTCCCTTCCT GATTCGTCCG
CTCGGCGGGT TATTCTTTGG TATGCTCGGC GATAAATACG GGCGCCAGAA GATCCTGGCG
ATCACGATTG TGATTATGTC GATCAGTACC TTCTGTATCG GGTTAATCCC CTCTTACGCG
ACGATCGGTA TCTGGGCGCC AATACTGTTG TTGCTGTGTA AAATGGCGCA GGGCTTCTCG
GTTGGCGGGG AATATACCGG CGCGTCGATC TTTGTCGCGG AATATTCGCC GGATCGTAAA
CGCGGATTTA TGGGAAGCTG GCTGGATTTT GGTTCCATCG CCGGGTTCGT GCTGGGCGCG
GGCGTGGTGG TCTTAATCTC GACGATTGTC GGCGAGGAGA ATTTCCTGGA GTGGGGCTGG
CGTATTCCGT TCTTTATCGC CTTGCCATTG GGAATTATCG GTCTCTACTT ACGCCATGCG
CTGGAAGAGA CGCCAGCGTT TCAGCAGCAC GTGGATAAAC TGGAGCAGGG CGACCGCGAA
GGGTTGCAGG ATGGGCCGAA AGTCTCCTTT AAAGAGATTG CCACCAAACA CTGGCGTAGC
CTGTTGTCAT GTATCGGTCT GGTGATTGCC ACCAACGTGA CCTACTACAT GCTACTCACC
TACATGCCGA GCTACCTGTC GCATAACCTG CACTATTCTG AAGATCACGG CGTGTTGATT
ATCATCGCCA TTATGATCGG GATGTTGTTT GTGCAGCCGG TGATGGGGCT GCTGAGCGAC
CGTTTCGGTC GACGTCCATT TGTGATTATG GGCAGCATTG CGCTGTTTGC GCTGGCGATC
CCGGCCTTCA TCCTGATTAA CAGTAACGTT ATTGGCCTGA TTTTTGCCGG TTTGTTGATG
CTGGCGGTGA TTCTGAACTG CTTTACCGGG GTGATGGCCT CGACATTGCC GGCGATGTTT
CCGACGCATA TTCGTTACAG CGCGCTGGCG GCGGCTTTTA ATATCTCTGT ATTGATTGCC
GGTCTGACGC CAACGCTGGC GGCCTGGCTG GTGGAAAGCT CGCAGGATCT GATGATGCCG
GCGTATTATT TGATGGTCAT CGCGGTGATA GGCTTGATTA CCGGTATTTC CATGAAAGAA
ACGGCCAATC GTCCGCTAAA AGGCGCAACG CCAGCGGCGT CGGACATCCA GGAAGCGAAG
GAAATTCTGG GCGAGCATTA CGATAATATT GAGCAGAAAA TCGACGACAT CGATCAGGAA
ATTGCGGAGC TGCAGGTCAA ACGTTCGCGT CTGGTACAGC AACATCCGCG TATCGATGAA
TAA
 
Protein sequence
MLKRKKIKPI TLGDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD 
PSVQMIAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYA
TIGIWAPILL LLCKMAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA
GVVVLISTIV GEENFLEWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE
GLQDGPKVSF KEIATKHWRS LLSCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI
IIAIMIGMLF VQPVMGLLSD RFGRRPFVIM GSIALFALAI PAFILINSNV IGLIFAGLLM
LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLIA GLTPTLAAWL VESSQDLMMP
AYYLMVIAVI GLITGISMKE TANRPLKGAT PAASDIQEAK EILGEHYDNI EQKIDDIDQE
IAELQVKRSR LVQQHPRIDE