Gene SeSA_A4545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A4545 
Symbol 
ID6517222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp4421558 
End bp4423060 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content53% 
IMG OID642749489 
Productproline/glycine betaine transporter 
Protein accessionYP_002117225 
Protein GI194735683 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.829312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAAA GGAAAAAAAT AAAACCGATT ACACTGGGCG ATGTGACCAT CATTGATGAC 
GGTAAACTTC GCAAAGCGAT TACCGCCGCC TCGCTGGGCA ACGCGATGGA GTGGTTTGAT
TTTGGTGTTT ATGGATTTGT TGCCTACGCG TTGGGTAAAG TCTTTTTCCC CGGCGCCGAT
CCCAGCGTCC AGATGATTGC CGCGCTGGCC ACGTTTTCCG TTCCCTTCCT GATTCGTCCG
CTCGGCGGGT TATTCTTTGG TATGCTCGGC GATAAATACG GGCGCCAGAA GATCCTGGCG
ATAACGATTG TGATTATGTC GATCAGTACC TTCTGTATCG GGTTAATTCC CTCTTACGCG
ACGATCGGTA TCTGGGCGCC AATACTGTTG TTGCTATGTA AAATAGCGCA GGGCTTCTCG
GTTGGCGGGG AATATACCGG CGCGTCGATC TTTGTCGCGG AATATTCGCC GGATCGTAAA
CGCGGATTTA TGGGAAGCTG GTTGGATTTT GGTTCCATCG CCGGGTTCGT GCTGGGCGCG
GGCGTGGTGG TCTTAATCTC AACGATTGTC GGCGAGGAGA ATTTCCTGGA GTGGGGCTGG
CGTATTCCGT TCTTTATCGC TCTGCCATTG GGAATTATTG GTCTCTACTT ACGCCATGCG
CTGGAGGAGA CGCCAGCGTT TCAGCAGCAC GTGGATAAAC TGGAGCAGGG CGACCGCGAA
GGGTTGCAGG ATGGGCCGAA AGTCTCCTTT AAAGAGATTG CCACCAAACA CTGGCGGAGC
CTGTTGTCAT GTATCGGTCT GGTGATTGCC ACCAACGTGA CCTACTACAT GCTGCTCACC
TACATGCCGA GCTACCTGTC GCATAACCTG CACTATTCTG AAGATCACGG CGTGTTGATT
ATCATCGCCA TTATGATCGG GATGCTGTTT GTGCAGCCGG TGATGGGGCT GCTGAGCGAC
CGTTTCGGTC GACGTCCATT TGTGATTATG GGCAGCATTG CGCTGTTCGC GCTGGCGATC
CCGGCCTTCA TCCTGATTAA CAGTAACGTT ATTGGCCTGA TTTTTGCCGG TTTGTTGATG
CTGGCGGTGA TTCTGAACTG CTTTACCGGG GTGATGGCCT CGACATTGCC GGCGATGTTT
CCGACGCATA TTCGTTACAG CGCGCTGGCG GCGGCTTTTA ATATCTCTGT ATTGATTGCC
GGTCTGACGC CAACGCTGGC GGCCTGGCTG GTGGAAAGCT CGCAGGATCT GATGATGCCG
GCGTATTATT TGATGGTCAT CGCGGTGATA GGCTTGGTTA CCGGTATTTC CATGAAAGAG
ACGGCCAATC GTCCGCTAAA AGGCGCAACG CCAGCGGCGT CGGACATCCA GGAAGCGAAG
GAAATTCTGG GCGAGCATTA CGATAATATT GAGCAGAAAA TCGACGACAT CGATCAGGAA
ATCGCGGAGC TGCAGGTCAA GCGTTCGCGT CTGGTACAGC AACATCCGCG CATCGATGAA
TAA
 
Protein sequence
MLKRKKIKPI TLGDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD 
PSVQMIAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYA
TIGIWAPILL LLCKIAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA
GVVVLISTIV GEENFLEWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE
GLQDGPKVSF KEIATKHWRS LLSCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI
IIAIMIGMLF VQPVMGLLSD RFGRRPFVIM GSIALFALAI PAFILINSNV IGLIFAGLLM
LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLIA GLTPTLAAWL VESSQDLMMP
AYYLMVIAVI GLVTGISMKE TANRPLKGAT PAASDIQEAK EILGEHYDNI EQKIDDIDQE
IAELQVKRSR LVQQHPRIDE