Gene SeAg_B4552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B4552 
Symbol 
ID6794244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp4451745 
End bp4453247 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content53% 
IMG OID642778639 
Productproline/glycine betaine transporter 
Protein accessionYP_002149205 
Protein GI197248858 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAAAA GGAAAAAAAT AAAACCGATT ACACTGGGCG ATGTGACCAT CATTGATGAT 
GGTAAACTTC GCAAAGCGAT TACCGCCGCC TCGCTGGGCA ACGCGATGGA GTGGTTTGAT
TTTGGTGTTT ATGGATTTGT TGCCTACGCG TTGGGTAAAG TCTTTTTCCC CGGCGCCGAT
CCCAGCGTCC AGATGATTGC CGCGCTGGCC ACGTTTTCCG TTCCCTTCCT GATTCGTCCG
CTCGGCGGGT TATTCTTTGG TATGCTCGGC GATAAATACG GGCGCCAGAA GATCCTGGCG
ATCACGATTG TGATTATGTC GATCAGTACC TTCTGTATCG GGTTAATCCC CTCTTACGCG
ACGATCGGTA TCTGGGCGCC AATACTGTTG TTGCTGTGTA AAATGGCGCA GGGCTTCTCG
GTTGGCGGGG AATATACCGG CGCGTCGATC TTTGTCGCGG AATATTCGCC GGATCGTAAA
CGCGGATTTA TGGGAAGCTG GCTGGATTTT GGTTCCATCG CCGGGTTCGT GCTGGGCGCG
GGCGTGGTGG TCTTGATCTC AACGATTGTC GGCGAGGAGA ATTTCCTTGA GTGGGGCTGG
CGTATTCCGT TCTTTATCGC CCTGCCATTG GGGATTATTG GTCTCTACTT ACGCCATGCG
CTGGAGGAGA CGCCAGCGTT TCAGCAGCAC GTGGATAAAC TGGAGCAGGG CGACCGCGAA
GGGTTGCAGG ATGGGCCGAA AGTCTCCTTT AAAGAGATTG CCACCAAACA CTGGCGTAGT
CTGTTGTCAT GTATCGGTCT GGTGATTGCC ACCAACGTGA CCTACTACAT GCTGCTCACC
TACATGCCGA GCTATCTGTC GCATAACCTG CACTATTCTG AAGATCACGG CGTGTTGATT
ATCATCGCCA TTATGATCGG GATGCTGTTT GTGCAGCCGG TGATGGGGCT GCTGAGCGAC
CGTTTCGGTC GTCGTCCGTT TGTGATTATG GGCAGCATTG CGCTGTTCGC GCTGGCGATC
CCGGCCTTCA TCCTGATTAA CAGTAACGTT ATTGGCCTGA TTTTTGCCGG TTTGTTGATG
CTGGCGGTGA TTCTGAACTG CTTTACCGGG GTGATGGCCT CGACATTGCC GGCGATGTTT
CCGACGCATA TTCGTTACAG CGCGCTGGCG GCGGCTTTTA ATATCTCTGT ATTGATTGCC
GGTCTGACGC CAACGCTGGC GGCCTGGCTG GTGGAAAGCT CGCAGGATCT GATGATGCCG
GCGTATTATT TGATGGTCAT CGCGGTGATA GGCTTGATTA CCGGTATTTC CATGAAAGAG
ACGGCCAATC GTCCGCTAAA AGGCGCAACG CCAGCGGCGT CGGACATTCA GGAAGCGAAG
GAAATTCTGG GCGAGCATTA CGATAATATT GAGCAGAAAA TCGACGACAT CGATCAGGAA
ATTGCGGAGC TGCAGGTCAA ACGTTCGCGT CTGGTACAGC AACATCCGCG TATCGATGAA
TAA
 
Protein sequence
MLKRKKIKPI TLGDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD 
PSVQMIAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYA
TIGIWAPILL LLCKMAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA
GVVVLISTIV GEENFLEWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE
GLQDGPKVSF KEIATKHWRS LLSCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI
IIAIMIGMLF VQPVMGLLSD RFGRRPFVIM GSIALFALAI PAFILINSNV IGLIFAGLLM
LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLIA GLTPTLAAWL VESSQDLMMP
AYYLMVIAVI GLITGISMKE TANRPLKGAT PAASDIQEAK EILGEHYDNI EQKIDDIDQE
IAELQVKRSR LVQQHPRIDE