Gene Information |
Locus tag | SeAg_B4552 |
Symbol | |
ID | 6794244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 4451745 |
End bp | 4453247 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642778639 |
Product | proline/glycine betaine transporter |
Protein accession | YP_002149205 |
Protein GI | 197248858 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
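The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 4451745, End bp 4453247.
length = gene_length_bp(4451745, 4453247)
print(length, protein_length_aa(length))  # 1503 500
```

Both results agree with the Gene Length (1503 bp) and Protein Length (500 aa) fields of the record.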
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAAA GGAAAAAAAT AAAACCGATT ACACTGGGCG ATGTGACCAT CATTGATGAT GGTAAACTTC GCAAAGCGAT TACCGCCGCC TCGCTGGGCA ACGCGATGGA GTGGTTTGAT TTTGGTGTTT ATGGATTTGT TGCCTACGCG TTGGGTAAAG TCTTTTTCCC CGGCGCCGAT CCCAGCGTCC AGATGATTGC CGCGCTGGCC ACGTTTTCCG TTCCCTTCCT GATTCGTCCG CTCGGCGGGT TATTCTTTGG TATGCTCGGC GATAAATACG GGCGCCAGAA GATCCTGGCG ATCACGATTG TGATTATGTC GATCAGTACC TTCTGTATCG GGTTAATCCC CTCTTACGCG ACGATCGGTA TCTGGGCGCC AATACTGTTG TTGCTGTGTA AAATGGCGCA GGGCTTCTCG GTTGGCGGGG AATATACCGG CGCGTCGATC TTTGTCGCGG AATATTCGCC GGATCGTAAA CGCGGATTTA TGGGAAGCTG GCTGGATTTT GGTTCCATCG CCGGGTTCGT GCTGGGCGCG GGCGTGGTGG TCTTGATCTC AACGATTGTC GGCGAGGAGA ATTTCCTTGA GTGGGGCTGG CGTATTCCGT TCTTTATCGC CCTGCCATTG GGGATTATTG GTCTCTACTT ACGCCATGCG CTGGAGGAGA CGCCAGCGTT TCAGCAGCAC GTGGATAAAC TGGAGCAGGG CGACCGCGAA GGGTTGCAGG ATGGGCCGAA AGTCTCCTTT AAAGAGATTG CCACCAAACA CTGGCGTAGT CTGTTGTCAT GTATCGGTCT GGTGATTGCC ACCAACGTGA CCTACTACAT GCTGCTCACC TACATGCCGA GCTATCTGTC GCATAACCTG CACTATTCTG AAGATCACGG CGTGTTGATT ATCATCGCCA TTATGATCGG GATGCTGTTT GTGCAGCCGG TGATGGGGCT GCTGAGCGAC CGTTTCGGTC GTCGTCCGTT TGTGATTATG GGCAGCATTG CGCTGTTCGC GCTGGCGATC CCGGCCTTCA TCCTGATTAA CAGTAACGTT ATTGGCCTGA TTTTTGCCGG TTTGTTGATG CTGGCGGTGA TTCTGAACTG CTTTACCGGG GTGATGGCCT CGACATTGCC GGCGATGTTT CCGACGCATA TTCGTTACAG CGCGCTGGCG GCGGCTTTTA ATATCTCTGT ATTGATTGCC GGTCTGACGC CAACGCTGGC GGCCTGGCTG GTGGAAAGCT CGCAGGATCT GATGATGCCG GCGTATTATT TGATGGTCAT CGCGGTGATA GGCTTGATTA CCGGTATTTC CATGAAAGAG ACGGCCAATC GTCCGCTAAA AGGCGCAACG CCAGCGGCGT CGGACATTCA GGAAGCGAAG GAAATTCTGG GCGAGCATTA CGATAATATT GAGCAGAAAA TCGACGACAT CGATCAGGAA ATTGCGGAGC TGCAGGTCAA ACGTTCGCGT CTGGTACAGC AACATCCGCG TATCGATGAA TAA
|
Protein sequence | MLKRKKIKPI TLGDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD PSVQMIAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYA TIGIWAPILL LLCKMAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA GVVVLISTIV GEENFLEWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE GLQDGPKVSF KEIATKHWRS LLSCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI IIAIMIGMLF VQPVMGLLSD RFGRRPFVIM GSIALFALAI PAFILINSNV IGLIFAGLLM LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLIA GLTPTLAAWL VESSQDLMMP AYYLMVIAVI GLITGISMKE TANRPLKGAT PAASDIQEAK EILGEHYDNI EQKIDDIDQE IAELQVKRSR LVQQHPRIDE
|
| |
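The gene and protein sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that formatting, compute GC content, and translate the CDS. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code for elongation codons; the helper names are hypothetical, and only the first 30 bases of the record are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence in this record.
prefix = "ATGCTGAAAA GGAAAAAAAT AAAACCGATT"
print(translate(prefix))  # MLKRKKIKPI
```

The translated prefix matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.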