Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A4545 |
Symbol | |
ID | 6517222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 4421558 |
End bp | 4423060 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642749489 |
Product | proline/glycine betaine transporter |
Protein accession | YP_002117225 |
Protein GI | 194735683 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.829312 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAA GGAAAAAAAT AAAACCGATT ACACTGGGCG ATGTGACCAT CATTGATGAC GGTAAACTTC GCAAAGCGAT TACCGCCGCC TCGCTGGGCA ACGCGATGGA GTGGTTTGAT TTTGGTGTTT ATGGATTTGT TGCCTACGCG TTGGGTAAAG TCTTTTTCCC CGGCGCCGAT CCCAGCGTCC AGATGATTGC CGCGCTGGCC ACGTTTTCCG TTCCCTTCCT GATTCGTCCG CTCGGCGGGT TATTCTTTGG TATGCTCGGC GATAAATACG GGCGCCAGAA GATCCTGGCG ATAACGATTG TGATTATGTC GATCAGTACC TTCTGTATCG GGTTAATTCC CTCTTACGCG ACGATCGGTA TCTGGGCGCC AATACTGTTG TTGCTATGTA AAATAGCGCA GGGCTTCTCG GTTGGCGGGG AATATACCGG CGCGTCGATC TTTGTCGCGG AATATTCGCC GGATCGTAAA CGCGGATTTA TGGGAAGCTG GTTGGATTTT GGTTCCATCG CCGGGTTCGT GCTGGGCGCG GGCGTGGTGG TCTTAATCTC AACGATTGTC GGCGAGGAGA ATTTCCTGGA GTGGGGCTGG CGTATTCCGT TCTTTATCGC TCTGCCATTG GGAATTATTG GTCTCTACTT ACGCCATGCG CTGGAGGAGA CGCCAGCGTT TCAGCAGCAC GTGGATAAAC TGGAGCAGGG CGACCGCGAA GGGTTGCAGG ATGGGCCGAA AGTCTCCTTT AAAGAGATTG CCACCAAACA CTGGCGGAGC CTGTTGTCAT GTATCGGTCT GGTGATTGCC ACCAACGTGA CCTACTACAT GCTGCTCACC TACATGCCGA GCTACCTGTC GCATAACCTG CACTATTCTG AAGATCACGG CGTGTTGATT ATCATCGCCA TTATGATCGG GATGCTGTTT GTGCAGCCGG TGATGGGGCT GCTGAGCGAC CGTTTCGGTC GACGTCCATT TGTGATTATG GGCAGCATTG CGCTGTTCGC GCTGGCGATC CCGGCCTTCA TCCTGATTAA CAGTAACGTT ATTGGCCTGA TTTTTGCCGG TTTGTTGATG CTGGCGGTGA TTCTGAACTG CTTTACCGGG GTGATGGCCT CGACATTGCC GGCGATGTTT CCGACGCATA TTCGTTACAG CGCGCTGGCG GCGGCTTTTA ATATCTCTGT ATTGATTGCC GGTCTGACGC CAACGCTGGC GGCCTGGCTG GTGGAAAGCT CGCAGGATCT GATGATGCCG GCGTATTATT TGATGGTCAT CGCGGTGATA GGCTTGGTTA CCGGTATTTC CATGAAAGAG ACGGCCAATC GTCCGCTAAA AGGCGCAACG CCAGCGGCGT CGGACATCCA GGAAGCGAAG GAAATTCTGG GCGAGCATTA CGATAATATT GAGCAGAAAA TCGACGACAT CGATCAGGAA ATCGCGGAGC TGCAGGTCAA GCGTTCGCGT CTGGTACAGC AACATCCGCG CATCGATGAA TAA
|
Protein sequence | MLKRKKIKPI TLGDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD PSVQMIAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYA TIGIWAPILL LLCKIAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA GVVVLISTIV GEENFLEWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE GLQDGPKVSF KEIATKHWRS LLSCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI IIAIMIGMLF VQPVMGLLSD RFGRRPFVIM GSIALFALAI PAFILINSNV IGLIFAGLLM LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLIA GLTPTLAAWL VESSQDLMMP AYYLMVIAVI GLVTGISMKE TANRPLKGAT PAASDIQEAK EILGEHYDNI EQKIDDIDQE IAELQVKRSR LVQQHPRIDE
|
| |