Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4636 |
Symbol | |
ID | 6484794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4522071 |
End bp | 4523573 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642739858 |
Product | proline/glycine betaine transporter |
Protein accession | YP_002043540 |
Protein GI | 194445267 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAA GGAAAAAAAT AAAACCGATT ACACTGGGCG ATGTGACCAT CATTGATGAC GGTAAACTTC GCAAAGCGAT TACCGCCGCC TCGCTGGGCA ACGCGATGGA GTGGTTTGAT TTTGGTGTTT ATGGATTTGT TGCCTACGCG TTGGGTAAAG TCTTTTTCCC CGGCGCCGAT CCCAGCGTTC AAATGATTGC CGCGCTGGCC ACGTTTTCCG TTCCCTTCCT GATTCGTCCG CTCGGCGGGT TATTCTTTGG TATGCTCGGC GATAAATACG GGCGCCAGAA GATCCTGGCG ATCACGATTG TGATTATGTC GATCAGTACC TTCTGTATCG GGTTAATCCC CTCTTACGCG ACGATCGGTA TCTGGGCGCC AATACTGTTG TTGCTGTGTA AAATGGCGCA GGGCTTCTCG GTTGGCGGGG AATATACCGG CGCGTCGATC TTTGTCGCGG AATATTCGCC GGATCGTAAA CGCGGATTTA TGGGAAGCTG GCTGGATTTT GGTTCCATCG CCGGGTTCGT GCTGGGCGCG GGCGTGGTGG TCTTAATCTC GACGATTGTC GGCGAGGAGA ATTTCCTGGA GTGGGGCTGG CGTATTCCGT TCTTTATCGC CTTGCCATTG GGAATTATCG GTCTCTACTT ACGCCATGCG CTGGAAGAGA CGCCAGCGTT TCAGCAGCAC GTGGATAAAC TGGAGCAGGG CGACCGCGAA GGGTTGCAGG ATGGGCCGAA AGTCTCCTTT AAAGAGATTG CCACCAAACA CTGGCGTAGC CTGTTGTCAT GTATCGGTCT GGTGATTGCC ACCAACGTGA CCTACTACAT GCTACTCACC TACATGCCGA GCTACCTGTC GCATAACCTG CACTATTCTG AAGATCACGG CGTGTTGATT ATCATCGCCA TTATGATCGG GATGTTGTTT GTGCAGCCGG TGATGGGGCT GCTGAGCGAC CGTTTCGGTC GACGTCCATT TGTGATTATG GGCAGCATTG CGCTGTTTGC GCTGGCGATC CCGGCCTTCA TCCTGATTAA CAGTAACGTT ATTGGCCTGA TTTTTGCCGG TTTGTTGATG CTGGCGGTGA TTCTGAACTG CTTTACCGGG GTGATGGCCT CGACATTGCC GGCGATGTTT CCGACGCATA TTCGTTACAG CGCGCTGGCG GCGGCTTTTA ATATCTCTGT ATTGATTGCC GGTCTGACGC CAACGCTGGC GGCCTGGCTG GTGGAAAGCT CGCAGGATCT GATGATGCCG GCGTATTATT TGATGGTCAT CGCGGTGATA GGCTTGATTA CCGGTATTTC CATGAAAGAA ACGGCCAATC GTCCGCTAAA AGGCGCAACG CCAGCGGCGT CGGACATCCA GGAAGCGAAG GAAATTCTGG GCGAGCATTA CGATAATATT GAGCAGAAAA TCGACGACAT CGATCAGGAA ATTGCGGAGC TGCAGGTCAA ACGTTCGCGT CTGGTACAGC AACATCCGCG TATCGATGAA TAA
|
Protein sequence | MLKRKKIKPI TLGDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD PSVQMIAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYA TIGIWAPILL LLCKMAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA GVVVLISTIV GEENFLEWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE GLQDGPKVSF KEIATKHWRS LLSCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI IIAIMIGMLF VQPVMGLLSD RFGRRPFVIM GSIALFALAI PAFILINSNV IGLIFAGLLM LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLIA GLTPTLAAWL VESSQDLMMP AYYLMVIAVI GLITGISMKE TANRPLKGAT PAASDIQEAK EILGEHYDNI EQKIDDIDQE IAELQVKRSR LVQQHPRIDE
|
| |