Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4614 |
Symbol | |
ID | 6484070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4497314 |
End bp | 4498663 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642739838 |
Product | inorganic anion transporter, sulfate permease (SulP) family |
Protein accession | YP_002043520 |
Protein GI | 194442291 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.166887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 0.371803 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACGC CTTCAGCGCG TACCGGCGGT TCACTCGACG CCTGGTTTAA AATTTCACAA CGCGGGAGCA CCGTTCGCCA GGAAGTCGTT GCTGGCTTAA CCACGTTTCT TGCGATGGTT TACTCCGTCA TCGTGGTGCC AGGGATGCTG GGTAAAGCCG GGTTCCCGCC TGCTGCCGTC TTTGTAGCGA CCTGCCTGGT CGCAGGCGTC GGCTCTATCG TTATGGGGCT TTGGGCGAAT CTGCCGCTGG CTATCGGTTG CGCTATCTCG TTGACGGCGT TTACCGCGTT TAGCCTGGTC CTGGGGCAGC ATATTAGCGT CCCGGTCGCG CTTGGCGCCG TATTCCTGAT GGGCGTACTG TTTACCGTTA TCTCCGCCAC CGGCATTCGT AGCTGGATTT TGCGTAACTT GCCACAGGGC GTGGCCCACG GAACCGGTAT CGGTATTGGC CTGTTTTTGT TGCTGATTGC CGCCAACGGC GTCGGCCTGG TCATTAAAAA CCCGCTGGAT GGCCTGCCGG TCGCGCTGGG CGACTTCGAC ACCTTCCCGG TGATTATGTC GCTTGTGGGT CTGGCTGTTA TTCTCGGCCT GGAAAAGCTG AAAGTCCCCG GCGGCATTCT GTTGACCATT ATTGGCATTT CTATTGTGGG TTTGCTCTTC GATCCTAACG TCCATTTTTC TGGCATTTTC GCCATGCCTT CGTTGAGCGA TGAAAACGGC AACTCGCTGA TTGGCAGTCT GGATATTATG GGCGCGTTAA ACCCTGTCGT TCTGCCAAGC GTACTGGCGC TGGTGATGAC GGCGGTGTTT GATGCGACCG GCACGATCCG TGCGGTCGCA GGGCAGGCTA ACCTGCTGGA CAAAGACGGA CAAATTATCG ATGGCGGCAA AGCGCTAACC ACCGACTCCC TGAGTAGCGT TTTCTCCGGC CTGGTCGGGG CGGCGCCAGC CGCGGTGTAT ATCGAGTCTG CGGCCGGTAC GGCGGCGGGC GGTAAAACGG GCCTGACGGC GATTACCGTG GGCGTGCTGT TTTTGCTGAT CCTGTTTCTT TCGCCGCTCT CTTATCTCGT TCCCGTCTAC GCAACCGCCC CGGCGCTCAT GTATGTCGGT CTGCTGATGC TGAGTAACGT GGCGAAAATC GATTTTGCTG ATTTCGTCGA TGCGATGGCG GGGCTGGTAA CGGCTGTATT TATTGTACTG ACCTGTAACA TCGTCACCGG GATTATGATC GGCTTTGCCA CGCTGGTCGT CGGGCGTCTG GTTTCCGGCG AATGGCGTAA GCTGAATATC GGAACTGTCG TCATTGCGGT CGCGCTGGTC GCTTTTTATG CCGGCGGCTG GGCTATCTAG
|
Protein sequence | MSTPSARTGG SLDAWFKISQ RGSTVRQEVV AGLTTFLAMV YSVIVVPGML GKAGFPPAAV FVATCLVAGV GSIVMGLWAN LPLAIGCAIS LTAFTAFSLV LGQHISVPVA LGAVFLMGVL FTVISATGIR SWILRNLPQG VAHGTGIGIG LFLLLIAANG VGLVIKNPLD GLPVALGDFD TFPVIMSLVG LAVILGLEKL KVPGGILLTI IGISIVGLLF DPNVHFSGIF AMPSLSDENG NSLIGSLDIM GALNPVVLPS VLALVMTAVF DATGTIRAVA GQANLLDKDG QIIDGGKALT TDSLSSVFSG LVGAAPAAVY IESAAGTAAG GKTGLTAITV GVLFLLILFL SPLSYLVPVY ATAPALMYVG LLMLSNVAKI DFADFVDAMA GLVTAVFIVL TCNIVTGIMI GFATLVVGRL VSGEWRKLNI GTVVIAVALV AFYAGGWAI
|
| |