Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2099 |
Symbol | |
ID | 6484882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2035289 |
End bp | 2036557 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642737455 |
Product | tyrosine-specific transport protein |
Protein accession | YP_002041205 |
Protein GI | 194444729 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | [TIGR00837] aromatic amino acid transport protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 1.23753e-19 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTTACC CTGCGCCGCG GATATCACCG TTCTTTGTTA CCGGGGTAGT AGAAAGCGTG AAAAACAGAA CTCTGGGCAG TATTTTTATC GTGGCAGGCA CCACTATCGG CGCCGGGATG CTGGCAATGC CGCTGGCAGC GGCTGGCGTT GGTTTCAGCG TCACGCTGGG ATTGTTGATT GGCCTGTGGG CGCTGATGTG TTATACCGCG CTACTATTAC TGGAGGTATA TCAACACGTT CCGGCGGATA CCGGACTGGG CTCGTTGGCA AAACGCTATC TTGGACGTTA CGGACAGTGG CTTACGGGAT TCAGTATGAT GTTCTTAATG TATGCGCTCA CCGCCGCCTA CATTTCCGGA GCCGGAGAAT TACTGGCATC CAGTATTAAT AACTGGCTTG GCGCCACGCT CTCGCCCGCT GCCGGGGTGC TGCTGTTCAC CTTTGTTGCC GGTGGGGTGG TGTGTGTGGG CACCTCGCTG GTCGACCTTT TTAACCGCTT CCTGTTTAGC GCAAAGATCA TTTTTCTGGT CATCATGCTT GCGTTGCTCA CGCCACATAT TCATAAAGTA AATCTTCTTA CGCTTCCTTT ACAGCAGGGG CTGGCGTTAT CCGCCATACC GGTCATTTTC ACCTCGTTTG GTTTTCACGG AAGCGTACCG AGTATTGTGA GTTATATGAA CGGCAACATT CGCCGGCTGC GTTGGGTCTT TATGACGGGT AGCGCCATTC CGCTAGTGGC CTATATTTTT TGGCAGCTCG CCACGCTGGG AAGTATCGAC TCGCCGACAT TCAGAGGGCT ACTGGCCAGC CATGCCGGGT TAAATGGCCT GCTGCAGGCG CTCAGAGAAG TGGTCGCTTC GCCACATGTC GAACTGGCGG TCCACCTGTT CGCCGATCTG GCGTTGGCGA CCTCTTTTCT GGGCGTAGCG CTAGGATTAT TTGATTACCT GGCCGATCTA TTCCAGCGCC GCAGTACGGT GTCCGGACGT CTGCAAACCG GGCTGATTAC CTTTCTGCCG CCGCTGGCGT TTGCACTTTT CTACCCACGT GGATTTGTGA TGGCATTAGG CTATGCCGGC GTAGCGCTGG CAGTGCTGGC ACTGCTCATC CCTGCTATGC TGGTCTGGCA GTGCCGTAAA CAGAGCCCTC AGGCGGGATA TCGTGTGGCA GGCGGCACGC CAGCGCTGGC GCTGGTGTTT ATCTGCGGCA TTGTCGTGAT TGGCGTCCAG TTTTCGATCG CACTGGGGTT TCTGCCCGAT CCAGGTTAA
|
Protein sequence | MPYPAPRISP FFVTGVVESV KNRTLGSIFI VAGTTIGAGM LAMPLAAAGV GFSVTLGLLI GLWALMCYTA LLLLEVYQHV PADTGLGSLA KRYLGRYGQW LTGFSMMFLM YALTAAYISG AGELLASSIN NWLGATLSPA AGVLLFTFVA GGVVCVGTSL VDLFNRFLFS AKIIFLVIML ALLTPHIHKV NLLTLPLQQG LALSAIPVIF TSFGFHGSVP SIVSYMNGNI RRLRWVFMTG SAIPLVAYIF WQLATLGSID SPTFRGLLAS HAGLNGLLQA LREVVASPHV ELAVHLFADL ALATSFLGVA LGLFDYLADL FQRRSTVSGR LQTGLITFLP PLAFALFYPR GFVMALGYAG VALAVLALLI PAMLVWQCRK QSPQAGYRVA GGTPALALVF ICGIVVIGVQ FSIALGFLPD PG
|
| |