Gene SNSL254_A2099 details


Gene Information

Locus tag: SNSL254_A2099
Symbol:
ID: 6484882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2035289
End bp: 2036557
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 55%
IMG OID: 642737455
Product: tyrosine-specific transport protein
Protein accession: YP_002041205
Protein GI: 194444729
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID: [TIGR00837] aromatic amino acid transport protein
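
The coordinates and lengths listed above agree with each other, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2035289, 2036557)
print(length)                  # 1269
print(protein_length(length))  # 422
```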


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 1.23753e-19
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCTTACC CTGCGCCGCG GATATCACCG TTCTTTGTTA CCGGGGTAGT AGAAAGCGTG 
AAAAACAGAA CTCTGGGCAG TATTTTTATC GTGGCAGGCA CCACTATCGG CGCCGGGATG
CTGGCAATGC CGCTGGCAGC GGCTGGCGTT GGTTTCAGCG TCACGCTGGG ATTGTTGATT
GGCCTGTGGG CGCTGATGTG TTATACCGCG CTACTATTAC TGGAGGTATA TCAACACGTT
CCGGCGGATA CCGGACTGGG CTCGTTGGCA AAACGCTATC TTGGACGTTA CGGACAGTGG
CTTACGGGAT TCAGTATGAT GTTCTTAATG TATGCGCTCA CCGCCGCCTA CATTTCCGGA
GCCGGAGAAT TACTGGCATC CAGTATTAAT AACTGGCTTG GCGCCACGCT CTCGCCCGCT
GCCGGGGTGC TGCTGTTCAC CTTTGTTGCC GGTGGGGTGG TGTGTGTGGG CACCTCGCTG
GTCGACCTTT TTAACCGCTT CCTGTTTAGC GCAAAGATCA TTTTTCTGGT CATCATGCTT
GCGTTGCTCA CGCCACATAT TCATAAAGTA AATCTTCTTA CGCTTCCTTT ACAGCAGGGG
CTGGCGTTAT CCGCCATACC GGTCATTTTC ACCTCGTTTG GTTTTCACGG AAGCGTACCG
AGTATTGTGA GTTATATGAA CGGCAACATT CGCCGGCTGC GTTGGGTCTT TATGACGGGT
AGCGCCATTC CGCTAGTGGC CTATATTTTT TGGCAGCTCG CCACGCTGGG AAGTATCGAC
TCGCCGACAT TCAGAGGGCT ACTGGCCAGC CATGCCGGGT TAAATGGCCT GCTGCAGGCG
CTCAGAGAAG TGGTCGCTTC GCCACATGTC GAACTGGCGG TCCACCTGTT CGCCGATCTG
GCGTTGGCGA CCTCTTTTCT GGGCGTAGCG CTAGGATTAT TTGATTACCT GGCCGATCTA
TTCCAGCGCC GCAGTACGGT GTCCGGACGT CTGCAAACCG GGCTGATTAC CTTTCTGCCG
CCGCTGGCGT TTGCACTTTT CTACCCACGT GGATTTGTGA TGGCATTAGG CTATGCCGGC
GTAGCGCTGG CAGTGCTGGC ACTGCTCATC CCTGCTATGC TGGTCTGGCA GTGCCGTAAA
CAGAGCCCTC AGGCGGGATA TCGTGTGGCA GGCGGCACGC CAGCGCTGGC GCTGGTGTTT
ATCTGCGGCA TTGTCGTGAT TGGCGTCCAG TTTTCGATCG CACTGGGGTT TCTGCCCGAT
CCAGGTTAA
 
Protein sequence
MPYPAPRISP FFVTGVVESV KNRTLGSIFI VAGTTIGAGM LAMPLAAAGV GFSVTLGLLI 
GLWALMCYTA LLLLEVYQHV PADTGLGSLA KRYLGRYGQW LTGFSMMFLM YALTAAYISG
AGELLASSIN NWLGATLSPA AGVLLFTFVA GGVVCVGTSL VDLFNRFLFS AKIIFLVIML
ALLTPHIHKV NLLTLPLQQG LALSAIPVIF TSFGFHGSVP SIVSYMNGNI RRLRWVFMTG
SAIPLVAYIF WQLATLGSID SPTFRGLLAS HAGLNGLLQA LREVVASPHV ELAVHLFADL
ALATSFLGVA LGLFDYLADL FQRRSTVSGR LQTGLITFLP PLAFALFYPR GFVMALGYAG
VALAVLALLI PAMLVWQCRK QSPQAGYRVA GGTPALALVF ICGIVVIGVQ FSIALGFLPD
PG
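
The gene and protein sequences above can be cross-checked with a small, dependency-free sketch. It is an illustration only, not the database's own tooling. It assumes the sequence is pasted with its space-separated 10-character blocks, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in which codons may act as starts). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI standard code layout).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence and its final 9 bases.
print(translate("ATGCCTTACC CTGCGCCGCG GATATCACCG"))  # MPYPAPRISP
print(translate("CCAGGTTAA"))                         # PG
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence, or passing it to `gc_content` and comparing with the reported 55%, gives a quick consistency check on the record.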