Gene SNSL254_A0445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0445 
SymbolproY 
ID6485560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp458994 
End bp460424 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content54% 
IMG OID642735867 
Productputative proline-specific permease 
Protein accessionYP_002039641 
Protein GI194444302 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones103 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACACCA CGGGGCTTAA CGCCCCGTGG TTTTTTATTG TGTTGATAGG TCAGAAATTG 
ATGGAAAGCA ATAATAAGCT AAAGCGTGGG CTGAGCACCC GGCACATTCG CTTTATGGCA
TTAGGTTCGG CAATCGGCAC CGGCCTGTTT TACGGCTCGG CGGACGCCAT CAAAATGGCG
GGGCCGAGCG TGCTGTTGGC CTATATTATT GGCGGGGTCG CGGCATATAT CATTATGCGC
GCATTGGGGG AAATGTCCGT TCACAACCCT GCCGCCAGCT CATTTTCGCG CTATGCGCAG
GAAAACCTCG GCCCGCTTGC GGGCTACATT ACCGGCTGGA CCTACTGTTT TGAGATCCTG
ATCGTCGCCA TTGCCGACGT GACCGCGTTC GGCATTTACA TGGGAGTCTG GTTCCCCGCC
GTGCCGCACT GGATTTGGGT GCTTAGCGTG GTGCTGATCA TTTGCGCCAT CAACCTGATG
AGCGTCAAGG TGTTCGGCGA GCTGGAGTTT TGGTTCTCCT TCTTCAAAGT CGCCACCATT
ATTATCATGA TTGTCGCGGG TATCGGCATC ATTGTGTGGG GAATTGGCAA CGGCGGGCAG
CCCACCGGCA TTCATAACCT GTGGAGCAAC GGCGGCTTCT TCAGCAATGG CTGGCTGGGA
ATGATCATGT CGCTGCAAAT GGTGATGTTC GCTTACGGCG GGATTGAGAT TATCGGTATC
ACCGCCGGGG AAGCGAAAGA CCCGGAGAAA TCCATTCCGC GCGCCATTAA CTCAGTACCG
ATGCGTATCC TGGTATTTTA TGTCGGCACG CTGTTCGTCA TTATGTCTAT CTATCCGTGG
AATCAGGTCG GCACAAACGG CAGCCCATTT GTGCTGACGT TCCAGCATAT GGGGATTACC
TTTGCCGCCA GCATTCTGAA CTTTGTGGTA TTGACCGCCT CGCTTTCCGC TATCAATTCC
GATGTGTTTG GCGTAGGCCG TATGCTGCAT GGTATGGCGG AGCAGGGGAG CGCGCCGAAA
ATCTTTGCCA AAACGTCACG CCGTGGTATT CCGTGGGTCA CTGTGCTGGT GATGACGATT
GCGCTGCTGT TTGCGGTTTA CCTGAACTAC ATCATGCCGG AAAACGTCTT CCTGGTGATT
GCTTCGCTGG CGACGTTTGC GACGGTATGG GTATGGATTA TGATCCTGCT GTCGCAAATC
GCCTTCCGTC GTCGTTTACC GCCGGAAGAG GTAAAAGCGC TGAAGTTTAA GGTGCCGGGC
GGTGTCGTAA CGACGATAGC GGGTCTGATT TTCCTGGTCT TCATTATTGC GCTTATCGGC
TACCATCCGG ATACCCGCAT CTCACTGTAT GTGGGCTTCG CCTGGATAGT TCTGCTGTTG
ATTGGCTGGA TATTTAAACG CCGTCGCGAC CGTCAATTGG CGCAGGCGTA G
 
Protein sequence
MYTTGLNAPW FFIVLIGQKL MESNNKLKRG LSTRHIRFMA LGSAIGTGLF YGSADAIKMA 
GPSVLLAYII GGVAAYIIMR ALGEMSVHNP AASSFSRYAQ ENLGPLAGYI TGWTYCFEIL
IVAIADVTAF GIYMGVWFPA VPHWIWVLSV VLIICAINLM SVKVFGELEF WFSFFKVATI
IIMIVAGIGI IVWGIGNGGQ PTGIHNLWSN GGFFSNGWLG MIMSLQMVMF AYGGIEIIGI
TAGEAKDPEK SIPRAINSVP MRILVFYVGT LFVIMSIYPW NQVGTNGSPF VLTFQHMGIT
FAASILNFVV LTASLSAINS DVFGVGRMLH GMAEQGSAPK IFAKTSRRGI PWVTVLVMTI
ALLFAVYLNY IMPENVFLVI ASLATFATVW VWIMILLSQI AFRRRLPPEE VKALKFKVPG
GVVTTIAGLI FLVFIIALIG YHPDTRISLY VGFAWIVLLL IGWIFKRRRD RQLAQA