Gene SeSA_A0458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A0458 
SymbolproY 
ID6516793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp467438 
End bp468868 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content54% 
IMG OID642745608 
Productputative proline-specific permease 
Protein accessionYP_002113432 
Protein GI194738258 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACACCA CGGGGCTTAA CGCCCCGTGG TTTTTTATTG TGTTGATAGG TCAGAAATTG 
ATGGAAAGCA ATAATAAGCT AAAGCGTGGG CTGAGCACCC GGCACATTCG CTTTATGGCA
TTAGGTTCGG CAATCGGCAC CGGCCTGTTT TACGGCTCGG CGGACGCCAT CAAAATGGCG
GGGCCGAGCG TGCTGTTGGC CTATATTATT GGCGGGGTCG CGGCATATAT CATTATGCGC
GCATTGGGGG AAATGTCCGT TCACAACCCT GCCGCCAGCT CATTTTCGCG CTATGCGCAG
GAAAACCTCG GCCCGCTTGC GGGCTATATT ACCGGCTGGA CCTACTGTTT TGAGATCCTG
ATCGTCGCCA TTGCCGACGT GACCGCGTTC GGCATTTACA TGGGAGTCTG GTTCCCCGCC
GTGCCGCACT GGATTTGGGT GCTTAGCGTG GTGCTGATCA TTTGCGCCAT CAACCTGATG
AGCGTCAAGG TGTTCGGCGA GCTGGAGTTT TGGTTCTCCT TCTTCAAAGT CGCCACCATT
ATTATCATGA TTGTCGCGGG TATCGGCATT ATTGTGTGGG GAATTGGCAA CGGCGGGCAG
CCCACCGGCA TTCATAACCT GTGGAGCAAC GGCGGCTTCT TCAGCAATGG CTGGCTGGGA
ATGATCATGT CGCTGCAAAT GGTGATGTTC GCTTACGGCG GGATTGAAAT TATCGGTATC
ACCGCCGGGG AAGCGAAAGA CCCGGAGAAA TCCATTCCGC GCGCTATTAA CTCAGTACCG
ATGCGTATCC TGGTATTTTA TGTCGGCACG CTGTTCGTCA TTATGTCTAT CTATCCGTGG
AATCAGGTCG GCACAAACGG CAGTCCATTT GTGCTGACGT TCCAGCATAT GGGGATTACC
TTCGCCGCCA GCATTCTGAA CTTTGTGGTA TTGACCGCCT CGCTTTCCGC TATCAACTCC
GATGTGTTTG GTGTAGGCCG TATGCTGCAT GGTATGGCGG AGCAGGGGAG CGCGCCGAAA
GTCTTTGCCA AAACGTCACG CCGTGGTATT CCGTGGGTCA CTGTGCTGGT GATGACGATT
GCGCTGCTGT TTGCGGTTTA CCTGAACTAC ATCATGCCGG AAAACGTCTT CCTGGTGATT
GCCTCGCTGG CGACGTTTGC GACGGTATGG GTATGGATTA TGATCCTGCT GTCGCAAATC
GCCTTCCGCC GTCGTTTACC GCCGGAAGAG GTAAAAGCGC TGAAGTTTAA GGTGCCGGGC
GGTGTCGTAA CGACGATAGC TGGGCTGATT TTCCTGGTCT TCATTATTGC GCTTATCGGC
TACCATCCGG ATACCCGCAT CTCACTGTAT GTGGGCTTCG CCTGGATAGT TCTGCTGTTG
ATTGGCTGGA TATTTAAACG CCGTCGCGAC CGTCAATTGG CGCAGGCGTA G
 
Protein sequence
MYTTGLNAPW FFIVLIGQKL MESNNKLKRG LSTRHIRFMA LGSAIGTGLF YGSADAIKMA 
GPSVLLAYII GGVAAYIIMR ALGEMSVHNP AASSFSRYAQ ENLGPLAGYI TGWTYCFEIL
IVAIADVTAF GIYMGVWFPA VPHWIWVLSV VLIICAINLM SVKVFGELEF WFSFFKVATI
IIMIVAGIGI IVWGIGNGGQ PTGIHNLWSN GGFFSNGWLG MIMSLQMVMF AYGGIEIIGI
TAGEAKDPEK SIPRAINSVP MRILVFYVGT LFVIMSIYPW NQVGTNGSPF VLTFQHMGIT
FAASILNFVV LTASLSAINS DVFGVGRMLH GMAEQGSAPK VFAKTSRRGI PWVTVLVMTI
ALLFAVYLNY IMPENVFLVI ASLATFATVW VWIMILLSQI AFRRRLPPEE VKALKFKVPG
GVVTTIAGLI FLVFIIALIG YHPDTRISLY VGFAWIVLLL IGWIFKRRRD RQLAQA