Gene SeD_A0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0439 
SymbolproY 
ID6874549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp457916 
End bp459346 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content54% 
IMG OID642783668 
Productputative proline-specific permease 
Protein accessionYP_002214355 
Protein GI198244611 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.37513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACACCA CGGGGCTTAA CGCCCCGTGG TTTTTTATTG TGTTGATAGG TCAGAAATTG 
ATGGAAAGCA ATAATAAGCT AAAGCGTGGG CTGAGCACCC GGCACATTCG CTTTATGGCA
TTAGGTTCGG CAATCGGCAC CGGCCTGTTT TACGGCTCGG CGGACGCCAT CAAAATGGCG
GGGCCGAGCG TGCTGTTGGC CTATATTATT GGCGGGGTCG CGGCATATAT CATTATGCGC
GCATTGGGGG AAATGTCCGT TCACAACCCT GCCGCCAGCT CATTTTCGCG CTATGCGCAG
GAAAACCTCG GCCCGCTTGC GGGCTACATT ACCGGCTGGA CCTACTGTTT TGAGATCCTG
ATCGTCGCCA TTGCCGACGT GACCGCGTTC GGCATTTACA TGGGAGTCTG GTTCCCCGCC
GTGCCGCACT GGATTTGGGT ACTTAGCGTG GTGCTGATCA TTTGCGCCAT CAACCTGATG
AGCGTCAAGG TGTTCGGCGA GCTGGAGTTT TGGTTCTCCT TCTTCAAAGT CGCCACCATT
ATTATCATGA TTGTCGCGGG TATCGGCATC ATTGTGTGGG GAATTGGCAA CGGCGGGCAG
CCGACCGGCA TTCATAACCT GTGGAGCAAC GGCGGCTTCT TCAGCAATGG CTGGCTGGGA
ATGATCATGT CGCTGCAAAT GGTGATGTTC GCTTACGGCG GGATTGAGAT TATCGGTATC
ACCGCCGGGG AAGCGAAAGA CCCGGAGAAA TCCATTCCGC GCGCCATTAA CTCAGTACCG
ATGCGTATCC TGGTATTTTA TGTCGGCACG CTGTTCGTCA TTATGTCTAT CTATCCGTGG
AATCAGGTCG GCACAAACGG CAGCCCATTT GTGCTGACGT TCCAGCATAT GGGGATTACC
TTTGCCGCCA GCATTCTGAA CTTTGTGGTA TTGACCGCCT CGCTTTCCGC TATCAACTCC
GATGTGTTTG GCGTAGGCCG TATGCTGCAT GGTATGGCGG AGCAGGGGAG CGCGCCGAAA
GTCTTTGCCA AAACGTCACG CCGTGGTATT CCGTGGGTTA CTGTGCTGGT GATGACGATT
GCGCTGCTGT TTGCGGTTTA CCTGAACTAC ATCATGCCGG AAAACGTCTT CCTGGTGATT
GCCTCGCTGG CGACGTTTGC GACGGTATGG GTATGGATCA TGATCCTGCT GTCACAAATC
GCCTTCCGCC GTCGTTTACC GCCGGAAGAG GTAAAAGCGC TGAAGTTTAA GGTGCCGGGC
GGTGTCGTAA CGACGATAGC GGGTCTGATT TTCCTGGTCT TCATTATTGC GCTTATCGGC
TACCATCCGG ATACCCGCAT CTCACTGTAT GTGGGCTTCG CCTGGATAGT TCTGCTGTTG
ATTGGCTGGA TGTTTAAACG CCGTCGCGAC CGTCAATTGG CGCAGGCGTA G
 
Protein sequence
MYTTGLNAPW FFIVLIGQKL MESNNKLKRG LSTRHIRFMA LGSAIGTGLF YGSADAIKMA 
GPSVLLAYII GGVAAYIIMR ALGEMSVHNP AASSFSRYAQ ENLGPLAGYI TGWTYCFEIL
IVAIADVTAF GIYMGVWFPA VPHWIWVLSV VLIICAINLM SVKVFGELEF WFSFFKVATI
IIMIVAGIGI IVWGIGNGGQ PTGIHNLWSN GGFFSNGWLG MIMSLQMVMF AYGGIEIIGI
TAGEAKDPEK SIPRAINSVP MRILVFYVGT LFVIMSIYPW NQVGTNGSPF VLTFQHMGIT
FAASILNFVV LTASLSAINS DVFGVGRMLH GMAEQGSAPK VFAKTSRRGI PWVTVLVMTI
ALLFAVYLNY IMPENVFLVI ASLATFATVW VWIMILLSQI AFRRRLPPEE VKALKFKVPG
GVVTTIAGLI FLVFIIALIG YHPDTRISLY VGFAWIVLLL IGWMFKRRRD RQLAQA