Gene SeHA_C0500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0500 
SymbolproY 
ID6487660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp501727 
End bp503157 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content54% 
IMG OID642740767 
Productputative proline-specific permease 
Protein accessionYP_002044434 
Protein GI194448252 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.968025 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACACCA CGGGGCTTAA CGCCCCGTGG TTTTTTATTG TGTTGATAGG TCAGAAATTG 
ATGGAAAGCA ATAATAAGCT AAAGCGTGGG CTGAGCACCC GGCACATTCG CTTTATGGCA
TTAGGTTCGG CAATCGGCAC CGGCCTGTTT TACGGCTCGG CGGACGCCAT CAAAATGGCG
GGGCCGAGCG TGCTGTTGGC CTATATTATT GGCGGGGTCG CGGCATATAT CATTATGCGC
GCATTGGGGG AAATGTCCGT TCACAACCCT GCCGCCAGCT CATTTTCGCG CTATGCGCAG
GAAAACCTCG GCCCGCTTGC GGGCTACATT ACCGGCTGGA CCTACTGTTT TGAGATCCTG
ATCGTCGCCA TTGCCGACGT GACCGCGTTC GGCATTTACA TGGGAGTCTG GTTCCCCGCC
GTGCCGCACT GGATTTGGGT GCTTAGCGTG GTGCTGATCA TTTGCGCCAT CAACCTGATG
AGCGTCAAGG TGTTCGGTGA GCTGGAGTTT TGGTTCTCCT TCTTCAAAGT CGCCACCATT
ATTATCATGA TTATCGCGGG TATCGGCATC ATTGTGTGGG GAATTGGCAA CGGCGGGCAG
CCCACCGGCA TTCATAACCT GTGGAGCAAC GGCGGCTTCT TCAGCAATGG CTGGCTGGGA
ATGATCATGT CGCTGCAAAT GGTAATGTTC GCTTACGGCG GGATTGAGAT TATCGGTATC
ACCGCCGGGG AAGCGAAAGA CCCGGAGAAA TCCATTCCGC GCGCCATTAA CTCAGTACCG
ATGCGTATCC TGGTATTTTA TGTCGGCACG CTGTTCGTCA TTATGTCTAT CTATCCGTGG
AATCAGGTCG GCACAAACGG CAGCCCATTT GTGCTGACGT TCCAGCATAT GGGGATTACC
TTTGCCGCCA GCATTCTGAA CTTTGTGGTA TTGACCGCCT CGCTTTCCGC TATCAACTCC
GATGTGTTTG GCGTAGGCCG TATGCTGCAT GGTATGGCGG AGCAGGGGAG CGCGCCGAAA
GTCTTTGCCA AAACGTCACG CCGTGGTATT CCGTGGGTTA CTGTGCTGGT GATGACGATT
GCGCTGCTGT TTGCGGTTTA CCTGAACTAC ATCATGCCGG AAAACGTCTT CCTGGTGATT
GCTTCGCTGG CGACGTTTGC GACGGTATGG GTATGGATTA TGATCCTGCT GTCGCAAATC
GCCTTCCGTC GTCGTTTACC GCCGGAAGAG GTAAAAGCGC TGAAATTTAA GGTGCCGGGC
GGTGTCGTAA CGACGATAGC GGGTCTGATT TTCCTGGTCT TCATTATTGC GCTTATCGGC
TACCATCCGG ATACCCGCAT CTCACTGTAT GTGGGCTTCG CCTGGATAGT TCTGCTGTTG
ATTGGCTGGA TATTTAAACG CCGTCGCGAC CGTCAATTGG CGCAGGCGTA G
 
Protein sequence
MYTTGLNAPW FFIVLIGQKL MESNNKLKRG LSTRHIRFMA LGSAIGTGLF YGSADAIKMA 
GPSVLLAYII GGVAAYIIMR ALGEMSVHNP AASSFSRYAQ ENLGPLAGYI TGWTYCFEIL
IVAIADVTAF GIYMGVWFPA VPHWIWVLSV VLIICAINLM SVKVFGELEF WFSFFKVATI
IIMIIAGIGI IVWGIGNGGQ PTGIHNLWSN GGFFSNGWLG MIMSLQMVMF AYGGIEIIGI
TAGEAKDPEK SIPRAINSVP MRILVFYVGT LFVIMSIYPW NQVGTNGSPF VLTFQHMGIT
FAASILNFVV LTASLSAINS DVFGVGRMLH GMAEQGSAPK VFAKTSRRGI PWVTVLVMTI
ALLFAVYLNY IMPENVFLVI ASLATFATVW VWIMILLSQI AFRRRLPPEE VKALKFKVPG
GVVTTIAGLI FLVFIIALIG YHPDTRISLY VGFAWIVLLL IGWIFKRRRD RQLAQA