Gene ECH74115_0479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0479 
SymbolproY 
ID6968590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp486552 
End bp487925 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content53% 
IMG OID643384527 
Productputative proline-specific permease 
Protein accessionYP_002269041 
Protein GI209396652 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.339066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGTA AGAACAAGCT AAAGCGTGGG CTAAGTACCC GCCACATACG CTTTATGGCA 
CTGGGTTCAG CAATTGGCAC CGGGCTGTTT TACGGTTCGG CAGACGCCAT CAAAATGGCC
GGTCCGAGCG TGTTGTTGGC CTATATTATC GGTGGTATCG CGGCGTATAT CATTATGCGT
GCGCTGGGGG AAATGTCGGT ACATAACCCG GCCGCCAGCT CTTTCTCGCG TTATGCGCAG
GAAAACCTCG GCCCGCTGGC AGGTTACATT ACCGGCTGGA CCTACTGCTT TGAAATCCTT
ATTGTCGCCA TCGCCGATGT GACCGCTTTT GGTATCTATA TGGGTGTCTG GTTCCCGACG
GTGCCGCACT GGATTTGGGT ACTGAGCGTG GTGCTGATCA TTTGCGCCGT AAACCTGATG
AGCGTGAAGG TATTCGGTGA GCTGGAATTC TGGTTCTCGT TCTTTAAAGT CGCCACCATC
ATCATCATGA TTGTCGCCGG TTTCGGCATC ATCATCTGGG GGATTGGCAA CGGCGGGCAA
CCGACCGGTA TTCATAACCT GTGGAGCAAC GGCGGCTTCT TCAGTAACGG CTGGCTTGGC
ATGGTAATGT CGTTGCAAAT GGTGATGTTT GCTTACGGTG GGATCGAAAT TATCGGGATT
ACCGCCGGTG AAGCGAAAGA TCCTGAGAAA TCGATACCGC GTGCGATTAA CTCCGTGCCG
ATGCGTATTC TGGTGTTCTA CGTCGGTACG CTGTTCGTCA TTATGTCTAT CTACCCGTGG
AATCAGGTTG GCACTGCCGG TAGCCCGTTC GTGCTGACGT TCCAGCATAT GGGCATTACC
TTTGCCGCCA GCATTCTTAA CTTTGTTGTG CTGACTGCTT CGCTGTCGGC AATTAACAGT
GATGTATTTG GCGTAGGCCG TATGCTCCAC GGTATGGCAG AGCAGGGCAG CGCGCCGAAA
ATTTTCAGCA AAACGTCGCG TCGCGGTATT CCGTGGGTTA CGGTGCTGGT GATGACTACC
GCGCTGCTGT TTGCGGTGTA TCTGAACTAC ATCATGCCGG AAAACGTCTT CCTGGTGATT
GCTTCGCTGG CAACCTTCGC CACGGTGTGG GTGTGGATTA TGATCCTGCT GTCGCAAATT
GCCTTCCGTC GCCGTTTGCC GCCAGAAGAA GTTAAGGCGC TGAAATTTAA AGTGCCGGGT
GGGGTAGCAA CGACCATCGG CGGGCTGATT TTCCTGCTCT TTATTATCGG GTTGATTGGT
TATCACCCGG ATACGCGTAT CTCGCTGTAT GTCGGTTTCG CGTGGATTGT TGTGCTGTTG
ATTGGCTGGA TGTTTAAGCG TCGCCACGAT CGTCAGCTGG CTGAAAACCA ATAA
 
Protein sequence
MESKNKLKRG LSTRHIRFMA LGSAIGTGLF YGSADAIKMA GPSVLLAYII GGIAAYIIMR 
ALGEMSVHNP AASSFSRYAQ ENLGPLAGYI TGWTYCFEIL IVAIADVTAF GIYMGVWFPT
VPHWIWVLSV VLIICAVNLM SVKVFGELEF WFSFFKVATI IIMIVAGFGI IIWGIGNGGQ
PTGIHNLWSN GGFFSNGWLG MVMSLQMVMF AYGGIEIIGI TAGEAKDPEK SIPRAINSVP
MRILVFYVGT LFVIMSIYPW NQVGTAGSPF VLTFQHMGIT FAASILNFVV LTASLSAINS
DVFGVGRMLH GMAEQGSAPK IFSKTSRRGI PWVTVLVMTT ALLFAVYLNY IMPENVFLVI
ASLATFATVW VWIMILLSQI AFRRRLPPEE VKALKFKVPG GVATTIGGLI FLLFIIGLIG
YHPDTRISLY VGFAWIVVLL IGWMFKRRHD RQLAENQ