Gene ECH74115_0657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0657 
SymbolpheP 
ID6967213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp687727 
End bp689139 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content53% 
IMG OID643384694 
Productphenylalanine transporter 
Protein accessionYP_002269207 
Protein GI209396862 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCCTCA ACAAAAAAGA CACACAGGGG AAAGGCGTGA AAAACGCGTC AAACGTATCG 
GAAGATACTG CGTCGAATCA AGAGCCGACG CTTCATCGCG GATTACATAA CCGTCATATT
CAACTGATTG CGCTGGGTGG CGCAATTGGT ACTGGTCTGT TTCTTGGCAT TGGCCCGGCG
ATTCAGATGG CGGGTCCGGC TGTATTGCTG GGCTACGGCG TCGCCGGGAT CATCGCTTTC
CTGATTATGC GCCAGCTCGG CGAGATGGTG GTCGAAGAGC CGGTATCCGG TTCATTTGCC
CACTTTGCCT ATAAATACTG GGGACCGTTC GCCGGGTTCC TCTCCGGCTG GAACTACTGG
GTGATGTTCG TGCTGGTGGG AATGGCAGAG CTGACCGCTG CGGGCATCTA TATGCAGTAC
TGGTTCCCGG ATGTTCCAAC GTGGATTTGG GCTGCCGCCT TCTTTATTAT CATCAACGCC
GTTAACCTGG TGAACGTGCG CTTATATGGC GAAACCGAGT TCTGGTTTGC GCTGATTAAA
GTGCTGGCAA TCATCGGTAT GATCGGCTTT GGCCTGTGGC TGCTGTTTTC TGGTCACGGC
GGCGAGAAAG CCAGTATCGA CAACCTCTGG CGCTACGGTG GTTTCTTCGC CACCGGCTGG
AATGGGCTGA TTTTGTCGCT GGCGGTAATT ATGTTCTCCT TCGGCGGTCT GGAGCTGATT
GGGATTACTG CCGCTGAAGC GCGCGATCCG GAAAAAAGCA TTCCAAAAGC GGTAAATCAG
GTGGTGTATC GCATCCTGCT GTTTTACATC GGTTCACTGG TGGTTTTACT GGCGCTCTAT
CCGTGGGTGG AAGTGAAATC CAACAGTAGC CCGTTTGTGA TGATTTTCCA TAATCTCGAC
AGCAACGTGG TAGCTTCTGC GCTGAACTTC GTCATTCTGG TAGCATCGCT GTCAGTGTAT
AACAGCGGGG TTTACTCTAA CAGCCGCATG CTGTTTGGCC TTTCTGTGCA GGGTAATGCG
CCGAAGTTCC TTACCCGTGT TAGCCGTCGC GGCGTGCCGA TTAACTCGCT GATGCTTTCC
GGAGCGATCA CTTCGCTGGT GGTGTTAATC AACTATCTGC TGCCGCAAAA AGCGTTTGGT
CTGCTGATGG CGCTGGTGGT AGCAACGCTG CTGTTGAACT GGATTATGAT CTGTCTGGCG
CATCTGCGTT TTCGTGCAGC GATGCGACGT CAGGGGCGTG AAACACAGTT TAAGGCGCTG
CTCTATCCGT TCGGCAACTA TCTTTGCATC GCCTTCCTCG CCATGATTTT GCTGCTGATG
TGCACGATGG ATGATATGCG CTTGTCAGCG ATCCTGCTGC CGGTGTGGAT TGTATTCCTG
TTTGTAGCAT TTAAAACGCT GCGTCGGAAA TAA
 
Protein sequence
MPLNKKDTQG KGVKNASNVS EDTASNQEPT LHRGLHNRHI QLIALGGAIG TGLFLGIGPA 
IQMAGPAVLL GYGVAGIIAF LIMRQLGEMV VEEPVSGSFA HFAYKYWGPF AGFLSGWNYW
VMFVLVGMAE LTAAGIYMQY WFPDVPTWIW AAAFFIIINA VNLVNVRLYG ETEFWFALIK
VLAIIGMIGF GLWLLFSGHG GEKASIDNLW RYGGFFATGW NGLILSLAVI MFSFGGLELI
GITAAEARDP EKSIPKAVNQ VVYRILLFYI GSLVVLLALY PWVEVKSNSS PFVMIFHNLD
SNVVASALNF VILVASLSVY NSGVYSNSRM LFGLSVQGNA PKFLTRVSRR GVPINSLMLS
GAITSLVVLI NYLLPQKAFG LLMALVVATL LLNWIMICLA HLRFRAAMRR QGRETQFKAL
LYPFGNYLCI AFLAMILLLM CTMDDMRLSA ILLPVWIVFL FVAFKTLRRK