Gene EcSMS35_0594 details


Gene Information

Locus tag: EcSMS35_0594
Symbol: pheP
ID: 6146649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 604443
End bp: 605819
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 53%
IMG OID: 641615486
Product: phenylalanine transporter
Protein accession: YP_001742692
Protein GI: 170683004
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1113] Gamma-aminobutyrate permease and related permeases
TIGRFAM ID:
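
The coordinates and lengths listed above agree with one another, which a few lines of arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon (the record states neither convention). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(604443, 605819)
print(length)                  # 1377
print(protein_length(length))  # 458
```

Both values match the Gene Length and Protein Length fields of the record.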


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAAACG CGTCAACCGT ATCGGAAGAT ACTGCGTCGA ATCAAGAGCC GACGCTTCAT 
CGCGGATTAC ATAACCGTCA TATTCAACTG ATTGCGCTGG GTGGCGCAAT TGGTACTGGT
CTGTTTCTTG GCATTGGCCC GGCGATTCAG ATGGCGGGTC CGGCTGTATT GCTGGGCTAC
GGCGTCGCCG GGATCATCGC TTTCCTGATT ATGCGCCAGC TCGGCGAAAT GGTGGTTGAG
GAGCCGGTAT CCGGTTCATT TGCCCACTTT GCCTATAAAT ACTGGGGACC GTTTGCGGGC
TTCCTCTCTG GCTGGAACTA CTGGGTAATG TTCGTGCTGG TGGGAATGGC AGAGCTGACC
GCTGCGGGCA TCTATATGCA GTACTGGTTC CCGGATGTTC CAACGTGGAT TTGGGCTGCC
GCCTTCTTTA TTATCATCAA CGCCGTTAAC CTGGTGAACG TGCGCTTATA TGGCGAAACC
GAGTTCTGGT TTGCGCTGAT TAAAGTGCTG GCGATCATCG GTATGATCGG CTTTGGCCTG
TGGCTGCTGT TTTCTGGTCA CGGCGGCGAG AAAGCCAGTA TCGATAACCT CTGGCGCTAC
GGTGGTTTTT TTGCCACCGG CTGGAATGGG CTGATTTTGT CGCTGGCGGT GATTATGTTC
TCCTTCGGCG GGCTGGAGCT GATTGGGATT ACTGCCGCTG AAGCGCGCGC TCCGGAAAAA
AGCATCCCGA AAGCAGTGAA TCAGGTGGTG TATCGCATCC TGCTGTTTTA CATCGGTTCA
CTGGTGGTTT TACTGGCACT CTATCCGTGG GTGGAAGTGA AATCTAACAG TAGCCCGTTT
GTGATGATTT TCCATAATCT CGACAGCAAC GTGGTAGCTT CTGCGCTGAA CTTCGTCATT
CTGGTAGCAT CATTATCGGT GTATAACAGC GGGGTTTACT CTAACAGCCG CATGCTGTTT
GGCCTTTCTG TGCAGGGTAA TGCGCCGAAG TTTTTGACTC GCGTCAGCCG TCGCGGTGTG
CCGATTAACT CGCTGATGCT TTCCGGAGCG ATCACTTCGC TGGTGGTGTT AATCAACTAT
CTGCTGCCGC AAAAAGCGTT TGGTCTGCTG ATGGCGCTGG TGGTAGCAAC GCTGCTGTTG
AACTGGATTA TGATCTGTCT GGCACATCTG CGTTTTCGTG CAGCGATGCG ACGTCAGGGG
CGTGAAACAC AGTTTAAGGC GCTGCTCTAT CCGTTCGGCA ACTATCTCTG CATCGCCTTC
CTCGGCATGA TTTTGCTGCT GATGTGCACG ATGGATGATA TGCGCTTGTC AGCGATTCTG
CTGCCGGTGT GGATTGTATT CCTGTTTGTG GCATTTAAAA CGCTGCGTCG GAAATAA
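
The gene sequence is displayed in groups of ten bases, so the spacing has to be removed before the sequence can be measured or analysed. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first line of the sequence, so its GC value is not expected to equal the whole-gene figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-base groups."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as a small example.
first_line = "GTGAAAAACG CGTCAACCGT ATCGGAAGAT ACTGCGTCGA ATCAAGAGCC GACGCTTCAT"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this 60-base fragment only
```

Applying the same two functions to the full sequence would give values to compare against the Gene Length and GC content fields.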
 
Protein sequence
MKNASTVSED TASNQEPTLH RGLHNRHIQL IALGGAIGTG LFLGIGPAIQ MAGPAVLLGY 
GVAGIIAFLI MRQLGEMVVE EPVSGSFAHF AYKYWGPFAG FLSGWNYWVM FVLVGMAELT
AAGIYMQYWF PDVPTWIWAA AFFIIINAVN LVNVRLYGET EFWFALIKVL AIIGMIGFGL
WLLFSGHGGE KASIDNLWRY GGFFATGWNG LILSLAVIMF SFGGLELIGI TAAEARAPEK
SIPKAVNQVV YRILLFYIGS LVVLLALYPW VEVKSNSSPF VMIFHNLDSN VVASALNFVI
LVASLSVYNS GVYSNSRMLF GLSVQGNAPK FLTRVSRRGV PINSLMLSGA ITSLVVLINY
LLPQKAFGLL MALVVATLLL NWIMICLAHL RFRAAMRRQG RETQFKALLY PFGNYLCIAF
LGMILLLMCT MDDMRLSAIL LPVWIVFLFV AFKTLRRK
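
The protein sequence begins with M even though the gene sequence begins with GTG. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can serve as an initiation codon and is then read as methionine. The sketch below is a minimal illustration in plain Python, not the tool the database used. The codon table follows the standard TCAG ordering, and the set of start codons is the one listed for NCBI table 11. The function name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a complete or partial CDS, reading the first codon as Met."""
    seq = "".join(seq.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]  # the initiation codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases of the gene sequence above.
print(translate_cds("GTGAAAAACG CGTCAACC"))  # MKNAST
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence to the same function would allow a residue-by-residue comparison with the listed protein.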