Gene EcHS_A0623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0623 
SymbolpheP 
ID5594645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp638053 
End bp639429 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content53% 
IMG OID640919804 
Productphenylalanine transporter 
Protein accessionYP_001457386 
Protein GI157160068 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.0943118 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAACG CGTCAACCGT ATCGGAAGAT ACTGCGTCGA ATCAAGAGCC GACGCTTCAT 
CGCGGATTAC ATAACCGTCA TATTCAACTG ATTGCGTTGG GTGGCGCAAT TGGTACTGGT
CTGTTTCTTG GCATTGGCCC GGCGATTCAG ATGGCGGGTC CGGCTGTATT GCTGGGCTAC
GGCGTCGCCG GGATCATCGC TTTCCTGATT ATGCGCCAGC TTGGCGAAAT GGTGGTTGAG
GAGCCGGTAT CCGGTTCATT TGCCCACTTT GCCTATAAAT ACTGGGGACC GTTTGCGGGC
TTCCTCTCTG GCTGGAACTA CTGGGTAATG TTCGTGCTGG TGGGAATGGC AGAGCTGACC
GCTGCGGGCA TCTATATGCA GTACTGGTTC CCGGATGTTC CAACGTGGAT TTGGGCTGCC
GCCTTCTTTA TTATCATCAA CGCCGTTAAC CTGGTGAACG TGCGCTTATA TGGCGAAACC
GAGTTCTGGT TTGCGCTGAT TAAAGTGCTG GCGATCATCG GTATGATCGG CTTTGGCCTG
TGGCTGCTGT TTTCTGGTCA CGGCGGCGAG AAAGCCAGTA TCGACAACCT CTGGCGCTAC
GGTGGTTTCT TCGCCACCGG CTGGAATGGG CTGATTTTGT CGCTGGCGGT AATTATGTTC
TCCTTCGGCG GTCTGGAGCT GATTGGGATT ACTGCCGCTG AAGCGCGCGA TCCGGAAAAA
AGCATTCCAA AAGCGGTAAA TCAGGTGGTG TATCGCATCC TGCTGTTTTA CATCGGTTCA
CTGGTGGTTT TACTGGCGCT CTATCCGTGG ATGGAAGTGA AATCCAACAG TAGCCCGTTT
GTGATGATTT TCCATAATCT CGACAGCAAC GTGGTAGCTT CTGCGCTGAA CTTCGTCATT
CTGGTAGCAT CGCTGTCAGT GTATAACAGC GGGGTTTACT CTAACAGCCG CATGCTGTTT
GGCCTTTCTG TGCAGGGTAA TGCGCCGAAG TTTTTGACTC GCGTCAGCCG TCGCGGTGTG
CCGATTAACT CGCTGATGCT TTCCGGAGCG ATCACTTCGC TGGTGGTGTT AATCAACTAT
CTGCTGCCGC AAAAAGCGTT TGGTCTGCTG ATGGCGCTGG TGGTAGCAAC GCTGCTGTTG
AACTGGATTA TGATCTGTCT GGCGCATCTG CGTTTTCGTG CAGCGATGCG ACGTCAGGGG
CGTGAAACAC AGTTTAAGGC GCTGCTTTAT CCGTTCGGCA ACTATCTTTG CATCGCCTTC
CTCGGCATGA TTTTGCTGCT GATGTGCACG ATGGATGATA TGCGCTTGTC AGCGATCCTG
CTGCCGGTGT GGATTGTATT CCTGTTTGTG GCATTTAAAA CGCTGCGTCG GAAATAA
 
Protein sequence
MKNASTVSED TASNQEPTLH RGLHNRHIQL IALGGAIGTG LFLGIGPAIQ MAGPAVLLGY 
GVAGIIAFLI MRQLGEMVVE EPVSGSFAHF AYKYWGPFAG FLSGWNYWVM FVLVGMAELT
AAGIYMQYWF PDVPTWIWAA AFFIIINAVN LVNVRLYGET EFWFALIKVL AIIGMIGFGL
WLLFSGHGGE KASIDNLWRY GGFFATGWNG LILSLAVIMF SFGGLELIGI TAAEARDPEK
SIPKAVNQVV YRILLFYIGS LVVLLALYPW MEVKSNSSPF VMIFHNLDSN VVASALNFVI
LVASLSVYNS GVYSNSRMLF GLSVQGNAPK FLTRVSRRGV PINSLMLSGA ITSLVVLINY
LLPQKAFGLL MALVVATLLL NWIMICLAHL RFRAAMRRQG RETQFKALLY PFGNYLCIAF
LGMILLLMCT MDDMRLSAIL LPVWIVFLFV AFKTLRRK