Gene ECD_00537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00537 
SymbolpheP 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp560641 
End bp562017 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content53% 
IMG OID 
Productphenylalanine transporter 
Protein accessionACT42417 
Protein GI253976747 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAACG CGTCAACCGT ATCGGAAGAT ACTGCGTCGA ATCAAGAGCC GACGCTTCAT 
CGCGGATTAC ATAACCGTCA TATTCAACTG ATTGCGTTGG GTGGCGCAAT TGGTACTGGT
CTGTTTCTTG GCATTGGCCC GGCGATTCAG ATGGCGGGTC CGGCTGTATT GCTGGGCTAC
GGCGTCGCCG GGATCATCGC TTTCCTGATT ATGCGCCAGC TTGGCGAAAT GGTGGTTGAG
GAGCCGGTAT CCGGTTCATT TGCCCACTTT GCCTATAAAT ACTGGGGACC GTTTGCGGGC
TTCCTCTCTG GCTGGAACTA CTGGGTAATG TTCGTGCTGG TGGGAATGGC AGAGCTGACC
GCTGCGGGCA TCTATATGCA GTACTGGTTC CCGGATGTTC CAACGTGGAT TTGGGCTGCC
GCCTTCTTTA TTATCATCAA CGCCGTTAAC CTGGTGAACG TGCGCTTATA TGGCGAAACC
GAGTTCTGGT TTGCGCTGAT TAAAGTGCTG GCGATCATCG GTATGATCGG CTTTGGCCTG
TGGCTGCTGT TTTCTGGTCA CGGCGGCGAG AAAGCCAGTA TCGACAACCT CTGGCGCTAC
GGTGGTTTCT TCGCCACCGG CTGGAATGGG CTGATTTTGT CGCTGGCGGT AATTATGTTC
TCCTTCGGCG GTCTGGAGCT GATTGGGATT ACTGCCGCTG AAGCGCGCGA TCCGGAAAAA
AGCATTCCAA AAGCGGTAAA TCAGGTGGTG TATCGCATCC TGCTGTTTTA CATCGGTTCA
CTGATGGTTT TACTGGCGCT CTATCCGTGG GTGGAAGTGA AATCCAACAG TAGCCCGTTT
GTGATGATTT TCCATAATCT CGACAGCAAC GTGGTAGCTT CTGCGCTGAA CTTCGTCATT
CTGGTAGCAT CGCTGTCAGT GTATAACAGC GGGGTTTACT CTAACAGCCG CATGCTGTTT
GGCCTTTCTG TGCAGGGTAA TGCGCCGAAG TTTTTGACTC GCGTCAGCCG TCGCGGTGTG
CCGATTAACT CGCTGATGCT TTCCGGAGCG ATCACTTCGC TGGTGGTGTT AATCAACTAT
CTGCTGCCGC AAAAAGCGTT TGGTCTGCTG ATGGCGCTGG TGGTAGCAAC GCTGCTGTTG
AACTGGATTA TGATCTGTCT GGCGCATCTG CGTTTTCGTG CAGCGATGCG ACGTCAGGGG
CGTGAAACAC AGTTTAAGGC GCTGCTTTAT CCGTTCGGCA ACTATCTTTG CATCGCCTTC
CTCGGCATGA TTTTGCTGCT GATGTGCACG ATGGATGATA TGCGCTTGTC AGCGATCCTG
CTGCCGGTGT GGATTGTATT CCTGTTTGTG GCATTTAAAA CGCTGCGTCG GAAATAA
 
Protein sequence
MKNASTVSED TASNQEPTLH RGLHNRHIQL IALGGAIGTG LFLGIGPAIQ MAGPAVLLGY 
GVAGIIAFLI MRQLGEMVVE EPVSGSFAHF AYKYWGPFAG FLSGWNYWVM FVLVGMAELT
AAGIYMQYWF PDVPTWIWAA AFFIIINAVN LVNVRLYGET EFWFALIKVL AIIGMIGFGL
WLLFSGHGGE KASIDNLWRY GGFFATGWNG LILSLAVIMF SFGGLELIGI TAAEARDPEK
SIPKAVNQVV YRILLFYIGS LMVLLALYPW VEVKSNSSPF VMIFHNLDSN VVASALNFVI
LVASLSVYNS GVYSNSRMLF GLSVQGNAPK FLTRVSRRGV PINSLMLSGA ITSLVVLINY
LLPQKAFGLL MALVVATLLL NWIMICLAHL RFRAAMRRQG RETQFKALLY PFGNYLCIAF
LGMILLLMCT MDDMRLSAIL LPVWIVFLFV AFKTLRRK