Gene SbBS512_E0470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0470 
SymbolpheP 
ID6268659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp454126 
End bp455502 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content53% 
IMG OID641724688 
Productphenylalanine transporter 
Protein accessionYP_001879235 
Protein GI187731101 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAACG CGTCAACCGT ATCGGAAGAT ACTGCGTCGA ATCAAGAGCC GACGCTTCAT 
CGCGGATTAC ATAACCGTCA TATTCAACTG ATTGCGCTGG GTGGCGCAAT TGGTACTGGT
CTGTTTCTTG GCATTGGCCC GGCGATTCAG ATGGCGGGTC CGGCTGTATT GCTGGGCTAC
GGCGTCGCCG GGATCATCGC TTTCCTGATT ATGCGCCAGC TCGGCGAGAT GGTGGTCGAA
GAGCCGGTAT CCGGTTCATT TGCCCACTTT GCCTATAAAT ACTGGGGACC GTTTGCGGGC
TTCCTCTCTG GCTGGAACTA CTGGGTAATG TTCGTGCTGG TGGGAATGGC AGAGCTGACC
GCTGCGGGCA TCTATATGCA GTACTGGTTC CCGGATGTTC CAACGTGGAT TTGGGCTGCC
GCCTTCTTTA TTATCATCAA CGCCGTTAAC CTGGTGAACG TGCGCTTATA TGGCGAAACC
GAGTTCTGGT TTGCGTTGAT TAAAGTGCTG GCAATCATCG GTATGATCGG CTTTGGCCTG
TGGCTGCTGT TTTCTGGTCA CGGCGGCGAG AAAGCCAGTA TCGACAACCT CTGGTGCTAC
GGTGGTTTCT TCGCCACCGG CTGGAATGGG CTGATTTTGT CGCTGGCGGT AATTATGTTC
TCCTTCGGCG GTCTGGAGCT GATTGGGATT ACTGCCGCTG AAGCGCGCGA TCCGGAAAAA
AGCATTCCAA AAGCGGTAAA TCAGGTGGTG TATCGCATCC TGCTGTTTTA CATCGGTTCA
CTGGTGGTTT TACTGGCGCT CTATCCGTGG GTGGAAGTGA AATCCAACAG TAGCCCGTTT
GTGATGATTT TCCATAATCT CGACAGCAAC GAGGTAGCTT CTGCGCTGAA CTTCGTCATT
CTGGTAGCAT CGCTGTCAGT GTATAACAGC GGGGTTTACT CTAACAGCCG CATGCTGTTT
GGCCTTTCTG TGCAGGGTAA TGCGCCGAAG TTTTTGACTC GCGTCAGCCA TCGCGGCGTG
CCGATTAACT CGCTGATGCT TTCCGGAGCG ATCACTTCGC TGGTGGTGTT AATCAACTAT
CTGCTGCCGC AAAAAGCGTT TGGTCTGCTG ATGGCGCTGG TGGTAGCAAC GCTGCTGTTG
AACTGGATTA TGATCTGCCT GGCGCATCTG CGTTTTCGCG CGGCGATGCG ACGTCAGGGA
CGTGAAACAC AGTTTAAGGC GCTGCTTTAT CCGTTCGGCA ACTATCTTTG CATTGCCTTC
CTCGGCATGA TTTTGCTGCT GATGTGCACG ATGGATGATA TGCGCTTGTC AGCGATCCTG
CTGCCGGTGT GGATTGTATT CCTGTTTGTG GCATTTAAAA CGCTGCGTCG GAAATAA
 
Protein sequence
MKNASTVSED TASNQEPTLH RGLHNRHIQL IALGGAIGTG LFLGIGPAIQ MAGPAVLLGY 
GVAGIIAFLI MRQLGEMVVE EPVSGSFAHF AYKYWGPFAG FLSGWNYWVM FVLVGMAELT
AAGIYMQYWF PDVPTWIWAA AFFIIINAVN LVNVRLYGET EFWFALIKVL AIIGMIGFGL
WLLFSGHGGE KASIDNLWCY GGFFATGWNG LILSLAVIMF SFGGLELIGI TAAEARDPEK
SIPKAVNQVV YRILLFYIGS LVVLLALYPW VEVKSNSSPF VMIFHNLDSN EVASALNFVI
LVASLSVYNS GVYSNSRMLF GLSVQGNAPK FLTRVSHRGV PINSLMLSGA ITSLVVLINY
LLPQKAFGLL MALVVATLLL NWIMICLAHL RFRAAMRRQG RETQFKALLY PFGNYLCIAF
LGMILLLMCT MDDMRLSAIL LPVWIVFLFV AFKTLRRK