Gene ECH74115_2647 details


Gene Information

Locus tag: ECH74115_2647
Symbol: tyrP
ID: 6970736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2497127
End bp: 2498338
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 51%
IMG OID: 643386510
Product: tyrosine-specific transport protein
Protein accession: YP_002270992
Protein GI: 209395924
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID: [TIGR00837] aromatic amino acid transport protein
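
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Drop one codon for the stop, which does not encode a residue.
    return length_bp // 3 - 1


length = gene_length_bp(2497127, 2498338)
print(length)                      # 1212
print(protein_length_aa(length))   # 403
```

Both values agree with the Gene Length (1212 bp) and Protein Length (403 aa) fields of this record.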


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.588852
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0000000318595
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAAAAACA GAACCCTGGG AAGTGTTTTT ATCGTGGCGG GAACCACAAT TGGCGCAGGC 
ATGCTGGCAA TGCCGCTGGC TGCGGCCGGT GTTGGTTTTA GCGTTACGTT AATCTTGTTG
ATTGGGCTTT GGGCGTTGAT GTGCTACACG GCGCTATTAC TGCTGGAGGT GTACCAGCAT
GTTCCGGCAG ATACCGGTCT GGGCACGCTG GCAAAACGCT ATCTGGGACG CTACGGTCAA
TGGCTGACGG GCTTCAGTAT GATGTTCTTA ATGTATGCTC TGACTGCGGC ATACATCAGC
GGTGCCGGTG AATTGTTGGC CTCCAGCATC AGCGACTGGA CAGGTATTTC AATGTCGGCA
ACCGCTGGCG TGTTGTTGTT CACTTTTGTT GCCGGTGGCG TGGTTTGTGT CGGGACATCG
CTGGTCGATT TATTTAACCG TTTTCTGTTC AGCGCCAAAA TTATTTTTCT GGTGGTGATG
CTGGTATTGC TGCTACCGCA TATTCACAAA GTGAATCTTT TAACCCTGCC GTTGCAACAG
GGGCTGGCAC TGTCTGCAAT CCCGGTGATT TTTACGTCGT TTGGTTTTCA CGGTAGCGTG
CCGAGTATTG TCAGCTATAT GGATGGCAAC ATTCGTAAGC TACGCTGGGT GTTTATAACC
GGTAGTGCGA TCCCCCTGGT GGCATATATT TTCTGGCAGG TGGCGACGCT TGGCAGCATT
GATTCAACAA CCTTTATGGG ATTGCTGGCT AATCATGCCG GATTAAACGG GCTGTTACAG
GCGTTACGCG AAATGGTGGC CTCTCCGCAT GTTGAGCTGG CAGTGCATTT ATTTGCTGAT
TTAGCCCTTG CCACGTCATT TCTCGGCGTT GCGTTAGGCT TATTTGATTA TCTGGCAGAT
TTGTTTCAGC GTTCAAATAC CGTTGGTGGA CGATTGCAAA CTGGAGCAAT TACCTTTCTG
CCACCGTTGG CGTTTGCACT GTTTTATCCA CGAGGATTTG TGATGGCGCT GGGTTACGCC
GGTGTGGCGC TGGCGGTACT GGCATTGATT ATCCCTTCGC TGTTGACCTG GCAAAGCAGA
AAGCACAATC CTCAAGCGGG TTACCGGGTC AAAGGTGGTC GTCCGGCGCT GGTGGTGGTG
TTTCTCTGTG GTATTGCTGT GATTGGCGTG CAATTTTTGA TTGCGGCAGG GTTGTTACCA
GAAGTGGGGT GA
 
Protein sequence
MKNRTLGSVF IVAGTTIGAG MLAMPLAAAG VGFSVTLILL IGLWALMCYT ALLLLEVYQH 
VPADTGLGTL AKRYLGRYGQ WLTGFSMMFL MYALTAAYIS GAGELLASSI SDWTGISMSA
TAGVLLFTFV AGGVVCVGTS LVDLFNRFLF SAKIIFLVVM LVLLLPHIHK VNLLTLPLQQ
GLALSAIPVI FTSFGFHGSV PSIVSYMDGN IRKLRWVFIT GSAIPLVAYI FWQVATLGSI
DSTTFMGLLA NHAGLNGLLQ ALREMVASPH VELAVHLFAD LALATSFLGV ALGLFDYLAD
LFQRSNTVGG RLQTGAITFL PPLAFALFYP RGFVMALGYA GVALAVLALI IPSLLTWQSR
KHNPQAGYRV KGGRPALVVV FLCGIAVIGV QFLIAAGLLP EVG
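
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It assumes the standard codon assignments, which translation table 11 shares for sense and stop codons, and it treats an initial ATG, GTG, or TTG as a start codon read as methionine. That start set is an assumed subset of the alternative starts table 11 allows, and `translate_cds` is a hypothetical helper name.

```python
# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An alternative start codon is still read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]  # raises KeyError on non-ACGT characters
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAAAAACA GAACCCTGGG AAGTGTTTTT"))  # MKNRTLGSVF
```

The output matches the first ten residues of the protein sequence, and it shows why the gene begins with GTG while the protein begins with M.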