Gene ECH74115_4843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4843 
Symbol 
ID6968930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4478251 
End bp4479720 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content51% 
IMG OID643388534 
Productinner membrane transporter YhiP 
Protein accessionYP_002272962 
Protein GI209397711 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID[TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.953433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACAA CAACACCCAT GGGGATGCTG CAGCAACCTC GCCCATTTTT CATGATCTTT 
TTTGTCGAGT TATGGGAGCG ATTCGGCTAC TACGGCGTGC AGGGCGTACT GGCGGTTTTC
TTCGTTAAAC AGCTTGGATT CTCGCAAGAG CAGGCTTTTG TCACTTTTGG TGCTTTTGCT
GCACTGGTCT ATGGCCTCAT TTCCATTGGC GGCTATGTCG GCGACCACCT GCTGGGGACC
AAACGCACCA TCGTTCTTGG AGCACTTGTG CTGGCGATTG GCTACTTCAT GACCGGCATG
TCGCTACTTA AGCCTGACCT GATTTTCATC GCCCTGGGGA CTATCGCTGT CGGTAACGGC
CTGTTTAAAG CTAACCCAGC CAGCTTGCTT TCGAAGTGCT ATCCGCCGAA AGATCCGCGC
CTTGATGGCA CATTCACCCT GTTCTATATG TCGATCAATA TCGGCTCGTT GATAGCGTTA
TCGCTGGCCC CTGTGATCGC TGATAAATTC GGCTATTCAG TCACCTACAA CCTGTGCGGT
GCGGGGTTAA TTATCGCATT ACTGGTTTAC ATCGCCTGTC GTGGAATGGT GAAAGACATT
GGTTCTGAGC CCGACTTCCG GCCAATGAGC TTCAGCAAAC TGTTGTACGT ATTACTTGGC
AGCGTGGTGA TGATCTTCGT CTGCGCCTGG CTGATGCACA ACGTAGAAGT CGCCAATCTG
GTGCTGATTG TTCTCTCCAT CGTCGTCACC ATCATCTTCT TTCGTCAGGC ATTCAAGCTG
GATAAAACCG GGCGCAATAA AATGTTTGTC GCCTTTGTCC TGATGCTCGA AGCGGTGGTG
TTTTACATTC TCTACGCCCA GATGCCAACA TCGCTGAACT TCTTTGCCAT CAACAACGTG
CATCATGAAA TTCTCGGTTT TTCCATCAAC CCGGTCAGCT TCCAGGCGCT TAACCCGTTC
TGGGTGGTAC TCGCCAGCCC AATACTGGCA GGCATTTACA CGCATCTGGG TAACAAAGGC
AAAGACCTCT CGATGCCGAT GAAATTTACT CTCGGCATGT TTATGTGCTC ACTGGGCTTT
TTGACGGCGG CAGCTGCGGG AATGTGGTTT GCGGATGCAC AAGGGCTGAC ATCGCCATGG
TTTATCGTGC TGGTGTACTT ATTCCAGAGC TTAGGTGAAC TGTTTATTAG CGCCCTTGGC
CTGGCGATGA TTGCTGCTCT GGTGCCGCAG CATTTGATGG GCTTTATTCT CGGGATGTGG
TTCCTGACAC AGGCTGCCGC GTTCTTGCTG GGCGGCTATG TGGCAACATT TACCGCAGTA
CCAGACAACA TTACCGATCC GCTTGAGACG TTGCCTGTCT ATACCAACGT GTTTGGTAAG
ATTGGTCTGG TCACGCTGGG CGTTGCAGTA GTGATGCTGT TGATGGTGCC GTGGCTGAAA
CGCATGATTG CGACGCCGGA AAGCCATTAA
 
Protein sequence
MNTTTPMGML QQPRPFFMIF FVELWERFGY YGVQGVLAVF FVKQLGFSQE QAFVTFGAFA 
ALVYGLISIG GYVGDHLLGT KRTIVLGALV LAIGYFMTGM SLLKPDLIFI ALGTIAVGNG
LFKANPASLL SKCYPPKDPR LDGTFTLFYM SINIGSLIAL SLAPVIADKF GYSVTYNLCG
AGLIIALLVY IACRGMVKDI GSEPDFRPMS FSKLLYVLLG SVVMIFVCAW LMHNVEVANL
VLIVLSIVVT IIFFRQAFKL DKTGRNKMFV AFVLMLEAVV FYILYAQMPT SLNFFAINNV
HHEILGFSIN PVSFQALNPF WVVLASPILA GIYTHLGNKG KDLSMPMKFT LGMFMCSLGF
LTAAAAGMWF ADAQGLTSPW FIVLVYLFQS LGELFISALG LAMIAALVPQ HLMGFILGMW
FLTQAAAFLL GGYVATFTAV PDNITDPLET LPVYTNVFGK IGLVTLGVAV VMLLMVPWLK
RMIATPESH