Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5646 |
Symbol | |
ID | 6971862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5284604 |
End bp | 5286061 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643389280 |
Product | amino acid/peptide transporter |
Protein accession | YP_002273677 |
Protein GI | 209398346 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAC CCTCACAGCC GCGCGCGATA TACTATATCG TGGCGATCCA AATCTGGGAG TACTTCAGTT TTTACGGCAT GCGTGCCTTA CTCATTCTCT ATCTCACCCA TCAGCTTGGT TTTGATGATA ACCATGCCAT CAGCCTGTTC AGCGCATATG CTTCTCTGGT TTACGTTACC CCTATTCTCG GCGGCTGGCT TGCCGACCGC CTGCTCGGCA ACCGCACTGC AGTGATTGCC GGCGCGCTGT TAATGACCCT TGGCCATGTG GTGCTGGGTA TTGATACAAA TTCAACCTTT AGCCTGTATC TGGCGCTGGC AATCATTATT TGTGGCTACG GTTTATTCAA ATCAAACATC AGCTGTTTGC TTGGCGAGCT CTACGACGAG AACGATCATC GACGTGATGG CGGTTTTTCG CTGCTGTATG CTGCGGGCAA TATCGGTTCT ATCGCCGCCC CAATCGCCTG CGGCCTGGCT GCTCAGTGGT ATGGATGGCA TGTTGGCTTT GCCCTTGCGG GTGGCGGCAT GTTTATCGGT TTGTTGATTT TCTTAAGCGG GCATCGTCAT TTCCAGTCCA CGCGTAGTAT GGATAAAAAA GCGCTCACTA GCGTCAAATT TGCCTTACCG GTATGGAGCT GGTTAGTGGT GATGCTCTGT TTAGCCCCAG TATTTTTTAC CCTGCTGCTG GAGAACGACT GGTCCGGATA TTTGCTGGCG ATCGTTTGCC TCATTGCCGC ACAAATCATT GCCCGCATGA TGATCAAATT CCCCGAACAC CGCCGTGCTC TTTGGCAAAT TGTATTGTTG ATGTTTGTCG GGACATTGTT CTGGGTACTG GCACAACAGG GCGGCAGTAC CATCAGCTTG TTTATCGATC GCTTTGTGAA TCGTCAGGCA TTCAATATTG AAGTACCTAC TGCACTATTC CAGTCGGTGA ATGCAATTGC GGTGATGCTC GCTGGGGTTG TTCTGGCCTG GCTGGCGTCG CCAGAAAGCC ACGGCAACTC AACATTGCGC GTCTGGCTGA AGTTTGCCTT TGGCTTACTG CTGATGGCTT GTGGCTTTAT GTTGCTGGCA TTTGATGCCC GACATGCAGC AGCTGACGGT CAAGCGTCAA TGGGCGTGAT GGTATCCGGG CTGGCGCTAA TGGGCTTTGC CGAACTCTTT ATTGACCCGG TGGCGATTGC GCAAATCACG CGTCTGAAAA TGTCTGGCGT ATTAACCGGG ATTTATATGC TGGCAACAGG CGCGGTCGCC AACTGGCTGG CAGGCGTAGT GGCACAGCAG ACGACAGAGT CGCAAATTAG CGGTATGGCA ATTGCAGCTT ACCAGCGATT CTTTTCTCAG ATGGGAGAGT GGACGTTGGC TTGTGTCGCG ATCATCGTGG TATTGGCCTT TGCTACCCGT TTCCTGTTTA GCACGCCGAC GAATATGGTA CAGGAGAGCA ACGATTAA
|
Protein sequence | MKTPSQPRAI YYIVAIQIWE YFSFYGMRAL LILYLTHQLG FDDNHAISLF SAYASLVYVT PILGGWLADR LLGNRTAVIA GALLMTLGHV VLGIDTNSTF SLYLALAIII CGYGLFKSNI SCLLGELYDE NDHRRDGGFS LLYAAGNIGS IAAPIACGLA AQWYGWHVGF ALAGGGMFIG LLIFLSGHRH FQSTRSMDKK ALTSVKFALP VWSWLVVMLC LAPVFFTLLL ENDWSGYLLA IVCLIAAQII ARMMIKFPEH RRALWQIVLL MFVGTLFWVL AQQGGSTISL FIDRFVNRQA FNIEVPTALF QSVNAIAVML AGVVLAWLAS PESHGNSTLR VWLKFAFGLL LMACGFMLLA FDARHAAADG QASMGVMVSG LALMGFAELF IDPVAIAQIT RLKMSGVLTG IYMLATGAVA NWLAGVVAQQ TTESQISGMA IAAYQRFFSQ MGEWTLACVA IIVVLAFATR FLFSTPTNMV QESND
|
| |