Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0799 |
Symbol | |
ID | 6968894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 820993 |
End bp | 822474 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384826 |
Product | amino acid/peptide transporter |
Protein accession | YP_002269332 |
Protein GI | 209398739 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.256216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAC ACGCATCACA ACCGCGCGCT ATTTACTATG TCGTTGCGCT GCAAATCTGG GAATATTTTA GCTTTTACGG CATGCGTGCC CTGCTGATTC TCTATCTCAC CAATCAACTA AAATACAACG ATACTCACGC CTACGAGTTA TTTAGCGCCT ACTGTTCGCT GGTGTATGTC ACGCCAATCC TCGGTGGCTT TTTGGCGGAT AAAGTTCTCG GCAATCGCAT GGCGGTGATG CTGGGGGCGT TATTGATGGC GATCGGTCAT GTGGTGTTGG GTGCAAGTGA GATCCATCCG TCATTCCTCT ATCTGTCCCT GGCGATAATC GTTTGTGGCT ATGGTCTGTT TAAATCCAAT GTCAGTTGCC TGCTCGGCGA GCTGTATGAG CCAACCGATC CGCGTCGTGA TGGCGGTTTC TCGCTGATGT ATGCAGCGGG TAACGTGGGG TCTATTATCG CACCTATTGC CTGTGGCTAC GCCCAGGAAG AGTACAGTTG GGCGATGGGC TTTGGCCTGG CGGCGGTGGG TATGATCGCG GGTCTGGTTA TTTTCTTATG TGGCAATCGT CATTTCACAC ATACCCGCGG CGTTAACAAA AAAGTACTGC GTGCGACAAA CTTTCTGCTG CCGAACTGGG GATGGCTGCT GGTTCTGCTG GTGGCAACGC CTGCGCTGAT TACCGTACTG TTCTGGAAAG AATGGTCGGT ATACGCCTTG ATTGTCGCGA CTATCATTGG CCTGGGTGTA CTGGCAAAAA TTTATCGCAA AGCAGAAAAC CAGAAACAGC GGAAGGAGCT GAGGCTGATT GTGACGCTCA CCTTCTTCAG TATGTTGTTC TGGGCCTTCG CACAACAGGG CGGTAGCTCG ATTAGCCTTT ATATCGACCG CTTCGTTAAT CGCGATATGT TTGGTTATAC CGTTCCGACC GCGATGTTCC AGTCGATTAA TGCCTTCGCA GTTATGCTGT GCGGTGTGTT CCTGGCGTGG GTGGTAAAAG AGAGTGTCGC GGGTAATCGT ACCGTGCGCA TCTGGGGAAA ATTTGCTCTT GGCCTTGGCC TGATGAGCGC CGGATTCTGC ATTCTGACCT TAAGCGCCCG CTGGTCCGCA ATGTATGGTC ACTCTTCTCT GCCACTGATG GTATTAGGCC TGGCGGTGAT GGGCTTTGCG GAACTGTTTA TCGACCCGGT TGCTATGTCG CAGATTACGC GCATTGAAAT CCCCGGTGTG ACCGGCGTAT TAACCGGCAT TTATATGCTG CTTTCCGGCG CGATTGCGAA CTATCTGGCG GGCGTGATTG CCGATCAGAC ATCGCAGGCT TCGTTTGATG CTTCCGGGGC GATCAACTAC TCCATCAATG CATATATTGA AGTATTTGAT CAAATTACCT GGGGCGCACT GGCGTGTGTA GGAGTGGTAC TGATGATTTG GCTGTATCAG GCGCTGAAAT TCAGAAACCG CGCGCTGGCG CTGGAGTCTT AA
|
Protein sequence | MNKHASQPRA IYYVVALQIW EYFSFYGMRA LLILYLTNQL KYNDTHAYEL FSAYCSLVYV TPILGGFLAD KVLGNRMAVM LGALLMAIGH VVLGASEIHP SFLYLSLAII VCGYGLFKSN VSCLLGELYE PTDPRRDGGF SLMYAAGNVG SIIAPIACGY AQEEYSWAMG FGLAAVGMIA GLVIFLCGNR HFTHTRGVNK KVLRATNFLL PNWGWLLVLL VATPALITVL FWKEWSVYAL IVATIIGLGV LAKIYRKAEN QKQRKELRLI VTLTFFSMLF WAFAQQGGSS ISLYIDRFVN RDMFGYTVPT AMFQSINAFA VMLCGVFLAW VVKESVAGNR TVRIWGKFAL GLGLMSAGFC ILTLSARWSA MYGHSSLPLM VLGLAVMGFA ELFIDPVAMS QITRIEIPGV TGVLTGIYML LSGAIANYLA GVIADQTSQA SFDASGAINY SINAYIEVFD QITWGALACV GVVLMIWLYQ ALKFRNRALA LES
|
| |