Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4331 |
Symbol | |
ID | 6970088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4009544 |
End bp | 4011151 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643388058 |
Product | putative binding protein |
Protein accession | YP_002272496 |
Protein GI | 209396769 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATACGC GAAATTTATT ATGGCTGGTC AGCCTGGTAA GTGCGGCTCC TCTCTACGCT GCTGACGTTC CCACCAACAC ACAGCTCGCC CCGCAACAAG TCTTTCGTTA CAACAATCAT AGCGACCCAG GTACGCTCGA CCCGCAAAAG GTGGAGGAGA ATACTGCCGC GCAGATTGTG CTGGATCTGT TTGAAGGTCT GGTATGGATG GACGGTGAAG GCCAGGTGCA GCCCGCTCAG GCTGAACGCT GGGAGATACT GGACGGCGGC AGGCGCTATA TTTTCCATCT GCGTAGCGGT TTGCAGTGGT CCGACGGTCA GCCTCTGACG GCAGAGGATT TTGTCCTCGG CTGGCAGCGC GCGGTTGACC CGAAAACGGC AAGCCCTTTT GCTGGCTATC TGGCACAGGC GCACATTAAC AATGCCGCGG CTATTGTTGC GGGTAAAGCA GATGTTACAT CGCTGGGTGT CAAAGCGACG GATGATCGTA CTCTTGAAGT TACGCTTGAG CAGCCGGTTC CTTGGTTCAC GACGATGCTC GCCTGGCCGA CGCTGTTCCC GGTTCCTCAT CATGTCATCG CTAAACATGG CGATAGCTGG AGTAAGCCAG AGAACATGGT TTACAACGGT GCCTTTGTGC TTGATCAGTG GGTAGTTAAC GAAAAGATTA CTGCACGCAA AAATCCAAAG TACCGCGATG CGCAACATAC AGTATTGCAA CAGGTTGAGT ATCTGGCGCT GGATAATTCG GTCACCGGCT ATAACCGCTA TCGCGCGGGA GAGGTCGATC TCACCTGGGT TCCGGCGCAG CAAATTCCCG CCATTGAAAA ATCACTGCCT GGCGAGATAC GAATTATTCC GCGTCTGAAC AGCGAATATT ACAACTTCAA CCTTGAGAAA CCGCCATTTA ACGATGTGCG AGTGCGTCGG GCGCTATATC TTACGGTTGA TCGACAGCTT ATTGCGCAAA AGGTACTGGG GTTGAGAACG CCCGCAACCA CGCTGACGCC GCCAGAGGTA AAAGGCTTTA GCGCGACGAC GTTCGATGAA CTGCAAAAGC CAATGAGTGA GCGCGTCGCG ATGGCAAAAG CCTTGCTGAA ACAGGCGGGA TACGACGCCT CTCATCCGCT ACGCTTTGAG CTGTTCTACA ACAAGTACGA TCTGCATGAA AAGACCGCGA TAGCGTTGTC TTCCGAATGG AAAAAATGGC TGGGTGCACA GGTGACGCTG CGCACAATGG AGTGGAAAAC CTATCTTGAT GCCCGACGAG CCGGTGATTT CATGCTGTCT CGGCAGTCGT GGGATGCGAC GTACAATGAT GCTTCCAGCT TCCTGAACAC GCTCAAAAGC GATAGTGAAG AAAACGTCGG TCACTGGAAA AATGCGCAGT ATGACGCCTT ACTAAACCAG GCCATGCAGA TCACTGATGC GACAAAGCGT AATGCGTTGT ATCAGCAGGC AGAAGTGATC GTCAACCAGC AGGCACCGCT GATTCCTATC TACTATCAGC CGTTAATCAA ACTGCTTAAA CCCTACGTTG GCGGTTTTCC GCTGCATAAT CCCCAGGATT ATGTCTACAG CAAAGAGTTG TATATCAAGG CACATTGA
|
Protein sequence | MYTRNLLWLV SLVSAAPLYA ADVPTNTQLA PQQVFRYNNH SDPGTLDPQK VEENTAAQIV LDLFEGLVWM DGEGQVQPAQ AERWEILDGG RRYIFHLRSG LQWSDGQPLT AEDFVLGWQR AVDPKTASPF AGYLAQAHIN NAAAIVAGKA DVTSLGVKAT DDRTLEVTLE QPVPWFTTML AWPTLFPVPH HVIAKHGDSW SKPENMVYNG AFVLDQWVVN EKITARKNPK YRDAQHTVLQ QVEYLALDNS VTGYNRYRAG EVDLTWVPAQ QIPAIEKSLP GEIRIIPRLN SEYYNFNLEK PPFNDVRVRR ALYLTVDRQL IAQKVLGLRT PATTLTPPEV KGFSATTFDE LQKPMSERVA MAKALLKQAG YDASHPLRFE LFYNKYDLHE KTAIALSSEW KKWLGAQVTL RTMEWKTYLD ARRAGDFMLS RQSWDATYND ASSFLNTLKS DSEENVGHWK NAQYDALLNQ AMQITDATKR NALYQQAEVI VNQQAPLIPI YYQPLIKLLK PYVGGFPLHN PQDYVYSKEL YIKAH
|
| |