Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3616 |
Symbol | |
ID | 6967379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3336760 |
End bp | 3337797 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643387411 |
Product | exoaminopeptidase |
Protein accession | YP_002271870 |
Protein GI | 209395774 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.874424 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTAT CGCTATTGAA AGCGTTGAGC GAGGCAGATG CGATCGCCTC CTCGGAACAG GAAGTGCGGC AGATCCTGCT GGAAGAAGCG GATCGCCTGC AAAAAGAAGT GCGATTTGAT GGGCTGGGAT CGGTGCTGAT CCGCCTCAAT GAATCGACAG GTCCGAAGGT GATGATCTGT GCGCATATGG ACGAAGTGGG ATTTATGGTG CGCAGCATCT CCCGCGAAGG GGCGATTGAT GTGCTGCCGG TTGGCAATGT ACGCATGGCT GCCCGCCAGC TGCAGCCGGT GCGCATCACC ACCCGTGAAG AGTGCAAAAT TCCAGGCCTG CTTGACGGCG ACCGGCAGGG GAATGATGTC AGCGCCATGC GCGTGGATAT TGGCGCGCGC TCGTATGACG AAGTGATGCA GGCGGGAATT CGTCCAGGCG ATCGCGTCAC GTTCGATACC ACTTTTCAGG TTCTCCCCCA CCAGCGGGTG ATGGGGAAAG CCTTTGATGA CCGCCTCGGT TGCTACCTGC TGGTGACGTT ACTGCGCGAA CTACACAGCG CTGAACTGCC TGCGGAAGTG TGGCTGGTGG CCAGTTCCAG CGAAGAGGTG GGGTTACGCG GCGGGCAAAC TGCCACCCGC GCGGTGTCGC CGGACGTCGC CATTGTCCTT GATACCGCCT GCTGGGCGAA AAACTTTAAT TATGGCGCGG CTAACCATCG CCAGATTGGT AACGGCCCGA TGCTGGTGTT AAGCGACAAG TCACTGATTG CGCCGCCAAA ACTCACCGCC TGGATCGAAA CCGTGGCGGC AGAAATTGGC GTGCCGTTAC AGGCGGATAT GTTCAGTAAC GGCGGCACGG ACGGTGGAGC GGTGCACTTA ACCGGTACTG GCGTACCCAC AGTGGTGATG GGGCCTGCCA CCCGCCACGG ACATTGCGCC GCGTCGATTG CCGATTGCCG TGACATTTTG CAGATGGAGC AACTTTTATC TGCCCTTATT CAACGTCTTA CGCGTGAGAC GGTTGTTCAA CTGACGGATT TCAGATGA
|
Protein sequence | MDLSLLKALS EADAIASSEQ EVRQILLEEA DRLQKEVRFD GLGSVLIRLN ESTGPKVMIC AHMDEVGFMV RSISREGAID VLPVGNVRMA ARQLQPVRIT TREECKIPGL LDGDRQGNDV SAMRVDIGAR SYDEVMQAGI RPGDRVTFDT TFQVLPHQRV MGKAFDDRLG CYLLVTLLRE LHSAELPAEV WLVASSSEEV GLRGGQTATR AVSPDVAIVL DTACWAKNFN YGAANHRQIG NGPMLVLSDK SLIAPPKLTA WIETVAAEIG VPLQADMFSN GGTDGGAVHL TGTGVPTVVM GPATRHGHCA ASIADCRDIL QMEQLLSALI QRLTRETVVQ LTDFR
|
| |