Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4477 |
Symbol | |
ID | 6969978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4145750 |
End bp | 4146745 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643388192 |
Product | peptidase, U32 family |
Protein accession | YP_002272629 |
Protein GI | 209398259 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCTGC TCTGCCCTGC CGGAAATCTC CCGGCGCTTA AGGCGGCCAT CGAAAACGGC GCAGATGCTG TTTATATCGG GCTAAAAGAT GATACCAATG CCCGTCACTT CGCCGGCCTT AACTTTACCG AGAAAAAATT GCAGGAAGCG GTGAGTTTTG TCCATCAACA TCGCCGCAAA CTTCACATCG CGATTAACAC TTTTGCGCAT CCGGACGGTT ACGCCCGTTG GCAGCGTGCC GTGGATATGG CGGCGCAGCT GGGTGCCGAC GCGCTGATCC TCGCCGACCT CGCCATGCTA GAGTACGCCG CCGAGCGTTA CCCGCATATT GAGCGTCATG TGTCAGTGCA GGCTTCGGCG ACCAATGAAG AGGCGATTAA CTTTTATCAT CGCCATTTTG ACGTTGCTCG CGTGGTGCTG CCGCGCGTGT TGTCGATTCA TCAGGTGAAA CAACTGGCAC GGGTCACACC TGTCCCGCTG GAAGTCTTTG CTTTCGGCAG CCTGTGCATT ATGTCGGAAG GTCGTTGCTA TCTGTCGTCG TATCTGACGG GTGAGTCGCC CAACACCGTG GGCGCGTGTT CTCCGGCCCG TTTCGTGCGC TGGCAGCAAA CGCCGCAGGG GCTGGAATCC CGCCTGAACG AAGTGCTGAT CGACCGTTAT CAGGACGGCG AAAACGCAGG TTATCCGACG CTATGTAAAG GGCGTTATCT GGTGGACGGC GAGCGCTATC ACGCGCTGGA AGAACCAACC AGTCTCAATA CCCTGGAACT GCTGCCGGAG TTAATGGCGG CGAATATTGC TTCGGTGAAA ATTGAAGGCC GCCAACGTAG CCCGGCGTAT GTCAGCCAGG TGGCGAAAGT CTGGCGTCAG GCTATCGACC GTTGTAAGGC CGATCCGCAA AACTTCGTAC CGCAAAGCGC GTGGATGGAG ACGCTCGGGT CGATGTCCGA AGGCACGCAG ACCACCCTTG GCGCGTATCA CCGTAAATGG CAGTGA
|
Protein sequence | MELLCPAGNL PALKAAIENG ADAVYIGLKD DTNARHFAGL NFTEKKLQEA VSFVHQHRRK LHIAINTFAH PDGYARWQRA VDMAAQLGAD ALILADLAML EYAAERYPHI ERHVSVQASA TNEEAINFYH RHFDVARVVL PRVLSIHQVK QLARVTPVPL EVFAFGSLCI MSEGRCYLSS YLTGESPNTV GACSPARFVR WQQTPQGLES RLNEVLIDRY QDGENAGYPT LCKGRYLVDG ERYHALEEPT SLNTLELLPE LMAANIASVK IEGRQRSPAY VSQVAKVWRQ AIDRCKADPQ NFVPQSAWME TLGSMSEGTQ TTLGAYHRKW Q
|
| |