Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2641 |
Symbol | |
ID | 6143586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2700421 |
End bp | 2701884 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617512 |
Product | M48 family peptidase |
Protein accession | YP_001744677 |
Protein GI | 170680376 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.938933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAGGC AGTTGAAAAA AAACCTGGTT GCAACCCTCA TTGCTGCTAT GACCATTGGT CAGGTAGCCC CGGCGTTTGC CGACAGCGCA GACACCTTGC CGGATATGGG AACCTCCGCA GGAAGCACGC TTTCCATTGG CCAGGAAATG CAGATGGGCG ACTATTATGT CCGCCAGCTA CGCGGCAGCG CGCCGTTAAT TAATGACCCG CTGTTAACGC AATATATTAA TTCGCTGGGG ATGCGTCTGG TTTCGCATGC CAATTCGGTT AAGACACCGT TTCATTTCTT TCTGATCAAC AACGACGAAA TTAACGCCTT TGCTTTCTTT GGCGGCAACG TGGTGCTGCA CTCTGCCCTG TTCCGTTATT CCGATAACGA AAGTCAACTG GCTTCAGTTA TGGCGCACGA AATCTCCCAC GTCACCCAAC GTCACCTGGC GCGAGCGATG GAAGATCAGC AGCGCAGCGC GCCGCTGACC TGGGTCGGCG CGTTAGGTTC TATTTTACTG GCGATGGCCA GTCCGCAGGC GGGGATGGCG GCGCTGACCG GTACACTGGC GGGAACGCGC CAGGGGATGA TCAGTTTCAC CCAGCAAAAT GAACAGGAAG CGGACCGCAT TGGTATTCAG GTGCTGCAAC GCTCGGGATT CGATCCGCAG GCGATGCCGA CCTTCCTCGA AAAATTACTC GATCAGGCGC GTTACTCCTC GCGCCCACCA GAAATTCTGC TCACTCACCC ACTACCGGAA AGCCGTCTGG CTGATGCCCG TAACCGTGCC AATCAGATGC GCCCGATGGT GGTGCAATCG TCAGAAGATT TCTATCTGGC AAAAGTGCGC ACACTGGGGA TGTATAATTC CGGACGTAAC CAGCTCACCA GTGATTTGCT GGATGAATGG GCGAAAGGAA ACGTTCGTCA GCAACGAGCG GCGCAATATG GTCGTGCTCT ACAGGCGATG GAAGCCAATA AATACGATGA GGCGCGTAAA ACGCTGCAAC CGTTACTGGC GGCAGAACCT GGTAACGCAT GGTATCTCGA TCTGGCTACC GATATCGATC TTGGGCAAAA CAAAGCCAAT GAGGCAATCA ATCGCCTGAA AAATGCCCGT GATTTGCGCA CCAATCCGGT GTTGCAGCTC AACCTGGCGA ACGCTTATCT GCAAGGCGGT CAACCACAAG AAGCGGCCAA TATTCTGAAT CGCTACACTT TTAATAATAA AGATGACAGC AACGGCTGGG ATTTACTGGC ACAGGCGGAA GCCGCGCTAA ATAACCGCGA TCAGGAACTG GCTGCGCGAG CAGAAGGTTA TGCGCTCGCC GGGCGACTCG ATCAGGCCAT TTCCTTGTTG AGTAGCGCCA GTTCGCAGGT GAAATTAGGC AGCCTGCAAC AAGCGCGTTA CGATGCGCGC ATCGACCAGT TGCGCCAGCT GCAGGAACGC TTTAAGCCTT ATACCAAGAT GTAA
|
Protein sequence | MFRQLKKNLV ATLIAAMTIG QVAPAFADSA DTLPDMGTSA GSTLSIGQEM QMGDYYVRQL RGSAPLINDP LLTQYINSLG MRLVSHANSV KTPFHFFLIN NDEINAFAFF GGNVVLHSAL FRYSDNESQL ASVMAHEISH VTQRHLARAM EDQQRSAPLT WVGALGSILL AMASPQAGMA ALTGTLAGTR QGMISFTQQN EQEADRIGIQ VLQRSGFDPQ AMPTFLEKLL DQARYSSRPP EILLTHPLPE SRLADARNRA NQMRPMVVQS SEDFYLAKVR TLGMYNSGRN QLTSDLLDEW AKGNVRQQRA AQYGRALQAM EANKYDEARK TLQPLLAAEP GNAWYLDLAT DIDLGQNKAN EAINRLKNAR DLRTNPVLQL NLANAYLQGG QPQEAANILN RYTFNNKDDS NGWDLLAQAE AALNNRDQEL AARAEGYALA GRLDQAISLL SSASSQVKLG SLQQARYDAR IDQLRQLQER FKPYTKM
|
| |