Gene Information |
Locus tag | EcSMS35_0977 |
Symbol | |
ID | 6146059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 987169 |
End bp | 988530 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615864 |
Product | U32 family peptidase |
Protein accession | YP_001743056 |
Protein GI | 170681214 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
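The coordinates and lengths in the record above are mutually consistent, which a little arithmetic confirms. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(987169, 988530)
print(length, protein_length(length))  # prints: 1362 453
```

Both values match the Gene Length (1362 bp) and Protein Length (453 aa) fields.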
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.51078 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAAC CGGAACTCCT TTCCCCGGCG GGAACGCTGA AAAATATGCG TTACGCTTTC GCTTATGGCG CAGATGCTGT TTATGCGGGC CAGCCGCGTT ACTCCCTGCG TGTGCGCAAC AACGAATTCA ACCACGAAAA TCTCCAGCTC GGCATCAATG AAGCCCACGC GCTGGGGAAA AAGTTTTATG TCGTGGTCAA CATTGCACCG CACAACGCCA AGCTGAAAAC CTTTATCCGT GACCTGAAAC CGGTGGTGGA GATGGGGCCG GATGCGCTGA TTATGTCCGA TCCAGGGCTG ATTATGCTGG TGCGCGAGCA CTTCCCTGAA ATGCCGATCC ACCTCTCGGT GCAGGCTAAC GCCGTGAACT GGGCGACGGT GAAATTCTGG CAGCAAATGG GACTGACCCG CGTGATCCTC TCTCGCGAGC TGTCGCTGGA AGAGATTGAA GAGATCCGTA ATCAGGTGCC GGATATGGAG ATCGAGATCT TCGTTCACGG TGCGCTGTGC ATGGCCTACT CCGGTCGCTG CCTGCTCTCT GGCTATATCA ACAAGCGCGA CCCGAACCAG GGCACCTGCA CCAACGCCTG CCGCTGGGAG TACAACGTCC AGGAAGGGAA AGAAGATGAC GTTGGCAACA TCGTGCACAA GTACGAGCCG ATTCCGGTGC AAAATGTTGA GCCGACGCTG GGCATCGGCG CGCCAACCGA CAAAGTGTTT ATGATCGAAG AAGCTCAGCG TCCGGGCGAG TATATGACTG CGTTTGAAGA TGAGCACGGC ACTTACATCA TGAACTCGAA AGATCTGCGC GCTATCGCCC ACGTTGAACG CCTGACCAAA ATGGGCGTGC ATTCGCTGAA AATCGAAGGT CGTACAAAAT CTTTCTACTA TTGCGCACGC ACCGCGCAGG TTTACCGTAA AGCTATCGAT GACGCAGCTG CGGGAAAACC ATTCGATACC AGCCTGCTGG AAACTCTGGA AGGTCTGGCG CATCGTGGCT ATACCGAAGG TTTCCTGCGT CGTCATACTC ACGACGATTA TCAGAACTAC GAATACGGTT ATTCAGTTTC TGACCGCCAG CAGTTTGTTG GTGAGTTTAC CGGTGAGCGC AAGGGCGACC TCGCGGCGGT AGCGGTGAAA AATAAATTCT CCGTTGGCGA CAGCCTTGAG CTGATGACGC CGCAAGGGAA CATTAACTTT ACCCTTGAGC ATATGGAAAA CGCCAAAGGC GAAGCAATGC CGGTCGCACC GGGCGATGGT TATACTGTGT GGATCCCGGT GCCGCAGGAT CTTGAGCTAA ATTACGCGCT GCTGATGCGT AATTTCTCCG GGGAAACCAC GCGTAACCCC CACGGTAAGT GA
|
Protein sequence | MFKPELLSPA GTLKNMRYAF AYGADAVYAG QPRYSLRVRN NEFNHENLQL GINEAHALGK KFYVVVNIAP HNAKLKTFIR DLKPVVEMGP DALIMSDPGL IMLVREHFPE MPIHLSVQAN AVNWATVKFW QQMGLTRVIL SRELSLEEIE EIRNQVPDME IEIFVHGALC MAYSGRCLLS GYINKRDPNQ GTCTNACRWE YNVQEGKEDD VGNIVHKYEP IPVQNVEPTL GIGAPTDKVF MIEEAQRPGE YMTAFEDEHG TYIMNSKDLR AIAHVERLTK MGVHSLKIEG RTKSFYYCAR TAQVYRKAID DAAAGKPFDT SLLETLEGLA HRGYTEGFLR RHTHDDYQNY EYGYSVSDRQ QFVGEFTGER KGDLAAVAVK NKFSVGDSLE LMTPQGNINF TLEHMENAKG EAMPVAPGDG YTVWIPVPQD LELNYALLMR NFSGETTRNP HGK
|
| |
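The protein sequence can be reproduced from the gene sequence by reading it codon by codon. This is a minimal sketch, assuming the gene sequence is already shown in coding orientation (it begins with ATG although the gene lies on the minus strand) and that the spaces are display grouping only. The codon assignments are those of the standard genetic code, which translation table 11 shares; `translate` is a hypothetical helper, not a database or library function.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above
print(translate("ATGTTTAAAC CGGAACTCCT TTCCCCGGCG"))  # prints: MFKPELLSPA
```

The output matches the first ten residues of the protein sequence. The final codons of the gene, CAC GGT AAG TGA, give H, G, K and a stop, which agrees with the protein ending in HGK.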