Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3454 |
Symbol | |
ID | 6145982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3528240 |
End bp | 3529235 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618283 |
Product | U32 family peptidase |
Protein accession | YP_001745432 |
Protein GI | 170683910 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCTGC TCTGCCCTGC CGGAAATCTC CCGGCGCTTA AGGCGGCCAT CGAAAACGGC GCAGATGCTG TTTATATCGG GCTAAAAGAT GATACCAATG CCCGTCACTT CGCCGGCCTT AACTTTACCG AGAAAAAATT GCAGGAAGCG GTGAGTTTTG TCCATCAACA TCGCCGCAAA CTTCACATCG CGATTAACAC TTTTGCGCAT CCGGACGGTT ACGCCCGCTG GCAGCGCGCC GTGGATATGG CGGCGCAGCT GGGTGCCGAC GCGCTGATCC TCGCCGACCT CGCCATGCTG GAGTACGCCG CCGAGCGTTA CCCGCATATT GAGCGTCATG TGTCAGTGCA GGCTTCGGCG ACCAATGAAG AGGCGATTAA CTTTTATCAT CGCCATTTTG ACGTTGCCCG CGTGGTGCTG CCGCGCGTGT TGTCGATTCA TCAGGTGAAA CAACTGGCAC GGGTCACACC TGTACCACTG GAAGTCTTTG CTTTCGGCAG CCTGTGCATT ATGTCGGAAG GTCGTTGCTA TCTGTCGTCG TATCTGACGG GTGAGTCGCC CAACACTGTG GGTGCGTGTT CTCCGGCCCG TTTCGTGCGC TGGCAACAAA CGCCGCAGGG GCTGGAATCC CGCCTGAACG AAGTGCTGAT CGACCGTTAT CAGGACGGCG AAAACGCAGG TTATCCGACG CTGTGTAAAG GGCGTTATCT GGTGGACGGC GAGCGCTATC ACGCGCTGGA AGAACCCACC AGTCTCAATA CCCTGGAACT GCTGCCGGAG TTAATGGCAG CGAATATTGC TTCGGTGAAA ATTGAAGGCC GCCAGCGTAG CCCGGCGTAT GTCAGCCAGG TGGCGAAAGT CTGGCGTCAG GCTATCGACC GTTGTAAGGC CGATCCGCAA AACTTCATAC CGCAAAGCGC GTGGATGGAG ACGCTCGGGT CGATGTCCGA AGGCACGCAG ACCACCCTTG GCGCGTATCA CCGTAAATGG CAGTGA
|
Protein sequence | MELLCPAGNL PALKAAIENG ADAVYIGLKD DTNARHFAGL NFTEKKLQEA VSFVHQHRRK LHIAINTFAH PDGYARWQRA VDMAAQLGAD ALILADLAML EYAAERYPHI ERHVSVQASA TNEEAINFYH RHFDVARVVL PRVLSIHQVK QLARVTPVPL EVFAFGSLCI MSEGRCYLSS YLTGESPNTV GACSPARFVR WQQTPQGLES RLNEVLIDRY QDGENAGYPT LCKGRYLVDG ERYHALEEPT SLNTLELLPE LMAANIASVK IEGRQRSPAY VSQVAKVWRQ AIDRCKADPQ NFIPQSAWME TLGSMSEGTQ TTLGAYHRKW Q
|
| |