Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1201 |
Symbol | |
ID | 6147022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1204585 |
End bp | 1205328 |
Gene Length | 744 bp |
Protein Length | 247 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641616079 |
Product | tail assembly protein |
Protein accession | YP_001743262 |
Protein GI | 170679766 |
COG category | [R] General function prediction only |
COG ID | [COG1310] Predicted metal-dependent protease of the PAD1/JAB1 superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.000820398 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGAGA CAGAATCAGC GATTCTGGCG CACGCCCGGC GATGTGCGCC AGCGGAGTCG TGCGGCTTCG TGGTGAGAAC GCCGGAGGGG GTCAGATATT TTCCCTGCGT GAATATCTCC GGTGAGCCGG AGGCGTATTT CCGGATGTCG CCGGAGGACT GGCTGCGTGC ACAAATGCAG GGGGAGGTTG TGGCACTGGT CCACAGCCAC CCCGGTGGTC TGCCCTGGCT GAGTGAGGCT GACAGGCGGC TGCAGGTGCA GAGTGATTTG CCGTGGTGGC TGGTCTGCCG GGGGGCGATT CACAAGTTCC GCTGTGTGCC ACATCTTACC GGGCGGCGCT TTGAGCACGG GGTGACGGAC TGTTACACGC TGTTCCGGGA TGCATACCAT CTGGCGGAAA TTGAGATGCC GGATTTTTAT CGCGGGGATG ACTGGTGGCG TAACGGCCAG AATCTCTATC TTGAAAATAT GGAGGCGACT GGTTTTTACC GTGTCGCACT GACAGAGGCG CAGCCGGGCG ATGTGCTGCT GTGCTGTTTT GGTTCATCGG TGCCGAATCA TGCCGCCATT TACTGCGGCG ACGGCGAGCT GCTGCACCAT ATTCCTGAAC AACTGAGCAA ACGAGAGAGG TATACCGACA AATGGCAGCG ACGCACACAC TCCCTCTGGC GTCACCGGGC ATGGCACGCA TCTGCCTTTA CGGGAATTTA CAACGATTTG GCCGCCGCAT CGACCTTCGT GTGA
|
Protein sequence | MTETESAILA HARRCAPAES CGFVVRTPEG VRYFPCVNIS GEPEAYFRMS PEDWLRAQMQ GEVVALVHSH PGGLPWLSEA DRRLQVQSDL PWWLVCRGAI HKFRCVPHLT GRRFEHGVTD CYTLFRDAYH LAEIEMPDFY RGDDWWRNGQ NLYLENMEAT GFYRVALTEA QPGDVLLCCF GSSVPNHAAI YCGDGELLHH IPEQLSKRER YTDKWQRRTH SLWRHRAWHA SAFTGIYNDL AAASTFV
|
| |