Gene Information |
Locus tag | EcSMS35_1189 |
Symbol | |
ID | 6146912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1195262 |
End bp | 1195912 |
Gene Length | 651 bp |
Protein Length | 216 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616067 |
Product | HK97 family phage prohead protease |
Protein accession | YP_001743250 |
Protein GI | 170679829 |
COG category | [R] General function prediction only |
COG ID | [COG3740] Phage head maturation protease |
TIGRFAM ID | [TIGR01543] phage prohead protease, HK97 family |
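The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the single terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and ends with one stop codon,
    which is not counted as a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1195262, 1195912)
print(length, protein_length(length))  # 651 216
```

Under these assumptions the result agrees with the Gene Length (651 bp) and Protein Length (216 aa) rows.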
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000000250589 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGACGA AACAACGTCT TGATGTACCG CTGAGTCTGA AATCTGTCAG TGACTCCGGT GAGTTTGAAG GGTATGGCTC CGTCTTTGGT GTAAAGGACA GCCACGATGA TGTGGTGATG TCCGGGGCAT TTGCTGCTTC CCTGCGGGCG TGGAGTGACA GAAAAGCGTT ACCTGCGCTG CTCTGGCAGC ACCGCATGGA TGAACCCATC GGTGTTTACA CCGAAATGAA GGAAGACGAT GTCGGGCTTT ACGTCAGGGG ACGGTTGCTT ATTGATGATG ATCCCCTCGC AAAACGCGCA CATGCACACA TGAAGGCCGG TTCGTTAACC GGCCTTTCTA TTGGGTACGT CCTGAAAGAC TGGGAATACG ACCGGAGCAA AGAAGCCTTT CTGCTGAAAG AAATCGACCT CTGGGAAGTC AGCCTGGTGA CGTTCCCGTC TAACGACGAG GCGCGGATCA GCGACGTCAA GAACGCACTG GCCCGCGGGG AAATCCCCGA ACAGAAAAAA ATCGAAAGAG TCCTGCGTGA TGTCGGACTC TCCCGTACCC AGGCCAAAGC ATTCATGGCC GGGGGCTATG GCGCACTGTC CCTGCGCGAC GCTGAGGATG TGGGCTCTGC ACTGAATGCA CTGAAAAATC TGAACTTCTA A
|
Protein sequence | MQTKQRLDVP LSLKSVSDSG EFEGYGSVFG VKDSHDDVVM SGAFAASLRA WSDRKALPAL LWQHRMDEPI GVYTEMKEDD VGLYVRGRLL IDDDPLAKRA HAHMKAGSLT GLSIGYVLKD WEYDRSKEAF LLKEIDLWEV SLVTFPSNDE ARISDVKNAL ARGEIPEQKK IERVLRDVGL SRTQAKAFMA GGYGALSLRD AEDVGSALNA LKNLNF
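The relationship between the two sequence rows, and the GC content field, can be illustrated with a short sketch. It assumes the space-separated 10-base blocks are purely cosmetic and that translation stops at the first stop codon; the helper names are hypothetical. The amino acid assignments of translation table 11 are the same as those of the standard code, so a standard codon table is used here.

```python
from itertools import product

# Amino acid assignments for translation table 11 (identical to the standard
# code), listed in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate("ATGCAGACGA AACAACGTCT TGATGTACCG"))  # MQTKQRLDVP
```

The translated prefix matches the first ten residues of the protein sequence row. Passing the full gene sequence to `gc_content` gives a figure that can be compared with the GC content row.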