Gene Information |
Locus tag | EcSMS35_0951 |
Symbol | |
ID | 6146879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 963760 |
End bp | 964929 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641615838 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001743030 |
Protein GI | 170681140 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
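
The coordinates and lengths in this table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(963760, 964929)
print(length)                  # 1170
print(protein_length(length))  # 389
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed above.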
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGAAAT TAATCGAACT CCGCCAGCAA AAAAACGCCC TGAAAAACCA GATGCGATCC CTGCTGGAAA AAGCCGACAG TGAAAACCGT AGTCTGAACG CTGAAGAAGG CAAACAGTTT GATGAACTGC GTGCAAAAGC TGATGCCCTC GACACAGAAA TTTCCCGCCT CGAGTCTGTG GCTGATGAAG AACGCAGCAA GCCAGGAACG GGCATCCAGA AATTATCATC TGATGAATTG CGTAACTACA TCGTAACCGG AGATGTGCGA TCACTGTCCA CCAGCACTGA CAGCGGCAGG GATGGCGGAT ATACCGTAAT TCCTGAGCTT GATCGCGAAG TCATGCGCCA GCTACAGGAT GACAGTGTTA TGCGCGTGAT CGCGACCGTG AAGACCGCAA AATCAAATGA GTTTCAGAAA CTGGTTTCCA CTGGCGGCGC AACTGTAGGA CGAGGCACAG AAGGCAGCGC ACGCAGTGAA ACCAACACCC CGAAAATTGA ACGCGTAACC ATCAAGTTGA ATCCGATCTA CGCCTACCCG AAAACCACGC AGGAAATCCT GGATTTTTCA GAGGTGGATA TTCTGGGCTG GTTATCCTCC GAAATTGCCG ACACGTTCGC CAGCACCGAA GAGGATGATT TTGTTAATGG CGACGGTAAC GGCAAGCCGA AAGGCTTCAT GGCTTACACC CGTGCGGCGA CCAGTGACAA AACCCGCGCT TTTGGCACCA TTGAAAAAAT AGTAGCGGCA AGTGGAACCG CCATTACAGC GGACGAACTG ATCGACATTC TCTACAAGCT GAAAGCGAAA TACCGCAAAA ATGCCGTCTG GGTGATGAAC TCGGGCACGG CAGGGACACT ACAGAAGCTG AAAAATGAGA ACGGCGATTA TATCTGGCGC GACAGCCTTA AAGAAGGTGC GCCGGATATG TTGCTTGGTC GTCCTGTTTA CTGCCTGGAG TCCATGCCGG ACATCGGCGC AGGAAAAGCA CCGCTAGCGG TTGGCGATTT CAGTCGTGGT TATTTCATCG TTGATCATGT AACAGGGATT CGCACCCGAC CGGACAACAT TACTGAACCC GGATTCTACA AGGTCCACAC GGATAAATAT CTGGGCGGTG GTGTGGTGGA TTCAAACGCC ATCAAAATTC TGGAAATGAA AGCTGGCTAG
Protein sequence | MKKLIELRQQ KNALKNQMRS LLEKADSENR SLNAEEGKQF DELRAKADAL DTEISRLESV ADEERSKPGT GIQKLSSDEL RNYIVTGDVR SLSTSTDSGR DGGYTVIPEL DREVMRQLQD DSVMRVIATV KTAKSNEFQK LVSTGGATVG RGTEGSARSE TNTPKIERVT IKLNPIYAYP KTTQEILDFS EVDILGWLSS EIADTFASTE EDDFVNGDGN GKPKGFMAYT RAATSDKTRA FGTIEKIVAA SGTAITADEL IDILYKLKAK YRKNAVWVMN SGTAGTLQKL KNENGDYIWR DSLKEGAPDM LLGRPVYCLE SMPDIGAGKA PLAVGDFSRG YFIVDHVTGI RTRPDNITEP GFYKVHTDKY LGGGVVDSNA IKILEMKAG
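
The sequences above are printed in space-separated blocks of 10 characters, so the spaces need to be stripped before any analysis. The sketch below is one way to do that, with hypothetical helper names (`clean`, `gc_content`, `ends_with_stop`). It assumes the stop codons of translation table 11 are TAA, TAG and TGA, and it checks only the first and last blocks of the gene sequence rather than the full string.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return clean(seq)[-3:] in STOP_CODONS


# First and last 10-base blocks of the gene sequence listed above.
first_block = "ATGAAGAAAT"
last_block = "AGCTGGCTAG"
print(clean(first_block).startswith("ATG"))  # True
print(ends_with_stop(last_block))            # True
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.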