Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0923 |
Symbol | |
ID | 5594297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 927547 |
End bp | 928572 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640920093 |
Product | PBSX family phage portal protein |
Protein accession | YP_001457660 |
Protein GI | 157160342 |
COG category | [R] General function prediction only |
COG ID | [COG5518] Bacteriophage capsid portal protein |
TIGRFAM ID | [TIGR01540] phage portal protein, PBSX family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAAGA GTAAGAAGAA CCGCGCTGCG GCGACGAAAC AGATCCAGCT TAAAAGTCAA ACTACAGCCG AAGCATTCAG CTTCGGCGAT CCCGTTCCTG TTCTGGACCG CCGAGAACTG CTGGATTATG TGGAATGCGT ACAGATGGAC CGCTGGTATG AGCCGCCCGT CAGCTTTGAC GGACTGGCGC GCACCTTCCG CGCTGCCGTG CATCATAGTT CCCCGATTGC AGTAAAGTGC AACATTCTGA CCAGCACCTA CATCCCTCAC CCGCTGCTCA GCCAGCAGGC TTTTTCGCGT TTTGTGCAGG ACTATCTGGT ATTTGGTAAC GCCTACCTGG AGAAACGCAC GAACCGCTTC GGTGAAGTTA TCGCCCTTGA ACCTGCCCTG GCAAAATATA CCCGACGCGG GTTAGACCTG GATACCTACT GGTTTGTGCA ATACGGTATG ACCACGCAGC CATATCAGTT CACGAAAGGC AGCATCTTTC ATCTGATGGA ACCGGACATC AACCAGGAGA TCTACGGCCT GCCCGGTTAT CTTTCTGCCA TTCCGTCAGC CCTGCTCAAC GAGTCCGCCA CGCTGTTCCG CCGAAAGTAT TACATTAACG GCAGTCATGC TGGCTTCATC ATGTACATGA CCGATGCTGC GCAGAACCAG GAGGATGTGA ACAACCTCCG CAACGCAATG AAAAGCGCCA AAGGTCCAGG CAACTTCCGC AACCTGTTTA TGTACTCACC TAACGGCAAA AAGGATGGTC TTCAGATTAT CCCGTTGTCA GAAGTCGCGG CGAAGGATGA ATTTCTGAAT ATCAAAAATG TCAGCCGCGA CGACATGATG GCTGCGCACC GTGTACCGCC ACAAATGATG GGGATAATGC CTAATAATGT TGGGGGATTT GGGGATGTGG AGAAAGCCTG CAAAGTATTT GTTAGAAATG AGTTAACAGT ATTACAAAAA AAAATACTGG AACTGAACAC TTGGTTAGAT GATGATGTAA TTAATTTTAA TGAGTATATG CTTTGA
|
Protein sequence | MGKSKKNRAA ATKQIQLKSQ TTAEAFSFGD PVPVLDRREL LDYVECVQMD RWYEPPVSFD GLARTFRAAV HHSSPIAVKC NILTSTYIPH PLLSQQAFSR FVQDYLVFGN AYLEKRTNRF GEVIALEPAL AKYTRRGLDL DTYWFVQYGM TTQPYQFTKG SIFHLMEPDI NQEIYGLPGY LSAIPSALLN ESATLFRRKY YINGSHAGFI MYMTDAAQNQ EDVNNLRNAM KSAKGPGNFR NLFMYSPNGK KDGLQIIPLS EVAAKDEFLN IKNVSRDDMM AAHRVPPQMM GIMPNNVGGF GDVEKACKVF VRNELTVLQK KILELNTWLD DDVINFNEYM L
|
| |