Gene Information |
Locus tag | EcSMS35_4322 |
Symbol | |
ID | 6147244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4419087 |
End bp | 4420121 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619143 |
Product | PBSX family phage portal protein |
Protein accession | YP_001746267 |
Protein GI | 170683575 |
COG category | [R] General function prediction only |
COG ID | [COG5518] Bacteriophage capsid portal protein |
TIGRFAM ID | [TIGR01540] phage portal protein, PBSX family |
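
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene span includes the stop codon; the record does not state either explicitly. The helper name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a stop codon inside the span.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values copied from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(4419087, 4420121)
print(gene_len, protein_len)
```

Under those assumptions the result agrees with the Gene Length (1035 bp) and Protein Length (344 aa) fields.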
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00346629 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAAGA AAAAAGGGAA AACACCGCAA CCTGCGGCAA AAAAAATGAC CGCCAGCGCC CCGAAAATGG AGGCATTCAC CTTTGGTGAG CCGGTGCCGG TACTCGACCG CCGTGACATT CTGGATTACG TCGAGTGCAT CAGTAACGGC AGATGGTATG AGCCACCAGT CAGCTTTACC GGTCTGGCAA AAAGCCTGCG TGCTGCCGTG CATCACAGCT CACCGATTTA CGTCAAACGT AACATTCTGG CCTCAACGTT TATCCCACAC CCGTGGCTTT CTCAGCAGGA TTTCAGCCGC TTTGTGCTGG ATTTTCTGGT ATTCGGTAAT GCGTTTCTGG AAAAGCGTTA CAGCACCACC GGTAAGGTCA TCAGACTGGA AACCTCACCG GCAAAATATA CCCGCCGTGG CGTGGAGGAG GATGTTTACT GGTGGGTGCC GTCCTTCAAC GAGCCGACAC CTTTCGCGCC CGGCTCCGTG TTTCACCTGC TGGAGCCGGA TATTAATCAG GAGCTGTACG GTCTGCCGGA ATATCTCAGC GCCCTTAACT CTGCCTGGCT GAATGAGTCG GCCACGCTGT TCCGCCGCAA GTATTACGAA AACGGCGCTC ATGCCGGATA TATCATGTAC GTCACTGATG CCGTGCAGGA TCGCAACGAT ATCGAAATGC TTCGCGAAAA CATGGTGAAG TCGAAAGGCC GCAACAACTT TAAAAACCTG TTTCTCTATG CCCCGCAGGG GAAAGCTGAC GGCATTAAAA TTATCCCGCT CAGTGAAGTG GCAACGAAGG ACGATTTTTT TAATATCAAA AAAGCCAGCG CCGCTGACCT GCTGGACGCG CACCGCATCC CCTTTCAGTT GATGGGCGGC AAGCCGGAGA ACGTCGGGTC GCTGGGTGAT ATTGAGAAAG TAGCAAAGGT CTTTGTCCGC AATGAGCTTA TCCCGTTACA GGACAGGATC CGCGAGATAA ACGGCTGGCT CGGTCAGGAG GTCATCCGAT TTAAAAACTA CTCACTGGAC ACTGACAACG GCTGA
|
Protein sequence | MSKKKGKTPQ PAAKKMTASA PKMEAFTFGE PVPVLDRRDI LDYVECISNG RWYEPPVSFT GLAKSLRAAV HHSSPIYVKR NILASTFIPH PWLSQQDFSR FVLDFLVFGN AFLEKRYSTT GKVIRLETSP AKYTRRGVEE DVYWWVPSFN EPTPFAPGSV FHLLEPDINQ ELYGLPEYLS ALNSAWLNES ATLFRRKYYE NGAHAGYIMY VTDAVQDRND IEMLRENMVK SKGRNNFKNL FLYAPQGKAD GIKIIPLSEV ATKDDFFNIK KASAADLLDA HRIPFQLMGG KPENVGSLGD IEKVAKVFVR NELIPLQDRI REINGWLGQE VIRFKNYSLD TDNG
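
As a rough illustration of how the gene and protein sequences relate, the sketch below strips the 10-character grouping, computes a GC fraction, and translates codons. It assumes the gene sequence is shown 5' to 3' in the coding orientation (the record lists the strand as "-", but the sequence begins with ATG). The codon-to-amino-acid mapping used is the standard one, which translation table 11 shares; table 11 differs in the start codons it permits, which this sketch does not model. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate complete codons; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 6 bases of the gene sequence in this record.
prefix = "ATGAGCAAGA AAAAAGGGAA AACACCGCAA"
suffix = "GGCTGA"
print(translate(prefix), translate(suffix), gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence, and the final codon TGA translates to a stop, which is why the protein is one residue shorter than the codon count. Running `gc_fraction` on the full gene sequence would give the figure to compare against the GC content field.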