Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0953 |
Symbol | |
ID | 6145949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 965546 |
End bp | 966760 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615840 |
Product | HK97 family phage portal protein |
Protein accession | YP_001743032 |
Protein GI | 170684127 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGTGGC CTTTTAGTCG TAAAAAAAGC GAGCAGCGTA ACCTGTCCAT TGATGATTTT CTGGCGCTGT CCGGCGTACC GAATACCGGA TCCGGAGAAT ATGTTTCTGC CGGGACGGCT GAATCATTGC CTGCAGTGAT GAATGCGGTT TCTGTCATCG CTGAGGCGGT GGCCACGATG CCGTGTTATC TGTATCTGGT ACGTAATGAC AAGGGCAGGG AGGCGCGGGA ATGGCTGGAC AGTCACCCAG TAGATATTCT GCTGAATGAG CAGCCTAATT CGTGCCAGAC ACCTTACCAG TTTAAACGCA CAATGATGCG TCACTGCCTG CTGAACGGTA ACGCCTATGC GGTTATTGAG TGGGGGCAGG ACGGGCAGCC AAAATCACTT CATCCTTATG CGCCGGGGTG TGTTGTACCG GAACGCACAG GCGCACACAA ATACCGCTAT ACCATCACCG AACCCTGTAC AGGAACGGTG CGCACGTATT TACAGGAAGA AGTTTTGCAT CTCCGCTATG CCTCGGATGA TGGCTTTCTG GGGCGTTCCC CCGTCACGAT TTGCCGTGAG GCGCTGGGGC TTGGCCTTGC TCAACAGCGT CACGGAGCCA GCATTATGAA AGATGGCATG ATGGCGGCAG GGATTATCAC GTCAGGCGAA TGGCTGGACG GCGTGAAAGG TAAACAGGCA TTGGATGCTC TGGAACGCTA CAAGGGGGCG AAAAATGCCG GAAAAACGCC AATCCTTGAA GGGGGCATGG ATTACAGGCA ACTGGGAATG AGTAACCAGG ATGCGGAATG GCTGGCCTCC CGTCGCTTCT CCATTGAAGA CATCGCCCGC ATGTTCAACG TATCGCCTAT TTTTCTGCAG GAATACAGCA ACAGCACCTA CAGCAATTTC AGCGAGGCAA GCCGCGCGTT TCTGACCATG ACAATGCGCC CGTGGCTGGC GAACTTCGAA CAGCAAATCA AGGCCGCTTT GCTGGTGGCT TCTCCCGTAC CTGGTACCCG TTATCTGGTT GAGTTTGATT CAGCCGATTT ATTACGCGCC ACACCCACCG AACGTTATGC CACGTATGAG AAAGGGATTA AGAACGGGAT CATGAATCCG AACGAAGCCC GTGAGCGTGA GGGTATGCCG CCGCGTGAAG GTGGTGATGA GTTCAGCCAG GCATGGAAAC AGACTGTGGA AATTAAAGGG AGAAAAGATG AGTGA
|
Protein sequence | MWWPFSRKKS EQRNLSIDDF LALSGVPNTG SGEYVSAGTA ESLPAVMNAV SVIAEAVATM PCYLYLVRND KGREAREWLD SHPVDILLNE QPNSCQTPYQ FKRTMMRHCL LNGNAYAVIE WGQDGQPKSL HPYAPGCVVP ERTGAHKYRY TITEPCTGTV RTYLQEEVLH LRYASDDGFL GRSPVTICRE ALGLGLAQQR HGASIMKDGM MAAGIITSGE WLDGVKGKQA LDALERYKGA KNAGKTPILE GGMDYRQLGM SNQDAEWLAS RRFSIEDIAR MFNVSPIFLQ EYSNSTYSNF SEASRAFLTM TMRPWLANFE QQIKAALLVA SPVPGTRYLV EFDSADLLRA TPTERYATYE KGIKNGIMNP NEAREREGMP PREGGDEFSQ AWKQTVEIKG RKDE
|
| |