Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0957 |
Symbol | |
ID | 6144497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 968124 |
End bp | 969785 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615844 |
Product | putative phage terminase, large subunit |
Protein accession | YP_001743036 |
Protein GI | 170683362 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCGCCT GGCATGAGTA CGCAGAAGGC GTAAAAAACG GCAAAATTAC GGCCTGTAAA CGACTGAAAC AGGCTGTTAA ACGGTATTTT TCTGACCTTG AAAACTCCCT TTACACGTTC GATCCGGAGG TCGTGGAGCG GTTTATTGCC TTTTCCCGGG TGTGCCCGCA CGTAAAAGGC GCAATGCGCG GTAGCCCCAT TGAACTTGAG CCGTGGCAGC AGTTCGCCTT TGCCTGCATC CTGGGATTTA AGGTTAAGGC CACCGGACGG CGCAAATACA CCAGCGCATT CATTGAAGTA CCGCGAAAAA ATGCCAAATC CACGGTCGCC GCTATCCTGG CTAACTGGTT TCTGGTTATG GAAAACGGGC AGCAGGATAT TTACACCGCC GCCGTGAGTC GTGATCAGGC GCGGATCGTG TTTGATGATG CGCGTCAGAT GTGCCTTTTA TCCCGACCGT TACGAAAGCG GGTAAATATT CAGGCGCACA AGGTGATACA CCCGAAAACC AACAGCCTGT TAAAGCCACT GGCAGCAAAA GCGGCAACCA TTGAGGGGAC AAACCCGAGT CTTGCCATTG TGGATGAATA TCACCTGCAC CCAGACAACG GGGTTTATTC CGCACTTGAA CTGGGGATGG GGGCGCGTCC GGAGGGGCTG TTATTTGCCA TCACCACATC GGGGAGCAAC GTTGTTTCAG CCTGTAAACA ACACTACGAC TATTGCTGCC AGATACTGGA TGGTGAAGAG GTGAACGAAT CCATGTTCGT ACTGATTTAC GAGCTGGATG ATGAAAGCGA GGTTGACGAT CCGGCGATGT GGATAAAGGC GAATCCCAAT ATCGATGTTT CCGTCGATCG TGAAAAACTG GCCTCAACCA TCCAGAAAGC GCGGGGTATT CCGTCGCAGT GGGTGGAAAT GCTCACCAAG CGATTCAATA TCTGGTGTCA GGGGGCTACG CCGTGGATGG GTAACGGTGC ATGGGCGGAG TGCGCCGGAA CGTTCGCTGA GGCGGATTTA TACGGGCAGG AGTGCTATGC GGGGCTGGAC TTATCATCAA CCAGCGATAT TTCCAGCGTG TGCTATGCCT TTCCGGTCGG TAAAAAGATT ATGCTGGTTT CACGTCACTA TCTACCGGAA TTTCAGCTAC AGAACCCTGC CAATAAAAAC CGCGCCATCT ATCGCCAGTG GGCAAAGGCG GGCTGGATAC GCACAACACC GGGTGACTGC ATTGATTATG ACCGTATCCG TGATGACATC ATGGCGGATG CAGAGAATTT CAATATCAGG CTGGTGGGTT TCGATACATG GAACGCCACG CACCTGAGGA CGCAGCTACA GGGAGCGGGA TTTGAGGTGG AGCCGTTCCC GCAAACGTAC CTTCGTTTTA GTCCGGCGGC GAAATCGTTC GAAGTTTTTG TTAACCGGAA GGTGATTGTT CATCGTGGTG ATCCGGTGCT GGCCTGGTCA ATGAGTAATG TTGTGATGCA GAGTGACGCG AACGCCAATA TCAAGCCGAA CAAGAAAAAA TCATCCAATA AGATAGACCC GAGCGTTGCG GCGCTGATGG CGTTTGGCAC ATTCCAGGCT GAGCATGAGG AATTTGCATT TGATATGAGC GACAGCCAGA AAGAGCGACT TGCGGCATTT GATGGGGTAT GA
|
Protein sequence | MTAWHEYAEG VKNGKITACK RLKQAVKRYF SDLENSLYTF DPEVVERFIA FSRVCPHVKG AMRGSPIELE PWQQFAFACI LGFKVKATGR RKYTSAFIEV PRKNAKSTVA AILANWFLVM ENGQQDIYTA AVSRDQARIV FDDARQMCLL SRPLRKRVNI QAHKVIHPKT NSLLKPLAAK AATIEGTNPS LAIVDEYHLH PDNGVYSALE LGMGARPEGL LFAITTSGSN VVSACKQHYD YCCQILDGEE VNESMFVLIY ELDDESEVDD PAMWIKANPN IDVSVDREKL ASTIQKARGI PSQWVEMLTK RFNIWCQGAT PWMGNGAWAE CAGTFAEADL YGQECYAGLD LSSTSDISSV CYAFPVGKKI MLVSRHYLPE FQLQNPANKN RAIYRQWAKA GWIRTTPGDC IDYDRIRDDI MADAENFNIR LVGFDTWNAT HLRTQLQGAG FEVEPFPQTY LRFSPAAKSF EVFVNRKVIV HRGDPVLAWS MSNVVMQSDA NANIKPNKKK SSNKIDPSVA ALMAFGTFQA EHEEFAFDMS DSQKERLAAF DGV
|
| |