Gene EcSMS35_0957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0957 
Symbol 
ID6144497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp968124 
End bp969785 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content52% 
IMG OID641615844 
Productputative phage terminase, large subunit 
Protein accessionYP_001743036 
Protein GI170683362 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCGCCT GGCATGAGTA CGCAGAAGGC GTAAAAAACG GCAAAATTAC GGCCTGTAAA 
CGACTGAAAC AGGCTGTTAA ACGGTATTTT TCTGACCTTG AAAACTCCCT TTACACGTTC
GATCCGGAGG TCGTGGAGCG GTTTATTGCC TTTTCCCGGG TGTGCCCGCA CGTAAAAGGC
GCAATGCGCG GTAGCCCCAT TGAACTTGAG CCGTGGCAGC AGTTCGCCTT TGCCTGCATC
CTGGGATTTA AGGTTAAGGC CACCGGACGG CGCAAATACA CCAGCGCATT CATTGAAGTA
CCGCGAAAAA ATGCCAAATC CACGGTCGCC GCTATCCTGG CTAACTGGTT TCTGGTTATG
GAAAACGGGC AGCAGGATAT TTACACCGCC GCCGTGAGTC GTGATCAGGC GCGGATCGTG
TTTGATGATG CGCGTCAGAT GTGCCTTTTA TCCCGACCGT TACGAAAGCG GGTAAATATT
CAGGCGCACA AGGTGATACA CCCGAAAACC AACAGCCTGT TAAAGCCACT GGCAGCAAAA
GCGGCAACCA TTGAGGGGAC AAACCCGAGT CTTGCCATTG TGGATGAATA TCACCTGCAC
CCAGACAACG GGGTTTATTC CGCACTTGAA CTGGGGATGG GGGCGCGTCC GGAGGGGCTG
TTATTTGCCA TCACCACATC GGGGAGCAAC GTTGTTTCAG CCTGTAAACA ACACTACGAC
TATTGCTGCC AGATACTGGA TGGTGAAGAG GTGAACGAAT CCATGTTCGT ACTGATTTAC
GAGCTGGATG ATGAAAGCGA GGTTGACGAT CCGGCGATGT GGATAAAGGC GAATCCCAAT
ATCGATGTTT CCGTCGATCG TGAAAAACTG GCCTCAACCA TCCAGAAAGC GCGGGGTATT
CCGTCGCAGT GGGTGGAAAT GCTCACCAAG CGATTCAATA TCTGGTGTCA GGGGGCTACG
CCGTGGATGG GTAACGGTGC ATGGGCGGAG TGCGCCGGAA CGTTCGCTGA GGCGGATTTA
TACGGGCAGG AGTGCTATGC GGGGCTGGAC TTATCATCAA CCAGCGATAT TTCCAGCGTG
TGCTATGCCT TTCCGGTCGG TAAAAAGATT ATGCTGGTTT CACGTCACTA TCTACCGGAA
TTTCAGCTAC AGAACCCTGC CAATAAAAAC CGCGCCATCT ATCGCCAGTG GGCAAAGGCG
GGCTGGATAC GCACAACACC GGGTGACTGC ATTGATTATG ACCGTATCCG TGATGACATC
ATGGCGGATG CAGAGAATTT CAATATCAGG CTGGTGGGTT TCGATACATG GAACGCCACG
CACCTGAGGA CGCAGCTACA GGGAGCGGGA TTTGAGGTGG AGCCGTTCCC GCAAACGTAC
CTTCGTTTTA GTCCGGCGGC GAAATCGTTC GAAGTTTTTG TTAACCGGAA GGTGATTGTT
CATCGTGGTG ATCCGGTGCT GGCCTGGTCA ATGAGTAATG TTGTGATGCA GAGTGACGCG
AACGCCAATA TCAAGCCGAA CAAGAAAAAA TCATCCAATA AGATAGACCC GAGCGTTGCG
GCGCTGATGG CGTTTGGCAC ATTCCAGGCT GAGCATGAGG AATTTGCATT TGATATGAGC
GACAGCCAGA AAGAGCGACT TGCGGCATTT GATGGGGTAT GA
 
Protein sequence
MTAWHEYAEG VKNGKITACK RLKQAVKRYF SDLENSLYTF DPEVVERFIA FSRVCPHVKG 
AMRGSPIELE PWQQFAFACI LGFKVKATGR RKYTSAFIEV PRKNAKSTVA AILANWFLVM
ENGQQDIYTA AVSRDQARIV FDDARQMCLL SRPLRKRVNI QAHKVIHPKT NSLLKPLAAK
AATIEGTNPS LAIVDEYHLH PDNGVYSALE LGMGARPEGL LFAITTSGSN VVSACKQHYD
YCCQILDGEE VNESMFVLIY ELDDESEVDD PAMWIKANPN IDVSVDREKL ASTIQKARGI
PSQWVEMLTK RFNIWCQGAT PWMGNGAWAE CAGTFAEADL YGQECYAGLD LSSTSDISSV
CYAFPVGKKI MLVSRHYLPE FQLQNPANKN RAIYRQWAKA GWIRTTPGDC IDYDRIRDDI
MADAENFNIR LVGFDTWNAT HLRTQLQGAG FEVEPFPQTY LRFSPAAKSF EVFVNRKVIV
HRGDPVLAWS MSNVVMQSDA NANIKPNKKK SSNKIDPSVA ALMAFGTFQA EHEEFAFDMS
DSQKERLAAF DGV