Gene EcSMS35_0953 details


Gene Information

Locus tag: EcSMS35_0953
Symbol:
ID: 6145949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 965546
End bp: 966760
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 53%
IMG OID: 641615840
Product: HK97 family phage portal protein
Protein accession: YP_001743032
Protein GI: 170684127
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
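
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 965546
end_bp = 966760

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1215 404"
```

Both results agree with the Gene Length and Protein Length fields of the record.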


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGTGGC CTTTTAGTCG TAAAAAAAGC GAGCAGCGTA ACCTGTCCAT TGATGATTTT 
CTGGCGCTGT CCGGCGTACC GAATACCGGA TCCGGAGAAT ATGTTTCTGC CGGGACGGCT
GAATCATTGC CTGCAGTGAT GAATGCGGTT TCTGTCATCG CTGAGGCGGT GGCCACGATG
CCGTGTTATC TGTATCTGGT ACGTAATGAC AAGGGCAGGG AGGCGCGGGA ATGGCTGGAC
AGTCACCCAG TAGATATTCT GCTGAATGAG CAGCCTAATT CGTGCCAGAC ACCTTACCAG
TTTAAACGCA CAATGATGCG TCACTGCCTG CTGAACGGTA ACGCCTATGC GGTTATTGAG
TGGGGGCAGG ACGGGCAGCC AAAATCACTT CATCCTTATG CGCCGGGGTG TGTTGTACCG
GAACGCACAG GCGCACACAA ATACCGCTAT ACCATCACCG AACCCTGTAC AGGAACGGTG
CGCACGTATT TACAGGAAGA AGTTTTGCAT CTCCGCTATG CCTCGGATGA TGGCTTTCTG
GGGCGTTCCC CCGTCACGAT TTGCCGTGAG GCGCTGGGGC TTGGCCTTGC TCAACAGCGT
CACGGAGCCA GCATTATGAA AGATGGCATG ATGGCGGCAG GGATTATCAC GTCAGGCGAA
TGGCTGGACG GCGTGAAAGG TAAACAGGCA TTGGATGCTC TGGAACGCTA CAAGGGGGCG
AAAAATGCCG GAAAAACGCC AATCCTTGAA GGGGGCATGG ATTACAGGCA ACTGGGAATG
AGTAACCAGG ATGCGGAATG GCTGGCCTCC CGTCGCTTCT CCATTGAAGA CATCGCCCGC
ATGTTCAACG TATCGCCTAT TTTTCTGCAG GAATACAGCA ACAGCACCTA CAGCAATTTC
AGCGAGGCAA GCCGCGCGTT TCTGACCATG ACAATGCGCC CGTGGCTGGC GAACTTCGAA
CAGCAAATCA AGGCCGCTTT GCTGGTGGCT TCTCCCGTAC CTGGTACCCG TTATCTGGTT
GAGTTTGATT CAGCCGATTT ATTACGCGCC ACACCCACCG AACGTTATGC CACGTATGAG
AAAGGGATTA AGAACGGGAT CATGAATCCG AACGAAGCCC GTGAGCGTGA GGGTATGCCG
CCGCGTGAAG GTGGTGATGA GTTCAGCCAG GCATGGAAAC AGACTGTGGA AATTAAAGGG
AGAAAAGATG AGTGA
 
Protein sequence
MWWPFSRKKS EQRNLSIDDF LALSGVPNTG SGEYVSAGTA ESLPAVMNAV SVIAEAVATM 
PCYLYLVRND KGREAREWLD SHPVDILLNE QPNSCQTPYQ FKRTMMRHCL LNGNAYAVIE
WGQDGQPKSL HPYAPGCVVP ERTGAHKYRY TITEPCTGTV RTYLQEEVLH LRYASDDGFL
GRSPVTICRE ALGLGLAQQR HGASIMKDGM MAAGIITSGE WLDGVKGKQA LDALERYKGA
KNAGKTPILE GGMDYRQLGM SNQDAEWLAS RRFSIEDIAR MFNVSPIFLQ EYSNSTYSNF
SEASRAFLTM TMRPWLANFE QQIKAALLVA SPVPGTRYLV EFDSADLLRA TPTERYATYE
KGIKNGIMNP NEAREREGMP PREGGDEFSQ AWKQTVEIKG RKDE
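
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is one way to do that with the Python standard library only. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons). The helper names `clean`, `translate` and `gc_percent` are hypothetical and chosen for illustration. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
first_block = "ATGTGGTGGC CTTTTAGTCG TAAAAAAAGC"
last_block = "AGAAAAGATG AGTGA"

print(translate(first_block))  # prints "MWWPFSRKKS"
print(translate(last_block))   # prints "RKDE"
```

The two outputs match the first ten and last four residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.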