Gene EcSMS35_1248 details


Gene Information

Locus tag: EcSMS35_1248
Symbol:
ID: 6146726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1247021
End bp: 1248160
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 35%
IMG OID: 641616126
Product: hypothetical protein
Protein accession: YP_001743309
Protein GI: 170684152
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID:
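
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names cds_span and expected_protein_length are hypothetical and not part of any database API.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
span = cds_span(1247021, 1248160)
residues = expected_protein_length(span)

print(span == 1140)     # compare with the Gene Length field
print(residues == 379)  # compare with the Protein Length field
```

Under those assumptions both comparisons agree with the Gene Length and Protein Length fields.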


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.48386
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.670053
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCCAT TAAATGATCT TTCGTTAAAA ACTCAGTCGG TTCAATTAAA TAAAGTCACA 
TCTAATACTG AATCTACGAT AAAACAACAC GAGTTAGTAT CTGATGATGC AATCATAAAT
GAATTATCAA GTGAGTTAGT CAGTTGTCTT GGAAATGGTA AGCTTACACC AATTAGTGAA
GACAGCAACT TACTGAATAT GCTGTCTGAA TTTAAGTTAT TGAGAGAGCA ATGTTTCAGG
TGGGGTAATT ATACTCTATT GTTTGAAAAT TATGGGGCTT ATGATAAGAC GGGATCTATC
ACGATAGAAA AAAGTCAGGG GGAGGGGACT TTACCCATTC GGCATAAATT AGAGTTTATA
TCGACTAATA TTGCAGAGTT GCTGGACAAG TTAACCAAAA TTACAGATGC CAGACTTTGC
AAAGGTTTCA GTGACTGGGC TAGTTCAGTC AAAGAAGGCG CATCGAATGA CTTGAAAGAA
AATGTGGATA GAGCATTGGT GAGAATGTTT AAATGTGTTA AGCTCCACAG TAATGAACTT
AACTTATCAT ACCTTTTTTT GGGTTCTGTG CCGCCTCTTC CAGAGTGGAT TGAAATGCTT
AGCCTTATTC ATAATAAACT TGATTCAATA CAGGTGCCCG AATCATGCAA AGAATTAGAA
CTCGATTTCA ATAACCTTAC AGAATTTCCA CAAGTACCTG ATGGAATTAC CCTGATCTCC
GTAAATAATA ACCTGATATC GCATATTGAC TCATTTCCGC CAAAGGTTAA GAAAATTTTT
ATTGGTCACA ATAAGCTATC GGAAATACCA GCAATACCAG ACACCGCTAA GGTTTTTGAT
TGTAGTGAGA ATAATATAAA AGAAATTAGG TGGTTCCCCA AAAATCTGAA AGAAGTGCAT
ATTGAATATA ATAAGATTGA GGTTATTCCT GCGATACCTG GCAATTTAAA ATTACTTTTT
ATGGAATGTA ATCCTATTAA AGAGGCATTT TTAATGCCAT GGACTCTGAC AGGGATTTGC
TATGAAATAT CGCAGCGGAA ATATATTGTT ACGAACCCCG ATGATTATGA TAAATATTCC
GATATGGTTA AAAAGCATGT AATAGATGGT GAGGAATTCA TAATTAAATA TTATATGTAA
 
Protein sequence
MFPLNDLSLK TQSVQLNKVT SNTESTIKQH ELVSDDAIIN ELSSELVSCL GNGKLTPISE 
DSNLLNMLSE FKLLREQCFR WGNYTLLFEN YGAYDKTGSI TIEKSQGEGT LPIRHKLEFI
STNIAELLDK LTKITDARLC KGFSDWASSV KEGASNDLKE NVDRALVRMF KCVKLHSNEL
NLSYLFLGSV PPLPEWIEML SLIHNKLDSI QVPESCKELE LDFNNLTEFP QVPDGITLIS
VNNNLISHID SFPPKVKKIF IGHNKLSEIP AIPDTAKVFD CSENNIKEIR WFPKNLKEVH
IEYNKIEVIP AIPGNLKLLF MECNPIKEAF LMPWTLTGIC YEISQRKYIV TNPDDYDKYS
DMVKKHVIDG EEFIIKYYM