Gene EcSMS35_1704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1704 
Symbol 
ID6145086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1708354 
End bp1709610 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content34% 
IMG OID641616580 
Producthypothetical protein 
Protein accessionYP_001743758 
Protein GI170683898 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.172293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCC CTTTAATATT TAATAAAATA AATCCACAAT CCATACAGCA ACATCCAGAA 
AAAAATGAAC TTAACTGGAT GCTCGAATTA AATCAATGGA AAGCAGAACG CATACTTACA
GGTGAAATCC ATCGTCCGGA ATGCCGAAAC GAAGCCGCTA AAAGGATAAA TTGTGCTTTT
TTGTCGAAAC AGAATGATAT TGATTTATCA GGACTTAATT TAACTACCCA ACCACCAGGG
CTGCAAAACT TCACCTCTAT CAATCTTGAT AATAACCAAC TCACACATTT TGATACAACC
ACCTACGATA GACTCGTAAA GCTTAGTCTG AATAGTAATG CTCTTGAGTC AATAAATTTT
CCTCAAGGTA GAAATGTAAG CATTACACAT ATATCTATGA ATAATAATTC TCTCAGAAAT
ATTGATATAG ATCGGCTTTC ATCAATTACT TATTTTAGTG CGGCACATAA TCAACTAGAG
TTTGTGCAAT TAGAATGTTG CGAAAGGCTG CAGTACCTGA ATCTCAGCCA CAATCAATTA
ACTGATATTG TGGCAGAAAA TAAAGATGAA CTTTTACTAC TGGATCTATC CCATAATAAA
CTAACAAGTT TACATAATGC CTTATTCCCC AACTTGAATA CGTTACTTAT CAACAACAAC
TTGCTTTCTG AAATTAAAAT ATTCTTTAGC AACTTCTGCA ATGTTCAGAC ATTAAACGCT
GCGAACAATC AGTTGGAAAA AATAAACCTT CATTTCCTGA CTTATCTTTC ATCTATCAAA
AGTTTAAGGC TGGACAATAA TAAAATAACC CGCATTGACA CTAAGAATAC ATCCGATATT
GGAACTTTAT TCCCCATAAT AAAACAGAGC AAAAACTTAA ATTTTTTAAA TGTTTCTGGG
AAGAACAATT GCCCTACTAT GCAGCTCATG TTATTTAATT TATTTTCCCC AGCACTTAAG
CTTAATACTG GCCTGGCAAT TCTTTCGCCT GGTGCATTTG AAGTTCACTC TGACGGAGTA
GATGTGGATA ACGAATTGTT TCACTATACT ATTAAAGAAG CATATACCCC ATATAATATA
CACACTTATA AAACAGAAGA AGTTGTAAAT CAGATGAATA TAAAAATTAA AAATATGACA
TTAGATGAAA TAAACAATAC TTACTGTAAT AACGATTATT ACAATGAGGC GATAAGAGAG
GAACCGATAG ACTTTCTGGG CAGATCGTTT TCCTCCAGCT CATGGCCTTT TCAGTGA
 
Protein sequence
MKFPLIFNKI NPQSIQQHPE KNELNWMLEL NQWKAERILT GEIHRPECRN EAAKRINCAF 
LSKQNDIDLS GLNLTTQPPG LQNFTSINLD NNQLTHFDTT TYDRLVKLSL NSNALESINF
PQGRNVSITH ISMNNNSLRN IDIDRLSSIT YFSAAHNQLE FVQLECCERL QYLNLSHNQL
TDIVAENKDE LLLLDLSHNK LTSLHNALFP NLNTLLINNN LLSEIKIFFS NFCNVQTLNA
ANNQLEKINL HFLTYLSSIK SLRLDNNKIT RIDTKNTSDI GTLFPIIKQS KNLNFLNVSG
KNNCPTMQLM LFNLFSPALK LNTGLAILSP GAFEVHSDGV DVDNELFHYT IKEAYTPYNI
HTYKTEEVVN QMNIKIKNMT LDEINNTYCN NDYYNEAIRE EPIDFLGRSF SSSSWPFQ