Gene Information |
Locus tag | EcSMS35_1704 |
Symbol | |
ID | 6145086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1708354 |
End bp | 1709610 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641616580 |
Product | hypothetical protein |
Protein accession | YP_001743758 |
Protein GI | 170683898 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
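The coordinate and length fields above are related by simple arithmetic. Assuming the Start bp and End bp values are inclusive, 1-based coordinates (an assumption that is consistent with the numbers in this record), the gene length is End - Start + 1, and for an unspliced CDS that ends in a stop codon the protein length is one third of the gene length minus one. A minimal Python sketch of that consistency check follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1708354, 1709610)
print(length)                  # 1257
print(protein_length(length))  # 418
```

Both results agree with the Gene Length (1257 bp) and Protein Length (418 aa) fields in the table.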
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.172293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCC CTTTAATATT TAATAAAATA AATCCACAAT CCATACAGCA ACATCCAGAA AAAAATGAAC TTAACTGGAT GCTCGAATTA AATCAATGGA AAGCAGAACG CATACTTACA GGTGAAATCC ATCGTCCGGA ATGCCGAAAC GAAGCCGCTA AAAGGATAAA TTGTGCTTTT TTGTCGAAAC AGAATGATAT TGATTTATCA GGACTTAATT TAACTACCCA ACCACCAGGG CTGCAAAACT TCACCTCTAT CAATCTTGAT AATAACCAAC TCACACATTT TGATACAACC ACCTACGATA GACTCGTAAA GCTTAGTCTG AATAGTAATG CTCTTGAGTC AATAAATTTT CCTCAAGGTA GAAATGTAAG CATTACACAT ATATCTATGA ATAATAATTC TCTCAGAAAT ATTGATATAG ATCGGCTTTC ATCAATTACT TATTTTAGTG CGGCACATAA TCAACTAGAG TTTGTGCAAT TAGAATGTTG CGAAAGGCTG CAGTACCTGA ATCTCAGCCA CAATCAATTA ACTGATATTG TGGCAGAAAA TAAAGATGAA CTTTTACTAC TGGATCTATC CCATAATAAA CTAACAAGTT TACATAATGC CTTATTCCCC AACTTGAATA CGTTACTTAT CAACAACAAC TTGCTTTCTG AAATTAAAAT ATTCTTTAGC AACTTCTGCA ATGTTCAGAC ATTAAACGCT GCGAACAATC AGTTGGAAAA AATAAACCTT CATTTCCTGA CTTATCTTTC ATCTATCAAA AGTTTAAGGC TGGACAATAA TAAAATAACC CGCATTGACA CTAAGAATAC ATCCGATATT GGAACTTTAT TCCCCATAAT AAAACAGAGC AAAAACTTAA ATTTTTTAAA TGTTTCTGGG AAGAACAATT GCCCTACTAT GCAGCTCATG TTATTTAATT TATTTTCCCC AGCACTTAAG CTTAATACTG GCCTGGCAAT TCTTTCGCCT GGTGCATTTG AAGTTCACTC TGACGGAGTA GATGTGGATA ACGAATTGTT TCACTATACT ATTAAAGAAG CATATACCCC ATATAATATA CACACTTATA AAACAGAAGA AGTTGTAAAT CAGATGAATA TAAAAATTAA AAATATGACA TTAGATGAAA TAAACAATAC TTACTGTAAT AACGATTATT ACAATGAGGC GATAAGAGAG GAACCGATAG ACTTTCTGGG CAGATCGTTT TCCTCCAGCT CATGGCCTTT TCAGTGA
|
Protein sequence | MKFPLIFNKI NPQSIQQHPE KNELNWMLEL NQWKAERILT GEIHRPECRN EAAKRINCAF LSKQNDIDLS GLNLTTQPPG LQNFTSINLD NNQLTHFDTT TYDRLVKLSL NSNALESINF PQGRNVSITH ISMNNNSLRN IDIDRLSSIT YFSAAHNQLE FVQLECCERL QYLNLSHNQL TDIVAENKDE LLLLDLSHNK LTSLHNALFP NLNTLLINNN LLSEIKIFFS NFCNVQTLNA ANNQLEKINL HFLTYLSSIK SLRLDNNKIT RIDTKNTSDI GTLFPIIKQS KNLNFLNVSG KNNCPTMQLM LFNLFSPALK LNTGLAILSP GAFEVHSDGV DVDNELFHYT IKEAYTPYNI HTYKTEEVVN QMNIKIKNMT LDEINNTYCN NDYYNEAIRE EPIDFLGRSF SSSSWPFQ
|
| |
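The gene and protein sequences above are displayed in space-separated blocks of ten characters. The following Python sketch shows one way to strip that display whitespace and compute a GC fraction for a DNA string. The helper names are hypothetical, and the demo uses only the first three blocks of the gene sequence, so its GC value is not the whole-gene figure reported in the GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the display whitespace between blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record, used as a short demo.
demo = clean_sequence("ATGAAATTCC CTTTAATATT TAATAAAATA")
print(len(demo))          # 30
print(gc_fraction(demo))  # GC fraction of the demo fragment only
```

Passing the full Gene sequence field through `clean_sequence` should give a string whose length can be compared against the Gene Length field.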