Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2284 |
Symbol | |
ID | 6147268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2309786 |
End bp | 2310778 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617157 |
Product | hypothetical protein |
Protein accession | YP_001744330 |
Protein GI | 170680310 |
COG category | [S] Function unknown |
COG ID | [COG2990] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00241499 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00902859 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGAAAT CGACGTCATG TATAACCATT GATTTCATGA ATATGTCGCA GCTAACTGAA CGGACCTTTA CGCCATCTGA ATCTCTCAGC AGCCTGTCAC TTTTTCTTAG TCTGGCACGT GGACAGTGTC GGCCGGGTAA ATTCTGGCAT CGCCGTAGTT TTCGCCAGAA ATTTTTGCTG CGCTCGTTGA TTATGCCGCG TTTAAGCGTT GAGTGGATGA ACGAACTTTC CCACTGGCCT AATCTCAATG TGTTGTTAAC GCGCCAGCCG CGACTGCCTG TGCGTCTGCA TCGCCCTTAC CTTGCGGCGA ATCTTAGCCG TAAGCAATTG CTGGAGGCGT TACGTTACCA TTATGCGTTA CTCCGCGGGT GTATGTCGGC GGAAGAATTC AGCTTATATT TGAATACCCC CGGGCTGCAA CTGGCGAAGC TGGAAGGCAA AAACGGCGAG CAGTTCACAC TTGAGCTGAC CATGATGATC TCAATGGATA AAGAAGGTGA CAGCACAATC CTGTTTCGCA ACAGCGAAGG TATTCCTCTG GCAGAGATCA CGTTTACCCT GTGTGAATAT CAGGGGAAAA GAACAATGTT TATTGGCGGA CTGCAAGGCG CAAAATGGGA AATCCCACAT CAGGAAATCC AGAATGCGAC GAAAGCCTGC CACGGGCTAT TTCCCAAACG CCTCGTGATG GAAGCGGCCT GTCTGTTTGC CCAACGTTTG CAGGTAGAGC AGATTATTGC CGTCAGCAAT GAAACGCATA TTTACCGCAG CCTGCGTTAT CGCGATAAAG AAGGCAAGAT CCACGCTGAT TACAACGCTT TCTGGGAATC GGTTGGCGGC GTATGTGATG CTGAACGCCA TTACCGCCTT CCAGCACAGA TAGCACGAAA AGAGATTGCC GAAATCGCCA GTAAAAAACG GGCTGAATAC CGTCGGCGCT ATGAGATGCT CGACGCTATT CAGCCACAAA TGGCCACGAT GTTTCGCGGT TAA
|
Protein sequence | MVKSTSCITI DFMNMSQLTE RTFTPSESLS SLSLFLSLAR GQCRPGKFWH RRSFRQKFLL RSLIMPRLSV EWMNELSHWP NLNVLLTRQP RLPVRLHRPY LAANLSRKQL LEALRYHYAL LRGCMSAEEF SLYLNTPGLQ LAKLEGKNGE QFTLELTMMI SMDKEGDSTI LFRNSEGIPL AEITFTLCEY QGKRTMFIGG LQGAKWEIPH QEIQNATKAC HGLFPKRLVM EAACLFAQRL QVEQIIAVSN ETHIYRSLRY RDKEGKIHAD YNAFWESVGG VCDAERHYRL PAQIARKEIA EIASKKRAEY RRRYEMLDAI QPQMATMFRG
|
| |