Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4113 |
Symbol | yieM |
ID | 6142921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4207951 |
End bp | 4209402 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618937 |
Product | hypothetical protein |
Protein accession | YP_001746075 |
Protein GI | 170683973 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.218844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.154025 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAACGC TGGATACGCT TAATGTGATG CTGGCCGTCA GCGAAGAGGG ATTGATCGAA GAGATGATCA TCGCGCTGCT GGCCTCACCG CAGCTGGCGG TCTTCTTTGA AAAATTCCCA CGACTGAAGG CGGCTATCAC TGATGATGTT CCCCGCTGGC GTGAGGCGCT GCGCAGTCGG CTGAAAGATG CCCGAGTCCC GCCGGAACTC ACCGAAGAGG TGATGTGCTA TCAGCAAAGC CAGCTCCTCT CCACGCCGCA GTTTATTGTG CAGCTACCAC AGATCCTGGA CTTACTGCAT CGTCTGAATT CCCCATGGGC AGAACAAGCC CGACAGTTGG TTGATGCTAA CAGCACGATC ACTTCAGCGT TACACACGCT TTTTCTCCAG CGTTGGCGTT TAAGTCTGAT CGTGCAAGCA ACGACGTTAA ATCAACAGCT ATTAGAAGAA GAACGCGAAC AACTGTTGAG TGAAGTTCAG GAACGCATGA CGCAGAGCGG ACAACTTGAA CCGATTCTCG CAGATAACAA TACCGCAGCT GGTCGTCTGT GGGATATGAG CGCCGGTCAG CTTAAACGTG GCGACTATCA GTTGATTGTG AAATACGGTG AATTTCTGAA CGAACAGCCG GAACTGAAAC GCCTGGCAGA ACAGTTGGGG CGTTCCCGGG AAGCCAAATC AATACCGCGC AACGATGCGC AGATGGAAAC CTTCCGCACC CTGGTGCGCG AACCGGCGAC GGTTCCTGAG CAGGTTGATG GTCTGCAACA AAGCGATGAT ATTTTACGTC TCCTGCCGCC AGAACTGGCG ACACTAGGGA TAACAGAACT GGAGTATGAG TTTTACCGTC GGCTGGTGGA AAAACAGTTG CTCACCTATC GCCTGCACGG TGAGTCGTGG CGTGAAAAAG TGATCGAACG CCCGGTGGTG CATAAAGATT ACGACGAACA GCCGCGCGGA CCGTTTATTG TCTGTGTGGA TACTTCCGGC TCAATGGGCG GCTTTAATGA ACAGTGTGCG AAAGCGTTCT GCCTGGCCTT GATGCGCATT GCTCTCGCTG AAAACCGGCG CTGCTATATT ATGCTATTTT CCACCGAGAT CGTCCGTTAT GAGCTTTCAG GCCCACAAGG CATCGAACAG GCAATCCGTT TTTTAAGCCA GCGTTTTCGT GGTGGTACTG ATCTTGCCAG TTGTTTTCGC GCCATTATGG AACGATTGCA AAGCCGGGAA TGGTTTGACG CCGATGCGGT GGTGATTTCT GATTTTATCG CCCAGCGGTT GCCTGACGAC GTGACGAGTA AAGTGAAAGA GTTGCAGCGG GTACATCAGC ATCGCTTTCA TGCCGTGGCG ATGTCGGCAC ATGGCAAACC CGGCATCATG CGCATTTTCG ATCATATCTG GCGCTTTGAT ACCGGGATGC GAAGCCGCCT GCTCAGACGC TGGCGACGAT AA
|
Protein sequence | MLTLDTLNVM LAVSEEGLIE EMIIALLASP QLAVFFEKFP RLKAAITDDV PRWREALRSR LKDARVPPEL TEEVMCYQQS QLLSTPQFIV QLPQILDLLH RLNSPWAEQA RQLVDANSTI TSALHTLFLQ RWRLSLIVQA TTLNQQLLEE EREQLLSEVQ ERMTQSGQLE PILADNNTAA GRLWDMSAGQ LKRGDYQLIV KYGEFLNEQP ELKRLAEQLG RSREAKSIPR NDAQMETFRT LVREPATVPE QVDGLQQSDD ILRLLPPELA TLGITELEYE FYRRLVEKQL LTYRLHGESW REKVIERPVV HKDYDEQPRG PFIVCVDTSG SMGGFNEQCA KAFCLALMRI ALAENRRCYI MLFSTEIVRY ELSGPQGIEQ AIRFLSQRFR GGTDLASCFR AIMERLQSRE WFDADAVVIS DFIAQRLPDD VTSKVKELQR VHQHRFHAVA MSAHGKPGIM RIFDHIWRFD TGMRSRLLRR WRR
|
| |