Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_4249 |
Symbol | yieM |
ID | 6067963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4700385 |
End bp | 4701836 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641603686 |
Product | hypothetical protein |
Protein accession | YP_001727172 |
Protein GI | 170022218 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00907426 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTAACGC TGGATACGCT TAATGTGATG CTGGCCGTCA GCGAAGAGGG ATTGATCGAA GAGATGATCA TCGCGCTGCT GGCCTCACCG CAGCTGGCAG TCTTCTTTGA AAAATTCCCA CGCCTGAAGG CAGCAATCAC TGATGATGTT CCCCGCTGGC GTGAGGCGCT GCGCAGTCGG CTGAAAGATG CCCGAGTCCC GCCAGAACTC ACCGAAGAGG TGATGTGCTA TCAGCAAAGC CAGCTCCTCT CCACGCCGCA GTTTATTGTG CAGCTACCAC AGATCCTGGA CTTACTGCAT CGTCTGAATT CCCCATGGGC AGAACAAGCC CGACAGTTGG TTGATGCTAA CAGCGCGATC ACTTCAGCGT TACACACACT TTTTCTCCAG CGTTGGCGTT TAAGTCTGAT CGTGCAAGCA ACGACGTTAA ATCAACAGCT ATTAGAAGAA GAACGCGAAC AACTGTTAAG TGAAGTTCAG GAACGCATGA CGCTGAGCGG ACAACTTGAA CCGATTCTCG CAGATAACAA TACCGCAGCT GGTCGTCTGT GGGATATGAG CGCCGGCCAG CTTAAACGTG GCGACTATCA GTTGATTGTG AAATACGGTG AATTTCTTAA CGAACAGCCG GAACTGAAAC GCCTGGCAGA GCAGCTGGGG CGTTCTCGGG AAGCCAAATC AATACCGCGC AACGATGCGC AGATGGAAAC CTTCCGCACC ATGGTGCGCG AACCTGCGAC GGTTCCTGAG CAGGTTGATG GTCTGCAACA AAGCGATGAT ATTTTACGTC TCCTGCCGCC AGAACTGGCG ACACTAGGGA TAACGGAACT GGAGTATGAG TTTTACCGTC GGCTGGTGGA AAAACAGTTG CTCACCTATC GCCTGCACGG TGAGTCGTGG CGTGAAAAAG TGATCGAACG TCCGGTGGTA CATAAAGATT ACGATGAACA GCCGCGCGGG CCGTTTATTG TCTGTGTGGA TACTTCCGGC TCAATGGGCG GCTTTAATGA ACAGTGTGCG AAAGCGTTCT GCCTGGCCTT GATGCGCATT GCTCTCGCAG AAAACCGGCG CTGCTATATT ATGCTATTTT CCACCGAGAT CGTCCGTTAT GAGCTTTCAG GCCCACAAGG CATCGAACAA GCAATCCGTT TTTTAAGCCA GCAGTTTCGT GGCGGCACCG ATCTTGCCAG TTGTTTTCGC GCCATTATGG AACGCTTGCA AAGCAGGGAA TGGTTTGATG CCGATGCGGT GGTGATTTCT GATTTTATCG CTCAGCGGTT GCCTGACGAC GTGACGAGTA AAGTGAAAGA GCTGCAGCGG GTACATCAGC ATCGCTTTCA TGCCGTGGCG ATGTCGGCAC ACGGCAAACC CGGCATCATG CGCATTTTCG ATCATATCTG GCGCTTTGAT ACCGGGATGC GAAGCCGCCT GCTCAGACGC TGGCGGCGAT AA
|
Protein sequence | MLTLDTLNVM LAVSEEGLIE EMIIALLASP QLAVFFEKFP RLKAAITDDV PRWREALRSR LKDARVPPEL TEEVMCYQQS QLLSTPQFIV QLPQILDLLH RLNSPWAEQA RQLVDANSAI TSALHTLFLQ RWRLSLIVQA TTLNQQLLEE EREQLLSEVQ ERMTLSGQLE PILADNNTAA GRLWDMSAGQ LKRGDYQLIV KYGEFLNEQP ELKRLAEQLG RSREAKSIPR NDAQMETFRT MVREPATVPE QVDGLQQSDD ILRLLPPELA TLGITELEYE FYRRLVEKQL LTYRLHGESW REKVIERPVV HKDYDEQPRG PFIVCVDTSG SMGGFNEQCA KAFCLALMRI ALAENRRCYI MLFSTEIVRY ELSGPQGIEQ AIRFLSQQFR GGTDLASCFR AIMERLQSRE WFDADAVVIS DFIAQRLPDD VTSKVKELQR VHQHRFHAVA MSAHGKPGIM RIFDHIWRFD TGMRSRLLRR WRR
|
| |