Gene EcolC_4249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4249 
SymbolyieM 
ID6067963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4700385 
End bp4701836 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content53% 
IMG OID641603686 
Producthypothetical protein 
Protein accessionYP_001727172 
Protein GI170022218 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00907426 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTAACGC TGGATACGCT TAATGTGATG CTGGCCGTCA GCGAAGAGGG ATTGATCGAA 
GAGATGATCA TCGCGCTGCT GGCCTCACCG CAGCTGGCAG TCTTCTTTGA AAAATTCCCA
CGCCTGAAGG CAGCAATCAC TGATGATGTT CCCCGCTGGC GTGAGGCGCT GCGCAGTCGG
CTGAAAGATG CCCGAGTCCC GCCAGAACTC ACCGAAGAGG TGATGTGCTA TCAGCAAAGC
CAGCTCCTCT CCACGCCGCA GTTTATTGTG CAGCTACCAC AGATCCTGGA CTTACTGCAT
CGTCTGAATT CCCCATGGGC AGAACAAGCC CGACAGTTGG TTGATGCTAA CAGCGCGATC
ACTTCAGCGT TACACACACT TTTTCTCCAG CGTTGGCGTT TAAGTCTGAT CGTGCAAGCA
ACGACGTTAA ATCAACAGCT ATTAGAAGAA GAACGCGAAC AACTGTTAAG TGAAGTTCAG
GAACGCATGA CGCTGAGCGG ACAACTTGAA CCGATTCTCG CAGATAACAA TACCGCAGCT
GGTCGTCTGT GGGATATGAG CGCCGGCCAG CTTAAACGTG GCGACTATCA GTTGATTGTG
AAATACGGTG AATTTCTTAA CGAACAGCCG GAACTGAAAC GCCTGGCAGA GCAGCTGGGG
CGTTCTCGGG AAGCCAAATC AATACCGCGC AACGATGCGC AGATGGAAAC CTTCCGCACC
ATGGTGCGCG AACCTGCGAC GGTTCCTGAG CAGGTTGATG GTCTGCAACA AAGCGATGAT
ATTTTACGTC TCCTGCCGCC AGAACTGGCG ACACTAGGGA TAACGGAACT GGAGTATGAG
TTTTACCGTC GGCTGGTGGA AAAACAGTTG CTCACCTATC GCCTGCACGG TGAGTCGTGG
CGTGAAAAAG TGATCGAACG TCCGGTGGTA CATAAAGATT ACGATGAACA GCCGCGCGGG
CCGTTTATTG TCTGTGTGGA TACTTCCGGC TCAATGGGCG GCTTTAATGA ACAGTGTGCG
AAAGCGTTCT GCCTGGCCTT GATGCGCATT GCTCTCGCAG AAAACCGGCG CTGCTATATT
ATGCTATTTT CCACCGAGAT CGTCCGTTAT GAGCTTTCAG GCCCACAAGG CATCGAACAA
GCAATCCGTT TTTTAAGCCA GCAGTTTCGT GGCGGCACCG ATCTTGCCAG TTGTTTTCGC
GCCATTATGG AACGCTTGCA AAGCAGGGAA TGGTTTGATG CCGATGCGGT GGTGATTTCT
GATTTTATCG CTCAGCGGTT GCCTGACGAC GTGACGAGTA AAGTGAAAGA GCTGCAGCGG
GTACATCAGC ATCGCTTTCA TGCCGTGGCG ATGTCGGCAC ACGGCAAACC CGGCATCATG
CGCATTTTCG ATCATATCTG GCGCTTTGAT ACCGGGATGC GAAGCCGCCT GCTCAGACGC
TGGCGGCGAT AA
 
Protein sequence
MLTLDTLNVM LAVSEEGLIE EMIIALLASP QLAVFFEKFP RLKAAITDDV PRWREALRSR 
LKDARVPPEL TEEVMCYQQS QLLSTPQFIV QLPQILDLLH RLNSPWAEQA RQLVDANSAI
TSALHTLFLQ RWRLSLIVQA TTLNQQLLEE EREQLLSEVQ ERMTLSGQLE PILADNNTAA
GRLWDMSAGQ LKRGDYQLIV KYGEFLNEQP ELKRLAEQLG RSREAKSIPR NDAQMETFRT
MVREPATVPE QVDGLQQSDD ILRLLPPELA TLGITELEYE FYRRLVEKQL LTYRLHGESW
REKVIERPVV HKDYDEQPRG PFIVCVDTSG SMGGFNEQCA KAFCLALMRI ALAENRRCYI
MLFSTEIVRY ELSGPQGIEQ AIRFLSQQFR GGTDLASCFR AIMERLQSRE WFDADAVVIS
DFIAQRLPDD VTSKVKELQR VHQHRFHAVA MSAHGKPGIM RIFDHIWRFD TGMRSRLLRR
WRR