Gene EcSMS35_4113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4113 
SymbolyieM 
ID6142921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4207951 
End bp4209402 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content53% 
IMG OID641618937 
Producthypothetical protein 
Protein accessionYP_001746075 
Protein GI170683973 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.218844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.154025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAACGC TGGATACGCT TAATGTGATG CTGGCCGTCA GCGAAGAGGG ATTGATCGAA 
GAGATGATCA TCGCGCTGCT GGCCTCACCG CAGCTGGCGG TCTTCTTTGA AAAATTCCCA
CGACTGAAGG CGGCTATCAC TGATGATGTT CCCCGCTGGC GTGAGGCGCT GCGCAGTCGG
CTGAAAGATG CCCGAGTCCC GCCGGAACTC ACCGAAGAGG TGATGTGCTA TCAGCAAAGC
CAGCTCCTCT CCACGCCGCA GTTTATTGTG CAGCTACCAC AGATCCTGGA CTTACTGCAT
CGTCTGAATT CCCCATGGGC AGAACAAGCC CGACAGTTGG TTGATGCTAA CAGCACGATC
ACTTCAGCGT TACACACGCT TTTTCTCCAG CGTTGGCGTT TAAGTCTGAT CGTGCAAGCA
ACGACGTTAA ATCAACAGCT ATTAGAAGAA GAACGCGAAC AACTGTTGAG TGAAGTTCAG
GAACGCATGA CGCAGAGCGG ACAACTTGAA CCGATTCTCG CAGATAACAA TACCGCAGCT
GGTCGTCTGT GGGATATGAG CGCCGGTCAG CTTAAACGTG GCGACTATCA GTTGATTGTG
AAATACGGTG AATTTCTGAA CGAACAGCCG GAACTGAAAC GCCTGGCAGA ACAGTTGGGG
CGTTCCCGGG AAGCCAAATC AATACCGCGC AACGATGCGC AGATGGAAAC CTTCCGCACC
CTGGTGCGCG AACCGGCGAC GGTTCCTGAG CAGGTTGATG GTCTGCAACA AAGCGATGAT
ATTTTACGTC TCCTGCCGCC AGAACTGGCG ACACTAGGGA TAACAGAACT GGAGTATGAG
TTTTACCGTC GGCTGGTGGA AAAACAGTTG CTCACCTATC GCCTGCACGG TGAGTCGTGG
CGTGAAAAAG TGATCGAACG CCCGGTGGTG CATAAAGATT ACGACGAACA GCCGCGCGGA
CCGTTTATTG TCTGTGTGGA TACTTCCGGC TCAATGGGCG GCTTTAATGA ACAGTGTGCG
AAAGCGTTCT GCCTGGCCTT GATGCGCATT GCTCTCGCTG AAAACCGGCG CTGCTATATT
ATGCTATTTT CCACCGAGAT CGTCCGTTAT GAGCTTTCAG GCCCACAAGG CATCGAACAG
GCAATCCGTT TTTTAAGCCA GCGTTTTCGT GGTGGTACTG ATCTTGCCAG TTGTTTTCGC
GCCATTATGG AACGATTGCA AAGCCGGGAA TGGTTTGACG CCGATGCGGT GGTGATTTCT
GATTTTATCG CCCAGCGGTT GCCTGACGAC GTGACGAGTA AAGTGAAAGA GTTGCAGCGG
GTACATCAGC ATCGCTTTCA TGCCGTGGCG ATGTCGGCAC ATGGCAAACC CGGCATCATG
CGCATTTTCG ATCATATCTG GCGCTTTGAT ACCGGGATGC GAAGCCGCCT GCTCAGACGC
TGGCGACGAT AA
 
Protein sequence
MLTLDTLNVM LAVSEEGLIE EMIIALLASP QLAVFFEKFP RLKAAITDDV PRWREALRSR 
LKDARVPPEL TEEVMCYQQS QLLSTPQFIV QLPQILDLLH RLNSPWAEQA RQLVDANSTI
TSALHTLFLQ RWRLSLIVQA TTLNQQLLEE EREQLLSEVQ ERMTQSGQLE PILADNNTAA
GRLWDMSAGQ LKRGDYQLIV KYGEFLNEQP ELKRLAEQLG RSREAKSIPR NDAQMETFRT
LVREPATVPE QVDGLQQSDD ILRLLPPELA TLGITELEYE FYRRLVEKQL LTYRLHGESW
REKVIERPVV HKDYDEQPRG PFIVCVDTSG SMGGFNEQCA KAFCLALMRI ALAENRRCYI
MLFSTEIVRY ELSGPQGIEQ AIRFLSQRFR GGTDLASCFR AIMERLQSRE WFDADAVVIS
DFIAQRLPDD VTSKVKELQR VHQHRFHAVA MSAHGKPGIM RIFDHIWRFD TGMRSRLLRR
WRR