Gene ECH74115_2978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2978 
SymbolwcaL 
ID6968361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2756122 
End bp2757342 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content54% 
IMG OID643386818 
Productcolanic acid biosynthesis glycosyl transferase WcaL 
Protein accessionYP_002271286 
Protein GI209399030 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0458092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.000111113 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGGTCG GCTTCTTCTT ATTGAAATTT CCGCTGTCGT CAGAAACCTT CGTTCTTAAC 
CAAATCACTG CGTTTATTGA TATGGGCTTT GAAGTGGAGA TTGTCGCGCT GCAAAAAGGC
GACACACAAA ACACCCACGT GGCATGGACT AAATACAACC TTGCCGCCAG AACCCGCTGG
TTACAGGACG AACCTACGGG CAAAGTGGCG AAACTGAGCC ACCGAGCCAG TCAGACCTTG
CGCGGCATTC ATCGTAAAAA TACCTGGCAG GCGCTCAACC TCAAACGCTA TGGTGCCGAG
TCGTGGAACC TGATTTTGTC TGCCATTTGC GGCCAGGTCG CAACACCGTT TCGCGCCGAT
GTGTTCATCG CTCATTTTGG CCCTGCGGGG GTAACCGCAG CAAAACTACG CGAACTGGGT
GTCATTCACG GCAAAATTGC CACTATTTTC CACGGCATTG ATATTTCCAG TCGGGAAGTG
CTCAACCATT ACACTCCCGA ATATCAACAA CTGTTTCGAC GTGGCGACCT GATGCTACCG
ATAAGCGATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GCCCGAGGGA AAAAATCGCC
GTATCGCGCA TGGGCGTAGA TATGACGCGC TTTAGCCCGC GTCCCGTGAA AGCGCCCGCA
ACACCGCTGG AGATTATTTC CGTCGCACGC TTAACCGAGA AAAAAGGTCT GCATGTGGCG
ATTGAAGCCT GCCGGCAGTT GAAAGAGCAG GGCGTGGCAT TTCGCTATCG CATCCTCGGC
ATTGGCCCGT GGGAACGACG CCTGCGCACG CTCATCGAAC AATATCAACT GGAAGATGTG
GTGGAGATGC CGGGCTTTAA ACCGAGCCAT GAAGTGAAAG CGATGCTCGA CGACGCGGAT
GTCTTCCTGT TGCCATCGGT TACAGGTGCG GATGGCGATA TGGAAGGCAT TCCGGTGGCG
CTGATGGAGG CGATGGCGGT CGGCATTCCG GTGGTTTCTA CTCTGCATAG CGGAATACCG
GAACTGGTGG AGGCTGACAA ATCCGGCTGG CTGGTGCCTG AGAACGATGC TCGCGCACTG
GCGCAACGAC TGGCGGCGTT TAGCCAACTG GACACCGACG AATTGGCTCC GGTCGTCAAA
CGCGCGCGCG AAAAAGTTGA ACACGATTTT AACCAGCAGG TGATCAATCG AGAACTCGCC
AGCTTGCTGC AGGCTTTATA G
 
Protein sequence
MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTQNTHVAWT KYNLAARTRW 
LQDEPTGKVA KLSHRASQTL RGIHRKNTWQ ALNLKRYGAE SWNLILSAIC GQVATPFRAD
VFIAHFGPAG VTAAKLRELG VIHGKIATIF HGIDISSREV LNHYTPEYQQ LFRRGDLMLP
ISDLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA
IEACRQLKEQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV VEMPGFKPSH EVKAMLDDAD
VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDARAL
AQRLAAFSQL DTDELAPVVK RAREKVEHDF NQQVINRELA SLLQAL