Gene ECH74115_2985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2985 
SymbolwcaI 
ID6967214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2764730 
End bp2765953 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content55% 
IMG OID643386825 
Productputative glycosyl transferase 
Protein accessionYP_002271293 
Protein GI209396871 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00135892 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATAC TGGTCTACGG CATTAACTAC TCGCCGGAGT TAACCGGCAT CGGCAAATAC 
ACCGGCGAGA TGGTGGAATG GCTGGCGGCG CAAGGTCATG AGGTGCGGGT TATTACCGCA
CCGCCTTACT ACCCGCAGTG GCAAGTGGGC GAGAACTATT CCGCCTGGCG GTACAAACGA
GAAGAGGGGG CCGCTACGGT GTGGCGCTGC CCGCTGTACG TGCCAAAACA GCCGAGCACC
CTGAAACGCC TGTTGCATCT CGGCAGTTTT GCCGTCAGCA GTTTTTTCCC GTTGATGGCG
CAACGTCGCT GGAAGCCGGA TCGCATTATC GGCGTAGTGC CAACGCTGTT TTGCACGCCG
GGAATGCGCC TGCTGGCGAA ACTCTCTGGC GCGCGTACCG TGCTGCATAT TCAGGATTAC
GAAGTAGATG CCATGCTGGG GCTGGGCCTT GCCGGAAAAG GCAAAGGCGG CAAAGTGGCA
CAGCTGGCAA CGGCGTTCGA ACGTAGCGGA CTGCATAACG TCGATAACGT TTCCACGATT
TCGCGTTCGA TGATGAATAA AGCCATCGAA AAAGGCGTGG CGGCGGAAAA CGTCATCTTC
TTCCCCAACT GGTCGGAAAT CGCCCGTTTT CAGCATGTTG CAGACGCCGA TGTTGATGCC
CTTCGTAACC AGCTTGGCCT GCCGGATAAC AAAAAAATCA TTCTTTACTC CGGCAATATT
GGTGAAAAGC AGGGGCTGGA AAACGTTATT GAAGCAGCCG ATCGCCTGCG CGATGAACCG
CTGATTTTTG CCATTGTCGG GCAGGGCGGC GGCAAAGCGC GGCTGGAAAA AATGGCGCAG
CAGCGTGGAC TGCGCAACAT GCAATTTTTC CCGCTGCAAT CGTATGACGC TTTACCCGCA
CTGCTGAAGA TGGGCGATTG CCATCTGGTG GTGCAAAAAC GCGGCGCGGC AGATGCCGTA
TTGCCGTCGA AACTGACCAA TATTCTGGCA GTAGGCGGTA ACGCGGTGAT TACTGCTGAA
GCCCACACAG AACTGGGACA GCTTTGCGAA ACCTTTCCGG GCATTGCGGT TTGCGTAGAA
CCGGAATCGG TTGAGGCGCT GGTGGCGGGG ATTCGTCAGG CGCTCCTGCT GCCCAAACAC
AACACGGTGG CACGTGAATA TGCCGAACGC ACGCTTGATA AAGAGAACGT GTTACGTCAA
TTTATAAATG ATATTCGGGG ATAA
 
Protein sequence
MKILVYGINY SPELTGIGKY TGEMVEWLAA QGHEVRVITA PPYYPQWQVG ENYSAWRYKR 
EEGAATVWRC PLYVPKQPST LKRLLHLGSF AVSSFFPLMA QRRWKPDRII GVVPTLFCTP
GMRLLAKLSG ARTVLHIQDY EVDAMLGLGL AGKGKGGKVA QLATAFERSG LHNVDNVSTI
SRSMMNKAIE KGVAAENVIF FPNWSEIARF QHVADADVDA LRNQLGLPDN KKIILYSGNI
GEKQGLENVI EAADRLRDEP LIFAIVGQGG GKARLEKMAQ QRGLRNMQFF PLQSYDALPA
LLKMGDCHLV VQKRGAADAV LPSKLTNILA VGGNAVITAE AHTELGQLCE TFPGIAVCVE
PESVEALVAG IRQALLLPKH NTVAREYAER TLDKENVLRQ FINDIRG