Gene ECH74115_2979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2979 
SymbolwcaK 
ID6970131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2757339 
End bp2758619 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content54% 
IMG OID643386819 
Productputative pyruvyl transferase 
Protein accessionYP_002271287 
Protein GI209398906 
COG category[S] Function unknown 
COG ID[COG2327] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.158831 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.000214392 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATTAC TTATTCTGGG CAACCACACT TGCGGCAATC GTGGCGACAG CGCCATCTTG 
CGCGGCTTAC TTGATGCCAT CAACATCCTC AATCCACATA CCGAAGTGGA CGTGATGAGC
CGCTATCCGG TCAGTTCTTC CTGGCTGCTC AACCGCCCGG TAATGGGTGA TCCGCTGTTC
CTTCAAATGA AACAACACAA CAGCGCGGCG GGCGTTGTCG GGCGCGTTAA AAAAGTCCTC
CGTCGCCGCT ATCAGCACCA GGTATTGCTC TCACGCGTCA CCGACACTGG CAAGCTGCGC
AATATCGCCA TCGCCCAGGG ATTCACCGAC TTCGTGCGCC TACTGTCAGG TTACGACGCC
ATTATTCAGG TCGGCGGATC GTTTTTTGTC GATCTCTACG GCGTGCCGCA GTTTGAACAT
GCACTTTGCA CATTTATGGC GAAAAAGCCG CTGTTTATGA TTGGTCACAG CGTCGGTCCC
TTCCAGGATG AGCAATTTAA CCAACTGGCG AACTACGTTT TTGGTCACTG CGACGCGCTG
ATCTTGCGCG AATCGGTCAG CCTCGATCTA ATGAAACGCA GCAATATCAC CACTGCAAAA
GTTGAACATG GCGTCGATAC CGCGTGGCTG GTCGATCACC ACACAGAAGA CTTCACCGCC
AGCTATGCCG TCCAACACTG GCTGGACGTT GCCGCACAAC AGAAAACGGT GGCAATTACC
CTGCGCGAAC TGGCACCGTT CGACAAACGC CTCGGCACCA CTCAGCAAGT GTATGAAAAA
GCCTTTGCCG GGGTGGTCAA TCGCATTCTC GACGAAGGCT ATCAGGTCAT TGCGCTCTCC
ACCTGTACGG GTATCGACAG CTATAACAAA GACGACCGCA TGGTGGCGCT CAACCTGCGC
CAGCACATCA GCGATCCTGC CCGTTATCAC GTAGTGATGG ATGAACTTAA CGATCTGGAA
ATGGGCAAAA TTCTCGGTGC CTGTGAACTC ACCGTCGGTA CGCGCCTGCA CTCCGCCATT
ATCTCAATGA ACTTTGCTAC TCCGGCGATT GCCATCAACT ATGAACATAA ATCCGCCGGG
GTTATGCAGC AGCTGGGACT ACCGGAGATG GCAATTGATA TCCGTCATTT ATTAGACGGC
AGCCTGCAAG CGATGGTTGC GGATACCTTA GGCCAGCTTC CGGCGCTGAA CGCACGACTT
AGCGAAGCCG TCAGTCGTGA GCGTCAGACG GGAATGCAGA TGGTGCAGTC CGTACTGGAA
CGCATCGGGG AGGTGAAATG A
 
Protein sequence
MKLLILGNHT CGNRGDSAIL RGLLDAINIL NPHTEVDVMS RYPVSSSWLL NRPVMGDPLF 
LQMKQHNSAA GVVGRVKKVL RRRYQHQVLL SRVTDTGKLR NIAIAQGFTD FVRLLSGYDA
IIQVGGSFFV DLYGVPQFEH ALCTFMAKKP LFMIGHSVGP FQDEQFNQLA NYVFGHCDAL
ILRESVSLDL MKRSNITTAK VEHGVDTAWL VDHHTEDFTA SYAVQHWLDV AAQQKTVAIT
LRELAPFDKR LGTTQQVYEK AFAGVVNRIL DEGYQVIALS TCTGIDSYNK DDRMVALNLR
QHISDPARYH VVMDELNDLE MGKILGACEL TVGTRLHSAI ISMNFATPAI AINYEHKSAG
VMQQLGLPEM AIDIRHLLDG SLQAMVADTL GQLPALNARL SEAVSRERQT GMQMVQSVLE
RIGEVK