Gene ECH74115_2977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2977 
SymbolwcaM 
ID6969348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2754717 
End bp2756111 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content47% 
IMG OID643386817 
Productputative colanic acid biosynthesis protein 
Protein accessionYP_002271285 
Protein GI209397395 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000190544 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0000308601 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCATTTA AAAAACTCTC CCGACGCACC TTCCTGACGG CAAGCTCGGC GCTTGCCTTC 
CTTCATACCC CTTTCGCTCG CGCATTTCCC GCCCGACAAA GCGTTAATAT CAACGACTAC
AACCCTCACG ACTGGATCGC CTCATTTAAA CAAGCCTTCA GCGAAGGGCA AACGGTTGTC
GTGCCTGCTG GATTCGTTTG TGACAATATC AACACCGGCA TCTTCATTCC TCCTGGCAAA
ACGTTACACA TCCTTGGAAG CCTGCGCGGC AACGGCAGAG GGCGATTTGT CTTACAGGAC
GGCAGCCAGG TGACAGGGGA GGAGGGCGGC AGTATGCATA ACATCACCCT GGATGTGCGT
GGTTCTGACT GCACCATCAA AGGGCTGGCG ATGAGCGGCT TTGGTCCGGT AACGCAGATT
TATATCGGCG GCAAAAACAA ACGGGTCATG CGCAACCTGA CCATCGATAA CCTGACCATT
AGCCACGCTA ATTACGCCAT CTTACGCCAG GGATTTCATA ACCAGATTAT CGGTGCCAAC
ATCACCAATT GTAAGTTCAG CGACTTACAG GGCGATGCCA TTGAATGGAA CGTGGCGATT
AACGACCGCG ATATTTTGAT CTCCGACCAT GTCATCGAGC GCATCAACTG TACTAACGGC
AAAATCAACT GGGGCATTGG CATAGGCCTT GCGGGAAGCA CTTACGATAA TAATTACCCG
GAAGATCAGG CAGTGAAAAA CTTTGTCGTG GCGAATATCA CGGGATCGGA TTGTCGGCAG
TTGATACATG TTGAAAATGG TAAACATTTT GTTATTCGTA ATATCAAAGC CCGCAATATC
ACGCAGGATT TCAGTAAGAA AGCAGGTATT GATAACGCCA CAGTCGCTAT TTACGGTTGT
GACAATTTCG TGATTGATAA TATTGAAATG ATTAATAGCG CCGGGATGTT AATCGGCTAT
GGGGTAATTA AAGGCAATTA TCTCTCGATT CCGCAAAATT TCCGAGTGAA TAATATTCAA
CTGGATAACA CCCATCTTGC TTATAAATTG CGCGGCCTCC AAATCTCTGC CGGGAATGCC
GTCTCCTTTG TGGCACTAAC TAACATTGAG ATGAAGCGTG CGTCGCTGGA GTTACACAAC
AAACCGCAAC ATCTTTTTCT GCGTAATATC AAAGTGATGC AGGAATCCTC TGTTGGACCC
GCATTGATTA TGAACTTCGA CATGCGCAAA GACGTTCGAG GCGTCTTTAT GGCGAAAGAA
GAAACACTGC TGTCTCTTGC AAATGTTCAT GCGGTGAATG AGAAAGGACA AAGCTCCGTC
GATATCGACA GGATTAATCA CCATATTGTT AATGTGGAAA AGATTAACTT TAGATTGCCG
GAACGGAGAG AGTAG
 
Protein sequence
MPFKKLSRRT FLTASSALAF LHTPFARAFP ARQSVNINDY NPHDWIASFK QAFSEGQTVV 
VPAGFVCDNI NTGIFIPPGK TLHILGSLRG NGRGRFVLQD GSQVTGEEGG SMHNITLDVR
GSDCTIKGLA MSGFGPVTQI YIGGKNKRVM RNLTIDNLTI SHANYAILRQ GFHNQIIGAN
ITNCKFSDLQ GDAIEWNVAI NDRDILISDH VIERINCTNG KINWGIGIGL AGSTYDNNYP
EDQAVKNFVV ANITGSDCRQ LIHVENGKHF VIRNIKARNI TQDFSKKAGI DNATVAIYGC
DNFVIDNIEM INSAGMLIGY GVIKGNYLSI PQNFRVNNIQ LDNTHLAYKL RGLQISAGNA
VSFVALTNIE MKRASLELHN KPQHLFLRNI KVMQESSVGP ALIMNFDMRK DVRGVFMAKE
ETLLSLANVH AVNEKGQSSV DIDRINHHIV NVEKINFRLP ERRE