Gene EcolC_1598 details


Gene Information

Locus tag: EcolC_1598
Symbol: wcaM
ID: 6065621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1777533
End bp: 1778927
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 48%
IMG OID: 641601014
Product: putative colanic acid biosynthesis protein
Protein accession: YP_001724584
Protein GI: 170019630
COG category:
COG ID:
TIGRFAM ID:
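
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is not part of the database record; it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the gene record.
START_BP = 1777533
END_BP = 1778927
GENE_LENGTH_BP = 1395
PROTEIN_LENGTH_AA = 464


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # compare with Gene Length
    print(protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, 1778927 - 1777533 + 1 gives 1395 bp, and 1395 / 3 - 1 gives 464 aa, which agrees with the listed values.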


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATTTA AAAAACTCTC CCGACGCACC TTCCTGACGG CAAGCTCCGC GCTTGCCTTC 
CTTCATACCC CTTTCGCTCG TGCACTTCCC GCCCGACAAA GCGTTAACAT TAACGACTAC
AACCCACACG ACTGGATCGC CTCATTTAAA CAAGCCTTCA GCGAAGGGCA AACGGTCGTC
GTGCCTGCCG GATTGGTTTG TGACAATATC AACACCGGCA TCTTTATCCC TCCCGGTAAA
ACGTTACACA TCCTTGGAAG CCTGCGCGGC AACGGCAGAG GGCGATTTGT CTTACAGGAC
GGCAGCCAGG TGACAGGGGA GGAGGGCGGC AGTATGCATA ACATCACCCT GGATGTGCGC
GGTTCTGACT GCTCCATCAA AGGGCTGGTG ATGAGCGGCT TTGGCCCGGT AACGCAGATT
TATATCGGCG GCAAAAACAA ACGGGTCATG CGCAACCTGA CCATCGATAA CCTCACTGTC
AGCCACGCTA ATTACGCCAT CTTACGCCAG GGATTTCATA ATCAGATTAT CGGTGCCAAC
ATCACCAACT GTAAGTTCAG CGACTTACAG GGCGACGCCA TCGAATGGAA CGTGGCGATT
AACGACAGTG ATATTTTGAT ATCTGACCAT GTCATCGAGC GCATCAACTG TACCAACGGC
AAAATCAACT GGGGAATCGG CATAGGCCTT GCAGGAAGCA CTTACGATAA CAACTACCCG
GAAGACCAGG CAGTGAAAAA CTTTGTCGTG GCGAATATCA CGGGATCGGA TTGTCGGCAG
TTGATCCATG TTGAAAATGG CAAACATTTT GTTATTAGTA ATATCAAAGC CCGCAATATC
ACGCCGGATT TCAGTAAGAA AGCGGGCATT GATAACGCCA CGGTCGCTAT TTACGGTTGT
GACAATTTCG TGATTGATAA TATTGAAATG ATTAATAGCG CCGGGATGTT AATCGGCTAT
GGGGTAATTA AAGGCAAATA TCTCTCGATA CCGCAAAATT TCCGAGTGAA TAATATTCAA
CTGGATAACA CCCATCTTGC TTATAAATTG CGCGGCATCC AAATCTCCGC CGGGAATGCT
GTCTCCTTTG TGGCGCTGAC TAACATTGAG ATGAAGCGTG CGTCGCTGGA GTTACACAAC
AAACCGCAAC ATCTTTTTAT GCGTAATATC AAGGTGATGC AGGAATCCTC AGTTGGACCA
GCATTGAGCA TGAACTTCGA CATGCGCAAA GACGTTCGCG GCGTCTTTAT GGCGAAAAAA
GAAACACTCC TGTCTCTTGC AAATGTTCAT GCGGTGAATG AAAGAGGGCA AAGCTCCGTC
GATATCGACA GGATAAATCA CCATATTGTT AATGTGGAAA AGATTAACTT TAGATTGCCG
GAACGGAGGG AGTAG
 
Protein sequence
MPFKKLSRRT FLTASSALAF LHTPFARALP ARQSVNINDY NPHDWIASFK QAFSEGQTVV 
VPAGLVCDNI NTGIFIPPGK TLHILGSLRG NGRGRFVLQD GSQVTGEEGG SMHNITLDVR
GSDCSIKGLV MSGFGPVTQI YIGGKNKRVM RNLTIDNLTV SHANYAILRQ GFHNQIIGAN
ITNCKFSDLQ GDAIEWNVAI NDSDILISDH VIERINCTNG KINWGIGIGL AGSTYDNNYP
EDQAVKNFVV ANITGSDCRQ LIHVENGKHF VISNIKARNI TPDFSKKAGI DNATVAIYGC
DNFVIDNIEM INSAGMLIGY GVIKGKYLSI PQNFRVNNIQ LDNTHLAYKL RGIQISAGNA
VSFVALTNIE MKRASLELHN KPQHLFMRNI KVMQESSVGP ALSMNFDMRK DVRGVFMAKK
ETLLSLANVH AVNERGQSSV DIDRINHHIV NVEKINFRLP ERRE
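
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch (not part of the database record), the following shows one way to strip that formatting, translate the nucleotide sequence, and compute GC content. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first line and the final line of the gene sequence are used as input here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0; trailing bases that do not fill a codon are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
FIRST_LINE = "ATGCCATTTA AAAAACTCTC CCGACGCACC TTCCTGACGG CAAGCTCCGC GCTTGCCTTC"
LAST_LINE = "GAACGGAGGG AGTAG"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # first 20 residues of the protein
    print(translate(LAST_LINE))   # last residues followed by the stop
    print(gc_fraction(FIRST_LINE))
```

The first line translates to MPFKKLSRRTFLTASSALAF, matching the start of the listed protein sequence, and the last line translates to ERRE followed by a stop, matching its end. Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed in the record.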