Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2335 |
Symbol | wcaM |
ID | 5587086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2297425 |
End bp | 2298819 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640926000 |
Product | putative colanic acid biosynthesis protein |
Protein accession | YP_001463395 |
Protein GI | 157159381 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000518984 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATTTA AAAAACTCTC CCGACGCACC TTCCTGACGG CAAGCTCGGC GCTTGCCTTC CTCCATACCC CTTTCGCTCG CGCACTTCCC GCCCGACAAA GCGTTAACAT TAACGACTAC AACCCACACG ACTGGATCGC CTCATTTAAA CAAGCCTTCA GCGAAGGGCA AACAGTCGTC GTGCCTGCCG GATTGGTTTG TGACAATATC AACACCGGCA TCTTCATTCC TCCAGGCAAA ACGTTACACA TCCTGGGAAG CCTGCGCGGT AACGGCAGAG GGCGTTTTGT CTTACAGGAC GGCAGCCAGG TGACAGGGAA GGAGGGCGGC GGTATGCATA ACATCACCCT GGATGTGCGT GGCTCTGACT GCACCATCAA AGGGCTGACG ATGAGCGGCT TTGGTCCGGT AACGCAGATT TATATCGGCG GCAAAAACAA ACGGGTCATG CGCAACCTGA CCATCGATAA CCTCACTGTC AGCCACGCTA ATTACGCCAT CTTACGCCAG GGATTTCATA ACCAGATAAT CGGTGCCAAC ATCACCAACT GTAAATTTAG TGATTTACAG GGCGACGCCA TTGAATGGAA CGTGGCGATT AACGACCGTG ATATCTTGAT CTCCGACCAT GTCATCGAGC GCATCAACTG TACTAATGGC AAAATCAACT GGGGCATCGG CATAGGTCTT GCGGGAAGTA CCTACGATAA CAACTACCCG GAAGACCAGG CAGTGAAAAA CTTTGTCGTG GCGAATATCA CGGGATCGGA TTGTCGGCAG TTGATCCATG TTGAAAATGG TAAACATTTT GTTATTCGTA ATATCAAAGC CCGCAATATC ACGCCGGATT TTAGTAAGAA AGCAGGTATT GATAACGCCA CAGTCGCTAT TTACGGTTGT GACAATTTCG TGATTGATAA TATTGAAATG ACTAATAGCG CCGGGATGTT AATCGGTTAT GGGGTAATTA AAGGCAAATA TCTCTCGATA CCGCAAAATT TCCGAGTGAA TAATATTCAA CTGGATAACA CCCATCTTGC TTATAAATTG CGCGGCATCC AAATCTCCGC CGGGAATGCT GTCTCCTTTG TGGCACTGAC TAACATTGAG ATGAAGCGTG CATCGCTGGA GTTACACAAC AAACCGCAAC ATCTTTTTAT GCGTAATATC AAGGTGATGC AGGAATCCTC AGTTGGACCA GCATTGAGCA TGAACTTCGA CATGCGCAAA GACGTTCGCG GCGTCTTTAT GGCGAAAAAA GAAACACTGC TGTCTCTTGC AAATGTTCAT GCGGTGAATG AAAAAGGGCA AAGCTCCGTC GATATCGACA GAGTTAATCA CCATATTGTT AATGTGGAAA AGATTAACTT TAGATTGCCG GAACGGAGAG AGTAG
|
Protein sequence | MPFKKLSRRT FLTASSALAF LHTPFARALP ARQSVNINDY NPHDWIASFK QAFSEGQTVV VPAGLVCDNI NTGIFIPPGK TLHILGSLRG NGRGRFVLQD GSQVTGKEGG GMHNITLDVR GSDCTIKGLT MSGFGPVTQI YIGGKNKRVM RNLTIDNLTV SHANYAILRQ GFHNQIIGAN ITNCKFSDLQ GDAIEWNVAI NDRDILISDH VIERINCTNG KINWGIGIGL AGSTYDNNYP EDQAVKNFVV ANITGSDCRQ LIHVENGKHF VIRNIKARNI TPDFSKKAGI DNATVAIYGC DNFVIDNIEM TNSAGMLIGY GVIKGKYLSI PQNFRVNNIQ LDNTHLAYKL RGIQISAGNA VSFVALTNIE MKRASLELHN KPQHLFMRNI KVMQESSVGP ALSMNFDMRK DVRGVFMAKK ETLLSLANVH AVNEKGQSSV DIDRVNHHIV NVEKINFRLP ERRE
|
| |