Gene EcE24377A_2335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2335 
SymbolwcaM 
ID5587086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2297425 
End bp2298819 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content48% 
IMG OID640926000 
Productputative colanic acid biosynthesis protein 
Protein accessionYP_001463395 
Protein GI157159381 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000518984 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTTA AAAAACTCTC CCGACGCACC TTCCTGACGG CAAGCTCGGC GCTTGCCTTC 
CTCCATACCC CTTTCGCTCG CGCACTTCCC GCCCGACAAA GCGTTAACAT TAACGACTAC
AACCCACACG ACTGGATCGC CTCATTTAAA CAAGCCTTCA GCGAAGGGCA AACAGTCGTC
GTGCCTGCCG GATTGGTTTG TGACAATATC AACACCGGCA TCTTCATTCC TCCAGGCAAA
ACGTTACACA TCCTGGGAAG CCTGCGCGGT AACGGCAGAG GGCGTTTTGT CTTACAGGAC
GGCAGCCAGG TGACAGGGAA GGAGGGCGGC GGTATGCATA ACATCACCCT GGATGTGCGT
GGCTCTGACT GCACCATCAA AGGGCTGACG ATGAGCGGCT TTGGTCCGGT AACGCAGATT
TATATCGGCG GCAAAAACAA ACGGGTCATG CGCAACCTGA CCATCGATAA CCTCACTGTC
AGCCACGCTA ATTACGCCAT CTTACGCCAG GGATTTCATA ACCAGATAAT CGGTGCCAAC
ATCACCAACT GTAAATTTAG TGATTTACAG GGCGACGCCA TTGAATGGAA CGTGGCGATT
AACGACCGTG ATATCTTGAT CTCCGACCAT GTCATCGAGC GCATCAACTG TACTAATGGC
AAAATCAACT GGGGCATCGG CATAGGTCTT GCGGGAAGTA CCTACGATAA CAACTACCCG
GAAGACCAGG CAGTGAAAAA CTTTGTCGTG GCGAATATCA CGGGATCGGA TTGTCGGCAG
TTGATCCATG TTGAAAATGG TAAACATTTT GTTATTCGTA ATATCAAAGC CCGCAATATC
ACGCCGGATT TTAGTAAGAA AGCAGGTATT GATAACGCCA CAGTCGCTAT TTACGGTTGT
GACAATTTCG TGATTGATAA TATTGAAATG ACTAATAGCG CCGGGATGTT AATCGGTTAT
GGGGTAATTA AAGGCAAATA TCTCTCGATA CCGCAAAATT TCCGAGTGAA TAATATTCAA
CTGGATAACA CCCATCTTGC TTATAAATTG CGCGGCATCC AAATCTCCGC CGGGAATGCT
GTCTCCTTTG TGGCACTGAC TAACATTGAG ATGAAGCGTG CATCGCTGGA GTTACACAAC
AAACCGCAAC ATCTTTTTAT GCGTAATATC AAGGTGATGC AGGAATCCTC AGTTGGACCA
GCATTGAGCA TGAACTTCGA CATGCGCAAA GACGTTCGCG GCGTCTTTAT GGCGAAAAAA
GAAACACTGC TGTCTCTTGC AAATGTTCAT GCGGTGAATG AAAAAGGGCA AAGCTCCGTC
GATATCGACA GAGTTAATCA CCATATTGTT AATGTGGAAA AGATTAACTT TAGATTGCCG
GAACGGAGAG AGTAG
 
Protein sequence
MPFKKLSRRT FLTASSALAF LHTPFARALP ARQSVNINDY NPHDWIASFK QAFSEGQTVV 
VPAGLVCDNI NTGIFIPPGK TLHILGSLRG NGRGRFVLQD GSQVTGKEGG GMHNITLDVR
GSDCTIKGLT MSGFGPVTQI YIGGKNKRVM RNLTIDNLTV SHANYAILRQ GFHNQIIGAN
ITNCKFSDLQ GDAIEWNVAI NDRDILISDH VIERINCTNG KINWGIGIGL AGSTYDNNYP
EDQAVKNFVV ANITGSDCRQ LIHVENGKHF VIRNIKARNI TPDFSKKAGI DNATVAIYGC
DNFVIDNIEM TNSAGMLIGY GVIKGKYLSI PQNFRVNNIQ LDNTHLAYKL RGIQISAGNA
VSFVALTNIE MKRASLELHN KPQHLFMRNI KVMQESSVGP ALSMNFDMRK DVRGVFMAKK
ETLLSLANVH AVNEKGQSSV DIDRVNHHIV NVEKINFRLP ERRE