Gene EcE24377A_2336 details


Gene Information

Locus tag: EcE24377A_2336
Symbol: wcaL
ID: 5588258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 2298830
End bp: 2300050
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 54%
IMG OID: 640926001
Product: colanic acid biosynthesis glycosyl transferase WcaL
Protein accession: YP_001463396
Protein GI: 157158914
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
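
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the Gene Information fields above.
# Assumption: both positions are 1-based and inclusive.
start_bp = 2298830
end_bp = 2300050

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # number of codons, stop codon included
protein_length = codons - 1           # the stop codon adds no amino acid

print(gene_length, protein_length)    # 1221 406
```

Both results agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields in the record.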


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000162166
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCGCTGTCGT CGGAAACCTT CGTTCTTAAC 
CAAATTACCG CGTTTATTGA TATGGGATTC GAGGTGGAGA TTGTTGCGCT GCAAAAAGGC
GACACCCAGA ACACCCACGC GGCATGGACG AAATACAACC TTGCCGCAAG AACCCGCTGG
TTACAGGACG AACCACAAGG CAAAGTGGCG AAACTGCGCC ACCGCGCCAG CCAGACCTTA
CGCGGCATTC ATCGTAAAAA TACCTGGCAG GCGCTCAATC TCAAACGCTA TGGTGCCGAG
TCGCGGAACC TGATTTTGTC TGCCATTTGC GGTCAGGTCG CAACACCGTT TCATGCCGAT
GTCTTTATCG CTCATTTTGG TCCTGCGGGG GTAGCCGCAG CAAAACTACG CGAACTGGGT
GTCATTCGCG GCAAAATTGC CACTATCTTC CACGGTATTG ATATCTCCAG TCGGGAAGTG
CTCAACCACT ACACTCCCGA ATATCAGCAA CTGTTTTGCC GTGGCGACCT GATGTTACCG
ATAAGCGATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GCCCGAGGGA AAAAATCGCC
GTATCGCGCA TGGGCGTAGA TATGACGCGC TTTAGCCCGC GTCCCGTGAA AGCGCCCGCA
ACGCCGCTGG AAATCATCTC CGTCGCACGT TTAACCGAGA AAAAAGGCCT GCATGTGGCG
ATTGAAGCCT GCCGTCAGTT GAAAGAGCAG GGCGTGGCAT TTCGCTATCG CATCCTCGGC
ATTGGCCCGT GGGAACGACG CCTGCGCACG CTCATCGAAC AATATCAACT GGAAGATGTG
GTGGAGATGC CTGGCTTTAA ACCGAGCCAT GAAGTGAAAG CGATGCTCGA CGACGCGGAT
GTCTTCCTGT TGCCATCGGT TACAGGTGCG GATGGTGATA TGGAAGGCAT TCCGGTGGCG
CTAATGGAAG CGATGGCGGT CGGCATTCCG GTGGTTTCTA CTCTGCATAG CGGAATACCG
GAACTGGTGG AGGCTGACAA ATCCGGCTGG CTGGTGCCTG AGAACGATGC TCGCGCACTG
GCGCAACGAC TGGCGGCGTT TAGCCAACTG GACACCGACG AACTGGCTCC GGTTGTCAAA
CGTGCGCGCG AAAAAGTCGA ACACGATTTT AACCAGCAGG TGATTAATCG AGAACTCGCC
AGCTTGTTAC AGGCTTTATA G
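
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the full gene length; the helper name `gc_fraction` is hypothetical and not part of the source database. It strips the spaces and line breaks used in the sequence layout before counting.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used only as a short demo input.
# Pass the full sequence to compare against the GC content field.
first_line = "ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCGCTGTCGT CGGAAACCTT CGTTCTTAAC"
print(f"{gc_fraction(first_line):.0%}")
```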
 
Protein sequence
MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTQNTHAAWT KYNLAARTRW 
LQDEPQGKVA KLRHRASQTL RGIHRKNTWQ ALNLKRYGAE SRNLILSAIC GQVATPFHAD
VFIAHFGPAG VAAAKLRELG VIRGKIATIF HGIDISSREV LNHYTPEYQQ LFCRGDLMLP
ISDLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA
IEACRQLKEQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV VEMPGFKPSH EVKAMLDDAD
VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDARAL
AQRLAAFSQL DTDELAPVVK RAREKVEHDF NQQVINRELA SLLQAL
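
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship under two assumptions: the amino-acid assignments of table 11 are the same as in the standard genetic code (the tables differ in their permitted start codons), and the gene starts with ATG, as it does here, so alternative start codons need no special handling. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCGCTGTCGT CGGAAACCTT CGTTCTTAAC"
print(translate(first_line))  # MKVGFFLLKFPLSSETFVLN
```

The output matches the first 20 residues of the protein sequence above. This sketch does not handle ambiguous bases such as N.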