Gene EcSMS35_1012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1012 
SymbolwcaI 
ID6146917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1029821 
End bp1031044 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content56% 
IMG OID641615899 
Productputative glycosyl transferase 
Protein accessionYP_001743091 
Protein GI170683742 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAC TGGTCTACGG CATTAACTAC TCGCCGGAGT TAACCGGCAT CGGCAAATAC 
ACCGGCGAGA TGGTGGAATG GCTGGCGGCA GAAGGTCATG AGGTGCGGGT TATTACCGCA
CCGCCTTACT ACCCGCAATG GCAGGTGGGC GAGAACTATT CCGCCTGGCG GTACAAACGG
GAAGAGGGGG CCGCCACGGT GTGGCGCTGC CCGCTGTACG TGCCAAAACA GCCGAGCACC
CTGAAACGCC TGTTGCATCT CGGCAGTTTT GCCGCCAGCA GTTTCTTTCC GCTGATGGCG
CAACGTCGCT GGAAGCCGGA TCGCATTATC GGCGTGGTAC CAACGCTGTT TTGCACGCCG
GGAATGCGCC TGCTGGTGAA ACTGTCTGGT GCGCGCACGG TGCTACATAT TCAGGATTAC
GAAGTGGACG CCATGCTGGG GCTGGGCCTT GCCGGAAAAA GTAAAGGCGG CAAAGTGGCA
CAGCTGGCGA CGGCGTTCGA ACGTAGCGGA CTGCATAACG TCGATAACGT CTCCACGATT
TCGCGCTCGA TGATGAATAA AGCTATCGAA AAAGGCGTGG CGGCGGAAAA CGTCATCTTC
TTCCCCAACT GGTCGGAAAT CGCCCGTTTT CAACATGTTG CAGACGCCGA TGTTGACGCC
CTTCGTAACC AGCTTGGCCT GCCGGATAAC AAAAAAATCA TTCTTTACTC CGGCAATATT
GGTGAAAAGC AGGGGCTGGA AAACGTTATT GAAGCTGCCG ATCGCCTGCG CGATGAACCG
CTTATTTTTG CCATTGTCGG GCAGGGCGGC GGCAAAGCGC GGCTTGAAAA AATGGCGCAG
CAGCGTGGAC TGCGCAACAT GCAATTTTTC CCGCTGCAAT CGTATGACGC TTTACCCGCA
CTGCTGAAGA TGGGGGATTG CCATCTGGTG GTGCAAAAAC GCGGCGCGGC AGATGCCGTA
TTGCCGTCGA AACTGACCAA TATTCTGGCG GTAGGCGGTA ACGCGGTGAT TACTGCTGAA
GCCCACACAG AACTGGGACA GCTTTGCGAA ACCTTTCCGG GCATTGCGGT TTGCGTAGAA
CCGGAATCGG TCGAGGCGCT GGTGGCGGGG ATTCGTCAGG CGCTCCTGCT GCCCAAACAC
AACACGGTGG CACGTGAATA TGCCGAACGC ACGCTCGATA AAGAGAACGT GTTACGTCAA
TTTATAAATG ATATTCGGGG ATAA
 
Protein sequence
MKILVYGINY SPELTGIGKY TGEMVEWLAA EGHEVRVITA PPYYPQWQVG ENYSAWRYKR 
EEGAATVWRC PLYVPKQPST LKRLLHLGSF AASSFFPLMA QRRWKPDRII GVVPTLFCTP
GMRLLVKLSG ARTVLHIQDY EVDAMLGLGL AGKSKGGKVA QLATAFERSG LHNVDNVSTI
SRSMMNKAIE KGVAAENVIF FPNWSEIARF QHVADADVDA LRNQLGLPDN KKIILYSGNI
GEKQGLENVI EAADRLRDEP LIFAIVGQGG GKARLEKMAQ QRGLRNMQFF PLQSYDALPA
LLKMGDCHLV VQKRGAADAV LPSKLTNILA VGGNAVITAE AHTELGQLCE TFPGIAVCVE
PESVEALVAG IRQALLLPKH NTVAREYAER TLDKENVLRQ FINDIRG