Gene EcSMS35_1005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1005 
SymbolwcaC 
ID6146381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1023496 
End bp1024713 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content54% 
IMG OID641615892 
Productputative glycosyl transferase 
Protein accessionYP_001743084 
Protein GI170682720 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.636384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTC TGCAATTCAA TGTGCGACTG GCGGAAGGCG GGGCAGCAGG TGTGGCGTTA 
GATCTCCACC AGCGCGCGCT GCAACAGGGG CTGGCGTCAC ATTTTGTCTA CGGCTACGGC
AAAGGTGGCA AAGAGAGCGT CAGCCATCAG AACTATCCGC AGGTCATCAA ACATACGCCG
CGGATGACCG CGATGGCGAA CATTGCGTTG TTTCGTCTGC TTAATCGCGA TCTGTTCGGC
AATTTCAATG AGTTATATCG CACCATTACT CGCACACCGG GTCCGGTGGT CCTGCATTTT
CATGTGCTGC ACAGCTACTG GTTGAATCTT AAGAGCGTGG TGCGCTTTTG CGAAAAAGTG
AAAAACCATA AAACGGATGT CACTCTGGTC TGGACGCTAC ACGACCACTG GAGCGTTACC
GGACGCTGCG CCTTTACCGA CGGTTGCGAA GGCTGGAAAA CGGGCTGCCA GAAATGCCCG
ACCTTAAATA ATTATCCGCC GGTGAAGATT GATCGCGCAC ACCAACTGGT GGCGGGCAAA
CGCCAGTTAT TCCGTGAGAT GCTGGCGCTG GGCTGTCAGT TTATTTCCCC CAGCCAGCAT
GTGGCTGATG CTTTCAATAG TCTGTACGGT CCAGGGCGTT GCCGGATTAT CAATAATGGC
ATTGATATGG CAACCGAAGC GATTCTGGCG GACTTGCCTC CGGTACGCGA AACCCAGGAC
AAGCCGAAAA TCGCGGTGGT GGCGCATGAT CTGCGTTACG ACGGCAAAAC TAACCAGCAA
CTGGTACGTG AGATGATGGC GCTGGGCGAC AAAATTGAAC TGCATACCTT TGGTAAGTTC
TCGCCGTTCA CCGCTGGCAA CGTGGTTAAT CACGGCTTTG AAACTGACAA GCGCAAGTTG
ATGAGCGCGC TCAATCAGAT GGATGCGTTG GTATTCAGTT CTCGCGTCGA TAACTACCCG
CTGATTTTGT GTGAGGCGCT ATCGATTGGC GTGCCGGTGA TTGCCACCCA TAGCGATGCG
GCGCGGGAAG TGCTGCAAAA ATCCGGCGGT AAAACCGTCA GCGAAGAAGA GGTGCTGCAA
CTGGTGCAGT TAAGCAAACC GGAAATTGCG CAGGCGATAT TTGGTACCAC GCTGGCTGGG
TTTAGCCAAC GCAGCCGCGC CGCCTACAGT GGACAACAGA TGCTGGAGGA GTATGTCAAC
TTCTATCAGA ATCTGTAG
 
Protein sequence
MNILQFNVRL AEGGAAGVAL DLHQRALQQG LASHFVYGYG KGGKESVSHQ NYPQVIKHTP 
RMTAMANIAL FRLLNRDLFG NFNELYRTIT RTPGPVVLHF HVLHSYWLNL KSVVRFCEKV
KNHKTDVTLV WTLHDHWSVT GRCAFTDGCE GWKTGCQKCP TLNNYPPVKI DRAHQLVAGK
RQLFREMLAL GCQFISPSQH VADAFNSLYG PGRCRIINNG IDMATEAILA DLPPVRETQD
KPKIAVVAHD LRYDGKTNQQ LVREMMALGD KIELHTFGKF SPFTAGNVVN HGFETDKRKL
MSALNQMDAL VFSSRVDNYP LILCEALSIG VPVIATHSDA AREVLQKSGG KTVSEEEVLQ
LVQLSKPEIA QAIFGTTLAG FSQRSRAAYS GQQMLEEYVN FYQNL