Gene Information |
Locus tag | EcSMS35_1005 |
Symbol | wcaC |
ID | 6146381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1023496 |
End bp | 1024713 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615892 |
Product | putative glycosyl transferase |
Protein accession | YP_001743084 |
Protein GI | 170682720 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
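The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1023496
END_BP = 1024713
GENE_LENGTH_BP = 1218
PROTEIN_LENGTH_AA = 405


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1218
print(coding_codons(GENE_LENGTH_BP))   # 405
```

Both results agree with the Gene Length and Protein Length fields in the table.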
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.636384 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATTC TGCAATTCAA TGTGCGACTG GCGGAAGGCG GGGCAGCAGG TGTGGCGTTA GATCTCCACC AGCGCGCGCT GCAACAGGGG CTGGCGTCAC ATTTTGTCTA CGGCTACGGC AAAGGTGGCA AAGAGAGCGT CAGCCATCAG AACTATCCGC AGGTCATCAA ACATACGCCG CGGATGACCG CGATGGCGAA CATTGCGTTG TTTCGTCTGC TTAATCGCGA TCTGTTCGGC AATTTCAATG AGTTATATCG CACCATTACT CGCACACCGG GTCCGGTGGT CCTGCATTTT CATGTGCTGC ACAGCTACTG GTTGAATCTT AAGAGCGTGG TGCGCTTTTG CGAAAAAGTG AAAAACCATA AAACGGATGT CACTCTGGTC TGGACGCTAC ACGACCACTG GAGCGTTACC GGACGCTGCG CCTTTACCGA CGGTTGCGAA GGCTGGAAAA CGGGCTGCCA GAAATGCCCG ACCTTAAATA ATTATCCGCC GGTGAAGATT GATCGCGCAC ACCAACTGGT GGCGGGCAAA CGCCAGTTAT TCCGTGAGAT GCTGGCGCTG GGCTGTCAGT TTATTTCCCC CAGCCAGCAT GTGGCTGATG CTTTCAATAG TCTGTACGGT CCAGGGCGTT GCCGGATTAT CAATAATGGC ATTGATATGG CAACCGAAGC GATTCTGGCG GACTTGCCTC CGGTACGCGA AACCCAGGAC AAGCCGAAAA TCGCGGTGGT GGCGCATGAT CTGCGTTACG ACGGCAAAAC TAACCAGCAA CTGGTACGTG AGATGATGGC GCTGGGCGAC AAAATTGAAC TGCATACCTT TGGTAAGTTC TCGCCGTTCA CCGCTGGCAA CGTGGTTAAT CACGGCTTTG AAACTGACAA GCGCAAGTTG ATGAGCGCGC TCAATCAGAT GGATGCGTTG GTATTCAGTT CTCGCGTCGA TAACTACCCG CTGATTTTGT GTGAGGCGCT ATCGATTGGC GTGCCGGTGA TTGCCACCCA TAGCGATGCG GCGCGGGAAG TGCTGCAAAA ATCCGGCGGT AAAACCGTCA GCGAAGAAGA GGTGCTGCAA CTGGTGCAGT TAAGCAAACC GGAAATTGCG CAGGCGATAT TTGGTACCAC GCTGGCTGGG TTTAGCCAAC GCAGCCGCGC CGCCTACAGT GGACAACAGA TGCTGGAGGA GTATGTCAAC TTCTATCAGA ATCTGTAG
|
Protein sequence | MNILQFNVRL AEGGAAGVAL DLHQRALQQG LASHFVYGYG KGGKESVSHQ NYPQVIKHTP RMTAMANIAL FRLLNRDLFG NFNELYRTIT RTPGPVVLHF HVLHSYWLNL KSVVRFCEKV KNHKTDVTLV WTLHDHWSVT GRCAFTDGCE GWKTGCQKCP TLNNYPPVKI DRAHQLVAGK RQLFREMLAL GCQFISPSQH VADAFNSLYG PGRCRIINNG IDMATEAILA DLPPVRETQD KPKIAVVAHD LRYDGKTNQQ LVREMMALGD KIELHTFGKF SPFTAGNVVN HGFETDKRKL MSALNQMDAL VFSSRVDNYP LILCEALSIG VPVIATHSDA AREVLQKSGG KTVSEEEVLQ LVQLSKPEIA QAIFGTTLAG FSQRSRAAYS GQQMLEEYVN FYQNL
|
| |
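The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch for working with that layout follows: it strips the whitespace, computes GC content, and translates DNA using the standard codon assignments, which translation table 11 shares for elongation codons. Table 11 also permits alternative start codons that are read as Met at initiation; the sketch does not handle them, which does not matter here because this gene starts with ATG. All function names are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two and last two blocks of the gene sequence shown above.
print(translate("ATGAATATTC TGCAATTCAA"))  # MNILQF
print(translate("TTCTATCAGA ATCTGTAG"))    # FYQNL
```

The two printed fragments match the beginning and end of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein sequence and GC content fields.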