Gene Information |
Locus tag | B21_01952 |
Symbol | wcaC |
ID | 8114334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2029174 |
End bp | 2030391 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644848166 |
Product | hypothetical protein |
Protein accession | YP_002999739 |
Protein GI | 251785435 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
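The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed 1218 bp) and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(2029174, 2030391)
print(length, protein_length_aa(length))  # 1218 405
```

Both results agree with the Gene Length and Protein Length rows. The strand ("-") affects which end of the interval holds the start codon, not the length.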
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTT TGCAATTTAA TGTGCGACTG GCGGAAGGCG GGGCAGCAGG TGTGGCGTTA GATCTCCACC AGCGTGCGCT GCAACAGGGG CTGGCGTCAC ATTTTGTGTA CGGTTACGGC AAAGGCGGCA AAGAGAGTGT CAGCCATCAA AACTATCCGC AGGTCATCAA ACATACGCCG CGGATGACCG CGATGGCGAA CATTGCCCTG TTTCGTCTGT TTAATCGCGA TCTGTTTGGC AATTTCAATG AGTTATATCG CACCATTACT CGTACACCGG GTCCGGTGGT CCTGCATTTT CATGTGCTGC ACAGCTACTG GCTAAATCTT AAGAGCGTGG TGCGCTTTTG CGAAAAGGTG AAAAACCACA AACCGGATGT CACTCTGGTC TGGACGCTGC ACGACCACTG GAGCGTTACC GGGCGCTGCG CCTTTACCGA CGGTTGTGAA GGCTGGAAAA CGGGCTGCCA GAAATGCCCG ACCTTAATTA ATTATCCGCC GGTGAAGATT GATCGCGCAC ACCAGCTGGT GGCGGGCAAA CGCCAGTTAT TCCGTGAGAT GCTGGCGCTG GGCTGTCAGT TTATTTCCCC CAGCCAGCAT GTGGCTGACG CTTTCAATAG CCTGTACGGT CCAGGGCGTT GCCGGATTAT CAATAATGGC ATTGATATGG CAACCGAAGC GATTCTGGCG GACTTGCCTC CGGTGCGCGA AACCCAGGGT AAGCCGAAAA TCGCGGTGGT GGCGCATGAC CTGCGTTACG ACGGCAAAAC TAACCAGCAA CTGGTGCGCG AGATGATGGC GCTGGGCGAC AAAATCGAAC TGCATACCTT TGGTAAGTTC TCGCCGTTCA CCGCTGGCAA CGTGGTTAAT CACGGCTTTG AAACTGACAA GCGCAAGTTG ATGAGCGCGC TCAATCAGAT GGATGCGCTG GTGTTCAGTT CTCGCGTCGA TAACTACCCG CTGATTTTGT GTGAGGCGCT ATCGATTGGC GTGCCGGTGA TTGCCACCCA TAGCGATGCG GCGCGGGAAG TGTTGCAAAA ATCCGGCGGT AAAACCGTCA GCGAAGAAGA GGTGCTGCAA CTGGTGCAGT TAAGCAAACC GGAAATCGCG CAGGCGATAT TTGGTACCAC GCTGGCTGAG TTCAGCCAAC GCAGCCGCGC CACCTACAGT GGACAACAGA TGCTGGAGGA GTATGTCAAC TTCTATCAGA ATCTGTAG
|
Protein sequence | MNILQFNVRL AEGGAAGVAL DLHQRALQQG LASHFVYGYG KGGKESVSHQ NYPQVIKHTP RMTAMANIAL FRLFNRDLFG NFNELYRTIT RTPGPVVLHF HVLHSYWLNL KSVVRFCEKV KNHKPDVTLV WTLHDHWSVT GRCAFTDGCE GWKTGCQKCP TLINYPPVKI DRAHQLVAGK RQLFREMLAL GCQFISPSQH VADAFNSLYG PGRCRIINNG IDMATEAILA DLPPVRETQG KPKIAVVAHD LRYDGKTNQQ LVREMMALGD KIELHTFGKF SPFTAGNVVN HGFETDKRKL MSALNQMDAL VFSSRVDNYP LILCEALSIG VPVIATHSDA AREVLQKSGG KTVSEEEVLQ LVQLSKPEIA QAIFGTTLAE FSQRSRATYS GQQMLEEYVN FYQNL
|
| |
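The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is illustrative only: it assumes the sequence is given 5' to 3' on the coding strand (as displayed), removes the spaces between the 10-base blocks, and uses the standard codon assignments. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, so this sketch does not model alternative starts. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons and strip any trailing stop symbol."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First and last 18 bases of the gene sequence listed above.
print(translate("ATGAATATTT TGCAATTT"))  # MNILQF
print(translate("TTCTATCAGA ATCTGTAG"))  # FYQNL
```

The two fragments reproduce the first six and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein sequence and GC content rows to be checked in the same way.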