Gene Information |
Locus tag | B21_01939 |
Symbol | wcaL |
ID | 8114341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2014254 |
End bp | 2015474 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644848154 |
Product | hypothetical protein |
Protein accession | YP_002999727 |
Protein GI | 251785423 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
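The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2014254
end_bp = 2015474
protein_length_aa = 406

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS has one codon per residue plus one terminal stop codon.
codons = gene_length_bp // 3

print(gene_length_bp)   # 1221
print(codons - 1)       # 406
```

Under those assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.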
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.156433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCACTGTCGT CGGAAACCTT CGTCCTCAAT CAAATTACCG CGTTTATTGA TATGGGCTTT GAGGTGGAGA TTGTCGCGCT GCAAAAAGGC GACACACAAA ACACCCACGC GGCATGGACA AAATATAACC TTGCCGCCAG AACCCGCTGG TTACAGGACG AGCCACAAGG CAAAGTGGCG AAACTGCGCC ACCGCGCCAG CCAGACGTTG CGCGGTATTC ATCGTAAAAA TACCTGGCAG GCGCTTAACC TCAAACGCTA TGGTGCTGAG TCGCGGAACC TGATTTTGTC TGCCATTTGC GGCCAGGTCG CAACACCGTT TCATGCCGAT GTGTTTATCG CTCATTTTGG CCCTGCTGGG GTAACCGCGG CAAAACTACG CGAACTGGGT GTCATTCGCG GCAAAATTGC CACTATCTTC CACGGTATTG ATATCTCCAG TCGGGAAGTG CTCAACCACT ACACTCCCGA ATATCAACAA CTGTTTCGCC GTGGCGACCT GATGTTACCA ATAAGCAATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GCCCGAGGGA AAAAATCGCC GTATCGCGCA TGGGCGTGGA CATGACGCGT TTTAGCCCGC GTCCGGTGAA AGCGCCCGCA ACGCCGCTGG AAATCATCTC CGTCGCACGC TTAACCGAGA AAAAAGGCCT GCATGTGGCG ATTGAAGCCT GCCGCCAGTT GAAAGAGCAG GGCGTGGCAT TTCGCTATCG CATCCTCGGC ATTGGCCCGT GGGAACGACG CCTGCGCACC CTCATCGAAC AATATCAACT GGAAGATGTG ATAGAGATGC CGGGCTTTAA ACCGAGCCAT GAAGTGAAAG CGATGCTCGA TGACGCGGAT GTCTTCCTGT TGCCATCGGT TACAGGTGCG GATGGCGATA TGGAAGGTAT TCCGGTGGCG CTAATGGAAG CGATGGCGGT TGGCATTCCG GTTGTTTCTA CTCTGCACAG CGGAATACCG GAACTGGTGG AGGCCGATAA ATCCGGTTGG CTGGTGCCTG AGAACGATGC CTGTGCACTG GCGCAACGAC TGGCGGCGTT TAGCCAACTG GACACCGACG AACTTACTAC GGTTGTCAAA CGTGCGCGCG AAAAAGTCGA ACACGATTTT AACCAGCAGG TGATCAATCG AGAACTCGCC AGCTTGCTGC AGGCTTTATA G
|
Protein sequence | MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTQNTHAAWT KYNLAARTRW LQDEPQGKVA KLRHRASQTL RGIHRKNTWQ ALNLKRYGAE SRNLILSAIC GQVATPFHAD VFIAHFGPAG VTAAKLRELG VIRGKIATIF HGIDISSREV LNHYTPEYQQ LFRRGDLMLP ISNLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA IEACRQLKEQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV IEMPGFKPSH EVKAMLDDAD VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDACAL AQRLAAFSQL DTDELTTVVK RAREKVEHDF NQQVINRELA SLLQAL
|
| |
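The gene and protein sequences above can be related by translating the DNA codon by codon. The following is a minimal sketch, not the database's own method: it assumes the Gene sequence field is already given in coding-strand orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the codon-to-amino-acid assignments of translation table 11. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 bases of the record are used for brevity.

```python
# Codon assignments for translation table 11, in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-base groups."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the Gene sequence field, copied as formatted in the record.
prefix = "ATGAAGGTCG GCTTCTTTTT ACTGAAATTT"
print(translate(prefix))  # MKVGFFLLKF
```

The translated prefix matches the first ten residues of the Protein sequence field. Passing the full Gene sequence string to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.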