Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03612 |
Symbol | wzzE |
ID | 8115648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3857348 |
End bp | 3858394 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644849776 |
Product | hypothetical protein |
Protein accession | YP_003001349 |
Protein GI | 251787045 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3765] Chain length determinant protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00829119 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAAC CAATGCCTGG GAAACCGGCC GAAGACGCTG AAAATGAACT GGATATTCGT GGGTTGTTTC GTACCTTGTG GGCTGGGAAG CTATGGATTA TTGGCATGGG GCTGGCGTTT GCGTTAATCG CGCTGGCGTA TACGTTTTTT GCTCGTCAGG AGTGGAGCTC GACGGCGATT ACCGATCGTC CAACGGTGAA TATGCTGGGG GGATATTACT CACAGCAGCA ATTTTTGCGT AACCTGGATG TCCGTTCAAA CATGGCTTCT GCCGACCAAC CATCGGTCAT GGACGAAGCC TATAAAGAGT TTGTTATGCA ACTGGCCTCG TGGGATACCC GCAGAGAGTT CTGGCTGCAA ACCGACTATT ACAAACAGCG GATGGTGGGC AACAGCAAAG CCGATGCGGC GTTGCTGGAT GAAATGATTA ACAACATCCA GTTTATCCCC GGAGACTTTA CCCGCGCGGT CAATGACAGC GTGAAGCTTA TTGCTGAAAC CGCGCCTGAC GCTAATAACC TGTTACGTCA GTATGTTGCT TTTGCCAGCC AGCGTGCAGC CAGCCATCTG AATGATGAGC TGAAAGGCGC ATGGGCGGCG CGTACCATCC AGATGAAAGC ACAGGTGAAG CGTCAGGAAG AGGTGGCGAA AGCCATCTAC GACCGCCGGA TGAACAGTAT TGAACAGGCG CTGAAAATTG CTGAGCAGCA TAATATTTCG CGCAGTGCGA CAGATGTGCC TGCCGAGGAA TTACCTGATT CAGAAATGTT CCTGCTTGGG CGTCCAATGC TCCAGGCTCG ACTGGAAAAT TTACAGGCCG TCGGTCCGGC CTTTGATCTC GACTATGATC AGAATCGGGC CATGTTAAAC ACCCTGAATG TTGGTCCAAC CCTGGATCCG CGTTTTCAGA CCTATCGCTA TTTGCGTACG CCGGAAGAAC CGGTAAAACG CGATAGCCCA CGTCGTGCCT TCCTGATGAT TATGTGGGGC ATTGTCGGGG GGCTGATCGG GGCTGGTGTC GCATTAACCC GCCGTTGCTC GAAATAG
|
Protein sequence | MTQPMPGKPA EDAENELDIR GLFRTLWAGK LWIIGMGLAF ALIALAYTFF ARQEWSSTAI TDRPTVNMLG GYYSQQQFLR NLDVRSNMAS ADQPSVMDEA YKEFVMQLAS WDTRREFWLQ TDYYKQRMVG NSKADAALLD EMINNIQFIP GDFTRAVNDS VKLIAETAPD ANNLLRQYVA FASQRAASHL NDELKGAWAA RTIQMKAQVK RQEEVAKAIY DRRMNSIEQA LKIAEQHNIS RSATDVPAEE LPDSEMFLLG RPMLQARLEN LQAVGPAFDL DYDQNRAMLN TLNVGPTLDP RFQTYRYLRT PEEPVKRDSP RRAFLMIMWG IVGGLIGAGV ALTRRCSK
|
| |