Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4149 |
Symbol | |
ID | 6146238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4248678 |
End bp | 4249724 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618972 |
Product | lipopolysaccharide biosynthesis protein WzzE |
Protein accession | YP_001746104 |
Protein GI | 170682919 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3765] Chain length determinant protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAAC CAATGCCTGG GAAACCGGCC GAAGACGCTG AAAATGAACT GGATATTCGT GGGTTGTTTC GTACCTTGTG GGCTGGGAAG CTGTGGATTA TTGGCATGGG GCTGGCGTTT GCGTTAATCG CGCTGGCGTA TACTTTTTTT GCTCGACAGG AGTGGAGCGC AACGGCGATT ACCGATCGTC CAACGGTGAA TATGCTGGGG GGATATTACT CGCAGCAGCA ATTTTTGCGT AACCTCGATG TCCGTTCAAA CATGGCTTCT GCCGACCAAC CATCGGTCAT GGACGAAGCC TATAAAGAGT TTGTTATGCA ACTGGCCTCG TGGGATACCC GCAGAGAGTT CTGGCTGCAA ACCGACTATT ACAAACAGCG GATGGTGGGC AATAGCAAAG CCGATGCGGC GTTGCTGGAT GAAATGATCA ACAACATCGT GTTTATTCCC GGTGACTTCA CCCGCGCGGT CAATGACAGC GTGAAGCTGA TTGCCGAAAC TGCGCCTGAC GCTAATAACC TGTTACGTCA GTATGTTGCT TTTGCCAGCC AGCGTGCAGC CAGCCATCTG AATGATGAGC TGAAAGGCGC ATGGGCGGCA CGCACCATCC AGATGAAAGC TCAGGTGAAG CGTCAGGAAG AGGTGGCGAA AGCTATCTAC GATCGCCGGA TGAACAGTAT TGAACAGGCG CTGAAAATTG CTGAGCAGCA TAATATTTCG CGCAGTGCGA CAGATGTGCC TGCCGAGGAA CTACCTGATT CAGAAATGTT CCTGCTTGGG CGTCCAATGC TCCAGGCTCG ACTGGAAAAT TTACAGGCCG TCGGTCCGGC CTTTGATCTC GACTATGATC AGAATCGGGC CATGTTAAAC ACCCTGAATG TTGGTCCAAC CCTGGATCCG CGTTTTCAGA CCTATCGCTA TTTGCGTACG CCGGAAGAAC CGGTAAAACG CGATAGCCCA CGTCGTGCCT TCCTGATGAT TATGTGGGGC ATTGTCGGGG GACTGATCGG GGCTGGTGTC GCATTAACCC GCCGTTGCTC GAAATAG
|
Protein sequence | MTQPMPGKPA EDAENELDIR GLFRTLWAGK LWIIGMGLAF ALIALAYTFF ARQEWSATAI TDRPTVNMLG GYYSQQQFLR NLDVRSNMAS ADQPSVMDEA YKEFVMQLAS WDTRREFWLQ TDYYKQRMVG NSKADAALLD EMINNIVFIP GDFTRAVNDS VKLIAETAPD ANNLLRQYVA FASQRAASHL NDELKGAWAA RTIQMKAQVK RQEEVAKAIY DRRMNSIEQA LKIAEQHNIS RSATDVPAEE LPDSEMFLLG RPMLQARLEN LQAVGPAFDL DYDQNRAMLN TLNVGPTLDP RFQTYRYLRT PEEPVKRDSP RRAFLMIMWG IVGGLIGAGV ALTRRCSK
|
| |