Gene Information |
Locus tag | SeAg_B4145 |
Symbol | |
ID | 6796871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 4047935 |
End bp | 4048981 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642778260 |
Product | lipopolysaccharide biosynthesis protein WzzE |
Protein accession | YP_002148848 |
Protein GI | 197249976 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3765] Chain length determinant protein |
TIGRFAM ID | |
|
|
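The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4047935
end_bp = 4048981
gene_length_bp = 1047
protein_length_aa = 348

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose final codon is a stop
# codon, so the protein has one residue fewer than the codon count.
computed_protein_length = computed_gene_length // 3 - 1

print("gene length matches:", computed_gene_length == gene_length_bp)
print("length is a multiple of 3:", computed_gene_length % 3 == 0)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions, all three checks should agree with the values reported in the record.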
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAAC CATTACCGGG GGCACGCGCG GTGAGCGCTG AAAATGAACT GGATATTCGC GGGTTGTTTC GTACTTTATG GGCTGGCAAA TACTGGATTA TCGGCATTGG CCTGCTATTT GCCCTTATCG CGTTAGCCTA TACCTTTTTT GCTCGTCAGG AGTGGAGTGC GACGGCGATC ACCGATCGCC CAACCGTAAA TATGTTGGGC GGTTATTACT CCCAGCAGCA GTTTCTGCGC AACCTGGATA TTAAGACCGA TCCCTCTTCT TCCGATAAGC CCTCGGTGAT GGATGAAGCG TATAAAGAGT TCATCATGCA ACTTGCCTCC TGGGATACGC GTCGCGATTT CTGGTTACAG ACGGACTATT ACAAGCAGCG AATGGTCGGG AATAGCAAAG CTGATGCGGC GATGCTTGAT GAACTAATCA ATAACATACA GTTTACGCCC GGCGATTTTA CACGCGCCAT CAACGACAAT GTGAAGCTGA TTGCTGAAAC TGCGCCGGAC GCCAATAATC TGCTGCGTCA GTATGTCGCA TTCGCCAGCC AGCGGGCGGC GAGCCATCTG AATGATGAAT TAAAAGGTGC CTGGGCTGCG CGTACCGTGC AGATGAAAGC CCAGGTCAAA CGGCAGGAAG AGGTTGCGAA AGCGATCTAT TCGCGTCGTG TAAACAGTAT TGAGCAGGCG CTCAAAATTG CGGAACAACA TAATATTTCT CGTAGCGCGA CGGATGTCCC GGCGGATGAA TTACCGGACT CAGAGCTTTT TTTACTCGGT CGCCCTATGT TGCAGGCGCG TCTTGAAAAT CTGCAAGCGG TTGGCCCTGC GTTCGATTTG GACTACTTTC AAAATCGGGC AATGTTGAAT ACGCTTAATG TGGGGCCGAC CCTGGACCCT CGTTTTCAGA CCTATCGTTA TTTGCGTACG CCGGAAGAAC CGGTAAAACG TGATAGCCCA CGCCGAGCCT TCCTGATGAT TATGTGGGGT ATCGTTGGGG CGCTAATCGG TGCGGGCGTT GCCTTAACCC GTCGCCGCAC GATTTAG
|
Protein sequence | MTQPLPGARA VSAENELDIR GLFRTLWAGK YWIIGIGLLF ALIALAYTFF ARQEWSATAI TDRPTVNMLG GYYSQQQFLR NLDIKTDPSS SDKPSVMDEA YKEFIMQLAS WDTRRDFWLQ TDYYKQRMVG NSKADAAMLD ELINNIQFTP GDFTRAINDN VKLIAETAPD ANNLLRQYVA FASQRAASHL NDELKGAWAA RTVQMKAQVK RQEEVAKAIY SRRVNSIEQA LKIAEQHNIS RSATDVPADE LPDSELFLLG RPMLQARLEN LQAVGPAFDL DYFQNRAMLN TLNVGPTLDP RFQTYRYLRT PEEPVKRDSP RRAFLMIMWG IVGALIGAGV ALTRRRTI