Gene Information |
Locus tag | EcHS_A2184 |
Symbol | |
ID | 5594251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2163469 |
End bp | 2164899 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640921317 |
Product | putative polysaccharide biosynthesis protein |
Protein accession | YP_001458856 |
Protein GI | 157161538 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | |
|
|
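The coordinate and length fields above are linked by simple arithmetic. With inclusive coordinates, gene length is End bp minus Start bp plus one, and an unspliced CDS encodes one fewer amino acid than it has codons because the stop codon is not translated. The following is a minimal Python sketch of that consistency check. The function names are illustrative and not part of the source database, and the inclusive 1-based coordinate convention is an assumption, although it agrees with the values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS that ends in a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per 3 bp; the final stop codon contributes no residue.
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2163469, 2164899)
print(length_bp, protein_length(length_bp))
```

Run on this record's coordinates, the sketch gives 1431 bp and 476 aa, matching the Gene Length and Protein Length fields.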
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00000339542 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTGC TTTCAAATGC TAAATGGAAC ATGGTATCAC AATTTATCAA AATGTTAGTG CAACTAACAA ACATAGTCTA TTTAGCAAAA ATTATTCCTC CAAGCGAATA TGGTTTAATG GCTATGGCTC TTGTTGTTGT GAACTTAGGT ATCCTGTTAC GAGATTTAGG CACTTCTGCT GCATTAATAC AAAGGAAAGA TCTTACTGAG TCTCTAATTA ATACTGTTTT TTGGTTAAAT CTCCTCATGG GATTGACTTT ATTTGTTCTG GTTTTTTCGG GCTCTAGTGT GATATCCAGT ATCTATCATC AGCCGAAATT AACTTTAGTA TTGATGTTGC TTAGTTTTAC ATTTCCTCTT TCGAGCTGTG CGGCAGCTCA CCTTGCTTTG CTTGAGAGGG ACTCGAAATT CAAAACAGTC TCTAGGATAG AAATTTCTTC TTCACTTGCT TCATTAGTAC TAGCGATTAC TTTAGCAAAT ATGGGCTTTG GTGTTTTCAG TCTTGTTGGT CAAGCTTTAA TACTTAATCT GATGTCTGCT ATACAATTCT GGCTTGCATC AAATTGGAAA CCTTCTATTA GGGTTTTTAT TAATTATAAA GATTTGAAAA GTATTTTTAG TTTTAGTGCA AATTTATCAA TGTTTAATTT TATAAATTAT TTTTCAAGGA ACGCAGATAG CTTTATTATT GGTAAATTCA TGTCCGCTTC AATTCTCGGG AGTTATAATC TTGCTTATAG AATTATGCTT TTCCCATTGC AAAGTCTTAC CTTTGTCGCA ACTAGGTCAC TTTATCCAAT ATTAAGTAAA CAACAAAGTA ATAATCAGCA TATATCAAAA ATATATTTAA AGTGTGTCTA TGTTGTGTTG TTTATAACAT GTCCATTGAT GTCTGGACTG GCATTTTATA GTGAGCCTTT TATCCGATTG ATATTCGGTG ATGAATGGTA TCTGACGGGA ATAGTATTAA AATGGTTAGC GCCAACTGCA ATAATACAAG CGGTACTTAG TACTTCTGGT TCAGTTTTTA TGGCTAAAGG TCGTACAGAT ATTTTATTAA AGCTTGGCAT TATTGGAATG ATCTTGCAAG TGGGAGCTTT CATCATAGGT GTTCAATATA CAATAACCAC ATTTGCTATG TGTTATCTTC TTGCTAATGT TATTAATTTT TTCCCTGTCA TGTGGTCTTT AATGGGTTTG TTAGGGGAGA ATCTCAATGT TTTTTTTAAA AAGAACTATT CAATTGTTTT GGCTACATTA GTCATGTTAA GTTACTTAAA ACTCATTGAT TATTTCTTTT TCCCCAATGC TATTGCGAGC CTTAATATTT TAGTTCTATT ATCCTTTTCT GGCGCCGTTG TTTATTTATT AGCCTCTCTC ACTCTTTCCG CAACTCTTAG AAAATTTGTG CTATCAAAGA TTAAAAAATG A
|
Protein sequence | MTLLSNAKWN MVSQFIKMLV QLTNIVYLAK IIPPSEYGLM AMALVVVNLG ILLRDLGTSA ALIQRKDLTE SLINTVFWLN LLMGLTLFVL VFSGSSVISS IYHQPKLTLV LMLLSFTFPL SSCAAAHLAL LERDSKFKTV SRIEISSSLA SLVLAITLAN MGFGVFSLVG QALILNLMSA IQFWLASNWK PSIRVFINYK DLKSIFSFSA NLSMFNFINY FSRNADSFII GKFMSASILG SYNLAYRIML FPLQSLTFVA TRSLYPILSK QQSNNQHISK IYLKCVYVVL FITCPLMSGL AFYSEPFIRL IFGDEWYLTG IVLKWLAPTA IIQAVLSTSG SVFMAKGRTD ILLKLGIIGM ILQVGAFIIG VQYTITTFAM CYLLANVINF FPVMWSLMGL LGENLNVFFK KNYSIVLATL VMLSYLKLID YFFFPNAIAS LNILVLLSFS GAVVYLLASL TLSATLRKFV LSKIKK
|
| |
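The gene and protein sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below shows one way to do that in Python and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the example only processes the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First two ten-base blocks of the gene sequence shown in this record.
first_blocks = "ATGACATTGC TTTCAAATGC"
sequence = clean_sequence(first_blocks)
print(len(sequence), gc_percent(sequence))
```

Pasting the full Gene sequence value into `clean_sequence` in place of `first_blocks` would allow the reported length and GC content to be checked the same way.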