Gene Information |
Locus tag | EcHS_A2190 |
Symbol | |
ID | 5594125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2171069 |
End bp | 2172496 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640921323 |
Product | sugar transferase |
Protein accession | YP_001458862 |
Protein GI | 157161544 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP; [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
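Note: the coordinate and length fields above can be cross-checked with simple arithmetic. The Python sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2171069, 2172496)
print(length_bp)                  # 1428
print(protein_length(length_bp))  # 475
```

Both results agree with the Gene Length and Protein Length fields of the record.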
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000000000191042 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCTTC TCGTAAAGAA TGTCATAGTT AGTATCACAT TGGCTTTATC AGATTTCATT TCATTCATTC TTTCTTTATA CATAGCGTTG GATGTGTTGT CATTAATTGT AGAAAAGAAT GATTATTTAT TATTAACTAA TGAAATTGAT GGGTGGTTTT TACTTCATTG GATTCTCGGT GTTTGTTGTG TAGGCTGGTA TTCAGTTAGA CTAAGGCATT ATTTTTATCG TAAACCATTT TGGTTTGAAC TTAAAGAAAT ACTTCGAACC TTGGTTATTT TTGCAATAAT CGAAATTGCT GTTATGGCTT TTACCACTTG GTCATTCTCA CGTTACTTAT GGGTGATAAC GTGGTTATTT GTAATCATAC TTGTTCCATT GTCAAGAATG GTAACAAAGA CCTTATTGAA TAAAGCGGGG GTGTGGCAAA GAGATACTTG GATAGTAGGT AGTAGTGAAA ATGCTCATGA AGCATATAAA GCAATTAGCG GAGAAAAAAA CTTAGGTTTA AACATTGTCG GTTTCATATC AAGTCCTGGC GGAGTTCCAT CAGGAGAGGC AATTGATGGT ATACCAGTGA TTGAAAATAA TCTAGAATGG CTTAATGATA TTGATCCCAA AACACAATTC ATTGTTGCTG TTGAGTCTCA TCAAAGTAAA ATAAGAAATG CGTGGCTTAG AAACTTTATG ATTAAAGGCT ATCGATATGT CTCAGTTATT CCAACATTGC GTGGAATGCC TCTTGATAGC ACAGATATGT CATTCATATT TAGTCATGAA GTTATGATTT TTCGTGTACA GCAAAATCTA GCCAAATGGT CGTCAAGAAT TCTCAAAAGA CTTTTTGATA TTATAGGCTC ATTGTCTATA ATTACTCTTC TTTCACCAGT ATTATTATAT ATAAGTCTTA AAGTTAAAAA AGATGGAGGC CCGGCAATAT ATGGTCATGA AAGGATCGGG AAAGGAGGTA AACCCTTTAA GTGTTTGAAG TTCAGGTCTA TGGTAATCAA TTCCAAAGAA GTTCTTGAAG AACTTCTAAA TAACGATATA AATGCGCGAG AAGAATGGAA TTTAACATTC AAATTGAAAA ACGACCCAAG AATAACTAAG ATAGGCGGTT TCCTTAGAAG GACAAGCCTG GATGAATTGC CGCAGTTATT TAATGTATTA AAAGGGGAAA TGAGTTTAGT TGGACCAAGG CCTATCATTA CTGCTGAGTT AGAAAGATAC AATGAAGAAG TTGATTATTA TTTATTGAGC AAACCAGGAA TGACAGGCCT ATGGCAAGTT AGTGGGCGTA GTGATGTGGA TTATGAGACA CGCGTTTATC TTGATGCTTG GTACGTTAAA AATTGGTCAA TGTGGAATGA TATCGCGATT TTATTTAAAA CTGTGGGTGT TGTATTAAAA AGAGACGGTG CGTATTAA
|
Protein sequence | MRLLVKNVIV SITLALSDFI SFILSLYIAL DVLSLIVEKN DYLLLTNEID GWFLLHWILG VCCVGWYSVR LRHYFYRKPF WFELKEILRT LVIFAIIEIA VMAFTTWSFS RYLWVITWLF VIILVPLSRM VTKTLLNKAG VWQRDTWIVG SSENAHEAYK AISGEKNLGL NIVGFISSPG GVPSGEAIDG IPVIENNLEW LNDIDPKTQF IVAVESHQSK IRNAWLRNFM IKGYRYVSVI PTLRGMPLDS TDMSFIFSHE VMIFRVQQNL AKWSSRILKR LFDIIGSLSI ITLLSPVLLY ISLKVKKDGG PAIYGHERIG KGGKPFKCLK FRSMVINSKE VLEELLNNDI NAREEWNLTF KLKNDPRITK IGGFLRRTSL DELPQLFNVL KGEMSLVGPR PIITAELERY NEEVDYYLLS KPGMTGLWQV SGRSDVDYET RVYLDAWYVK NWSMWNDIAI LFKTVGVVLK RDGAY
|
| |
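Note: the sequences above are printed in space-separated blocks of ten characters. The Python sketch below shows one possible way to strip that formatting, compute a GC fraction, and translate the coding sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG) and uses the standard codon assignments, which translation table 11 shares for elongation. The function names are hypothetical; only the first two sequence blocks are used as input here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace used for block formatting and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence from the record above.
fragment = "ATGAGGCTTC TCGTAAAGAA"
print(translate(fragment))    # MRLLVK
print(gc_fraction(fragment))  # 0.4
```

The translated fragment matches the first six residues of the listed protein sequence. Passing the full gene sequence to the same functions would allow the GC content and protein sequence fields to be recomputed.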