Gene Information |
Locus tag | ECH74115_4997 |
Symbol | |
ID | 6972029 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4649300 |
End bp | 4650313 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643388678 |
Product | lipopolysaccharide 1,2-glucosyltransferase |
Protein accession | YP_002273105 |
Protein GI | 209398034 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0250725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGATTTTA AACATCTTAC TCAATTTAAA GATATAATTG AACTGGACAA GCGCCCCGTT AAACTTGATG AACGGGAAAC GTTTAATGTC TCATGGGGTA TTGATGAGAA CTACCAGGTT GGGGCTGCGA TTTCAATTGC TTCAATTCTT GAAAATAATA AACAAAACAA ATTTACCTTT CACATAATCG CTGATTACTT AGACAAAGAG TATATTGAAT TATTATCACA ATTAGCAACG AAGTATCAAA CAGTAATTAA ATTATATCAT ATTGATTCTG AGCCATTGAA GGCGCTACCT CAATCAAATA TCTGGCCAGT ATCTATTTAT TATCGTTTGC TTTCATTTGA TTATTTTTCT GCGCGATTGG ATTCATTATT ATATCTTGAT GCTGATATCG TCTGTAAGGG TTCATTGAAC GAGTTAATAG CATTAGAGTT TAAAGATGAA TATGGGGCAG TGGTAATTGA TGTAGATGCT ATGCAAAGTA AAAGCGCTGA GCGTTTGTGT AATGAGGATT TTAACGGTAG CTATTTTAAC TCTGGTGTAA TGTATATTAA TTTACGGGAA TGGTTAAAAC AAAGACTAAC GGAAAAATTC TTTGATCTAT TATCAGATGA GTCAATTATA AAAAAATTAA AGTACCCGGA TCAAGATATT TTAAACTTAA TGTTTCTACA TCATGCTAAA ATATTACCGA GAAAATATAA TTGTATTTAT ACTATAAAGT CAGAATTTGA AGAAAAAAAT AGTGAATATT ACACCCGGTT TATTAATGAT GACACTGTCT TCATACATTA TACTGGTATA ACTAAGCCAT GGCATGATTG GGCGAACTAC GCCTCTGCAG ATTATTTTCG TAATATTTAT AATATATCAC CATGGAGAAA TATACCTTAT AAAAAAGCTG TTAAAAAACA TGAGTACAAA GAAAAATATA AACACTTGCT TTACCAGAAA AAATTTCTCG ATGGTGTTTT TACAGCAATT AAATATAATG TTATGAAAGG TTAA
|
Protein sequence | MDFKHLTQFK DIIELDKRPV KLDERETFNV SWGIDENYQV GAAISIASIL ENNKQNKFTF HIIADYLDKE YIELLSQLAT KYQTVIKLYH IDSEPLKALP QSNIWPVSIY YRLLSFDYFS ARLDSLLYLD ADIVCKGSLN ELIALEFKDE YGAVVIDVDA MQSKSAERLC NEDFNGSYFN SGVMYINLRE WLKQRLTEKF FDLLSDESII KKLKYPDQDI LNLMFLHHAK ILPRKYNCIY TIKSEFEEKN SEYYTRFIND DTVFIHYTGI TKPWHDWANY ASADYFRNIY NISPWRNIPY KKAVKKHEYK EKYKHLLYQK KFLDGVFTAI KYNVMKG
|
| |
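The coordinate, length, and translation fields in this record can be cross-checked against each other. The following is a minimal sketch, not the database's own method: the function names `feature_length` and `translate_cds` are hypothetical, the codon table is built from the standard NCBI amino-acid string (translation table 11 shares its amino-acid assignments with table 1 and differs in its permitted start codons), and the gene sequence is assumed to be given in coding orientation, as it appears to be here despite the minus strand (it begins with TTG and ends with TAA).

```python
from itertools import product

# NCBI codon ordering: each position cycles through T, C, A, G,
# with the first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def translate_cds(seq: str) -> str:
    """Translate a coding sequence given in coding orientation.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    The first codon must be a table 11 start codon and is translated as M.
    Translation stops at the first stop codon.
    """
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # an alternative start codon such as TTG still initiates with Met
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates from the record: Start bp 4649300, End bp 4650313.
gene_len = feature_length(4649300, 4650313)
print(gene_len)            # 1014
print(gene_len // 3 - 1)   # 337 (codons minus the stop codon)

# First 30 bases of the gene sequence, plus the terminal TAA stop codon.
print(translate_cds("TTGGATTTTA AACATCTTAC TCAATTTAAA TAA"))  # MDFKHLTQFK
```

Passing the full Gene sequence value to `translate_cds` should reproduce the Protein sequence value once the spaces are stripped from both; the sketch only demonstrates the first ten residues.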