Gene Information |
Locus tag | ECH74115_4999 |
Symbol | |
ID | 6968589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4651071 |
End bp | 4652078 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643388680 |
Product | lipopolysaccharide 1,3-galactosyltransferase |
Protein accession | YP_002273107 |
Protein GI | 209396863 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
| |
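The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4651071, 4652078)
print(length)                  # 1008
print(protein_length(length))  # 335
```

Both values agree with the Gene Length and Protein Length rows of this record.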
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.13851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAAC TCAATGATAG TGACATCATC CTTTTTGAGT ATAATTTTCA TTATCAAAAT ATAAGATCTA AAAATACTCT TGATATAGCA TTTGGTATTG ACAGAAATTT TCTTTTTGGA TGTGGTGTAG CCATCGCATC TATTCTATTA AACAATAGAG AAATCTCTTG TGAATTTCAT GTTTTCACAG ATTATATCAG TGATAAAGAC AAATTATATT TTTCTGATTT AGCAAAACAA TATAATTCAA GAATTAATAT TTATGTTATC AATTGTGATA AGCTGAAGTC ATTACCAAGC ACGAAAAACT GGACTTACGC AACATATTTT CGATTTATAA TTGCAGATTA TTTCTATCAT AAACATGAAA AAATACTATA TCTTGATGCA GATATTGCTT GCAAGGGTAG TATTAAAGAA CTCTTAGATT ATCAATTTTC TACTAATGAA ATTGCCGCAG TTGTAGCTGA AAGAGATGTT GAATGGTGGC AAAATCGAGC CTCGGTATTA ACTACACCAC AGTTAGCTTC TGGATATTTC AATGCTGGTT TTTTACTGAT AAATATTGAT GAGTGGAATC TAAATAACAT TTCGTCAAAA GCTATTGAAA TGTTGCGTGA CCCAGATTGG GTAAGTAAAA TCACCCACCT TGATCAAGAT GTACTGAATG TATTATTGAA TGGTAAAGTG AAGTTTATTT CGGAGAAATA TAATACCCGA TATAGTATTA ACTATGAATT AAAAGACAAA GTTGATAATC CAGTCAATGA TGACACCGTG TTTATACACT ATGTGGGACC TACAAAACCT TGGCATGAGT GGGCTGACTA TCCGGTGTCA CGTAGTTTTT TGATCGCCAA AGCAGCTTCT CCGTGGAGTA AAGAAGATTT ACTTAAACCT GTAAATAGCA ATCAGTATCG GTATTGTGCA AAACATAAAT TTAAACAAAA GCATTATATG GCAGGCATTT TTAATTATTT AAAGTATTAT AAAGAAAAAT GCTTCTAA
|
Protein sequence | MSQLNDSDII LFEYNFHYQN IRSKNTLDIA FGIDRNFLFG CGVAIASILL NNREISCEFH VFTDYISDKD KLYFSDLAKQ YNSRINIYVI NCDKLKSLPS TKNWTYATYF RFIIADYFYH KHEKILYLDA DIACKGSIKE LLDYQFSTNE IAAVVAERDV EWWQNRASVL TTPQLASGYF NAGFLLINID EWNLNNISSK AIEMLRDPDW VSKITHLDQD VLNVLLNGKV KFISEKYNTR YSINYELKDK VDNPVNDDTV FIHYVGPTKP WHEWADYPVS RSFLIAKAAS PWSKEDLLKP VNSNQYRYCA KHKFKQKHYM AGIFNYLKYY KEKCF
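The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, its GC fraction computed, and its reading frame translated. It assumes the listed gene sequence is already in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code apart from its alternative start codons. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and only the first two blocks of the gene sequence are used as input.

```python
# Standard codon assignments, enumerated in TCAG order (assumed sufficient
# here because the gene starts with the canonical ATG start codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
prefix = clean_sequence("ATGTCTCAAC TCAATGATAG")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.35
print(translate(prefix))    # MSQLND
```

The translated prefix matches the first six residues of the listed protein sequence. Applying the same functions to the full gene sequence would be one way to check the GC content and protein rows of this record.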