Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1369 |
Symbol | |
ID | 6968114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1369743 |
End bp | 1370867 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643385347 |
Product | glycosyl transferase |
Protein accession | YP_002269842 |
Protein GI | 209399614 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.141679 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAAAC GTATTCTTTT TATTGGCCCA CCGCTGTACG GTTTGTTATA CCCATTGATT TCTCTGGCTC AGGCCTTTCG TGTAATTGGA CATGATGTAG TAATTAGTAG CGCTGGCAAA TTCGCGAATA AAGCAGCAGA AGCTGGACTG GTTGTTTTTG ATGCAGTTCC AGGTTTAGAT TCAGAGGCTG GATATCGCCA TCAGGAAGAG TTGAGGAAAA AAAGTAATAT TATTGGTCAT TTCTCTTTTT TTAGCGATGA AATGGCAGAT AACCTCATCG ATTTTGCAGG AAAATGGAGG CCAGATTTAA TAGTCTATCC CCCTCTTGAT CCGGCAGGCC CATTGGTTGC TGCTAAATAT AGAATTCCTT CAGTGATGCT GGCTGTTGGA TTCGCGCATA CATCTGCCCA TATTCAGATG TTAAACCGTT CTTTAAGCAA TGCTTACAGG CGGCATGGAG TCAGCGGTCC ACTATGTGAT TTAGCATGGA TTGATGTTGC TCCCCCAAGT ATGAGCATTC TTAAAAATGC TGGAGAACCG GTTATCTCAA TGAGATATAT TCCTTATAAC GGAGGTGCTG TAAAGGAAAC ATGGTGGGAC AGGGATTCTG ATCGGAAACG TTTACTTATC AGCCTTGGCA CTGTAAAACC AATGGTTGAT GGTCTGGAGC TGATTTCATG GGTTATGGAT TCTGCAAATG AAGTTGATGC TGATATCATT TTGCAACTTG CAATAAATGC TCGTACTGGT TTACGAAAAC TACCATCAAA TGTACGTCTG GTTGACTGGA TACCTATGGG TGTATTCCTT AATGGAGCTG ATGGATTTAT TCATCACGGT GGCGCAGGTA ATACCCTGAC AGCGCTGTAT AGTGGAATAC CACAGATTGT GTTTGGCGAA GGTGCAGATC GCTCTGTTAA TGCAGAAATT GTTGCGAAGC GTGGGTGTGG GATTATTCCG GACAAGCATG GACTGACCAG TGATTTGGTA AATCGCCTTC TTTATGATGA TTCACTACGC TTCTGTTCAG ATCAGGTAGC CGCTGAAATG GCTGAACAAC CCAGTCCTGC AGAGATTGCA GAGGTTTTGA TGAGAAAATT AAAAAACAAC GGGAAACAAT TGTAG
|
Protein sequence | MRKRILFIGP PLYGLLYPLI SLAQAFRVIG HDVVISSAGK FANKAAEAGL VVFDAVPGLD SEAGYRHQEE LRKKSNIIGH FSFFSDEMAD NLIDFAGKWR PDLIVYPPLD PAGPLVAAKY RIPSVMLAVG FAHTSAHIQM LNRSLSNAYR RHGVSGPLCD LAWIDVAPPS MSILKNAGEP VISMRYIPYN GGAVKETWWD RDSDRKRLLI SLGTVKPMVD GLELISWVMD SANEVDADII LQLAINARTG LRKLPSNVRL VDWIPMGVFL NGADGFIHHG GAGNTLTALY SGIPQIVFGE GADRSVNAEI VAKRGCGIIP DKHGLTSDLV NRLLYDDSLR FCSDQVAAEM AEQPSPAEIA EVLMRKLKNN GKQL
|
| |