Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4806 |
Symbol | |
ID | 6971643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4441488 |
End bp | 4443164 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643388498 |
Product | glycosyl transferase, family 2 |
Protein accession | YP_002272926 |
Protein GI | 209398505 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG4261] Predicted acyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.705565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGTAA ACTTTTCTCC CTGCGTGTTG ATACCCTGCT ACAACCACGG CGCGATGATG CCGGGCGTGC TGGCGCGTCT TAAGCCATTT AATCTGCCCT GTATTGTGGT GGATGACGGC AGCGATGCCA CCACACAACA GCAACTGGAC AGTCTGCTTG CCGAACAGCC TGGCGTGACC TTAATTCGCC TGGCAGAAAA CGCAGGCAAA GGCGCGGCGG TGATGCGTGG CTTACAGGCA GCGGCAGACG CAGGGTTCAG CCATGCGGTG CAGGTGGATG CTGACGGTCA GCACGCGATT GAAGATATCC CTAAACTGCT GGCTCTCGCT GAACAACAAC CTGCGGCACT GATCTCCGGC CAGCCAATTT ACGATGACTC CATCCCCCGC TCACGGCTTT ACGGGCGCTG GGTCACCCAC GTCTGGGTAT GGATCGAAAC GCTCTCCCTG CAACTGAAAG ACAGCATGTG CGGTTTTCGC GTTTATCCGG TTGCGCCAAC GCTGCAACTG GCAAAACACG CCACCATCGG CAAGCGGATG GATTTCGACA CCGAAGTGAT GGTGCGCCTC TACTGGCAGG GAAATACCAG CTATTTCGTG CCGACCCGCG TCACCTATCC ACTGGACGGG CTTTCGCATT TTGATGCCCT GAAAGATAAC GTCCGCATCT CGCTCATGCA CACGCGTCTG TTTTTCGGCA TGTTGCCGCG TATTCCTTCA CTGCTGATGC GCCGCTCTTC CTGCCACTGG GCGCGGCAGA GTGAAGTGAA AGGATTATGG GGAATGCGCC TGATGCTGCT GGTCTGGCGT CTGCTGGGAA GAACGGCGTT TAGCGCGCTG CTTTACCCGG TGGTGGGCGT CTACTGGCTC ACTGCTTCTC GTGCGCGCAA AGCGTCGCAA GACTGGCTCG CCCGTGTACG ACAGCATCAA CCACAGGCGG CAAAACTCAA CAGCTATCAG CACTTTCTAC GTTTCGGTAA TGCCATGCTC GACAAAATCG CCAGCTGGCG CGGCGAGCTA CAACCAGGGC GTGATGTGCT GTTTGCGCCA GGCGCAGAAG CAGCGCTTGA CGTCCGCGAT CCGCGCGGCA AATTGCTGCT GGCCTCGCAT CTTGGCGATG TGGAAGTGTG CCGGGCGCTG GCAAAAATTC AGGGCTACAA AACCATTAAC GCGCTGGTGT TTAGCGAAAA CGCCCAACGC TTTAAACAGA TAATGCAGGA GATGGCTCCT CAGGCAGGCA TTAACCTGAT GCCGGTAACA GATATCGGCC CAGAAACCGC CATCCTGCTG AAAGAGAAGC TGGATAACGG CGAATGGGTG GCGATTGTCG GTGACCGCAT CGCCGTCAAC CCGCAACGCG GCGGCGACTG GCGCGTCTGC TGGAGTTCGT TTATGGGCCA GCCTGCGCCT TTCCCACAGG GGCCGTTTAT TCTCGCCTCT ATTTTGCGCT GCCCGGTGAA TCTGATTTTC GCCCTGCGCC AGCACGGCAA GCTGCATATT CACTGCGAAA GCTTTGCCGA CCCACTGCAG CTGCCGCGCG GCGAACGCCA ACAGGCGCTG CAAAACGCTA TCGATCATTA CGCCGCGCGT CTGGAACATT ACGCGCTCCA GTCGCCTCTC GACTGGTTTA ATTTTTTCGA TTTCTGGCAA CTGCCGGAAA TTCAGGACAA GGAGTAA
|
Protein sequence | MSVNFSPCVL IPCYNHGAMM PGVLARLKPF NLPCIVVDDG SDATTQQQLD SLLAEQPGVT LIRLAENAGK GAAVMRGLQA AADAGFSHAV QVDADGQHAI EDIPKLLALA EQQPAALISG QPIYDDSIPR SRLYGRWVTH VWVWIETLSL QLKDSMCGFR VYPVAPTLQL AKHATIGKRM DFDTEVMVRL YWQGNTSYFV PTRVTYPLDG LSHFDALKDN VRISLMHTRL FFGMLPRIPS LLMRRSSCHW ARQSEVKGLW GMRLMLLVWR LLGRTAFSAL LYPVVGVYWL TASRARKASQ DWLARVRQHQ PQAAKLNSYQ HFLRFGNAML DKIASWRGEL QPGRDVLFAP GAEAALDVRD PRGKLLLASH LGDVEVCRAL AKIQGYKTIN ALVFSENAQR FKQIMQEMAP QAGINLMPVT DIGPETAILL KEKLDNGEWV AIVGDRIAVN PQRGGDWRVC WSSFMGQPAP FPQGPFILAS ILRCPVNLIF ALRQHGKLHI HCESFADPLQ LPRGERQQAL QNAIDHYAAR LEHYALQSPL DWFNFFDFWQ LPEIQDKE
|
| |