Gene ECH74115_4806 details

Gene Information

Locus tag: ECH74115_4806
Symbol:
ID: 6971643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4441488
End bp: 4443164
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 58%
IMG OID: 643388498
Product: glycosyl transferase, family 2
Protein accession: YP_002272926
Protein GI: 209398505
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis; [COG4261] Predicted acyltransferase
TIGRFAM ID:
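
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (an assumption, though it matches the listed 1677 bp) and that the last codon is a stop codon that is not counted in the protein. The variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 4441488
end_bp = 4443164

# With inclusive coordinates both endpoints count, hence the +1.
gene_length = end_bp - start_bp + 1

# Three bases per codon; the final codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1677
print(protein_length)  # 558
```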


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.705565
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGTAA ACTTTTCTCC CTGCGTGTTG ATACCCTGCT ACAACCACGG CGCGATGATG 
CCGGGCGTGC TGGCGCGTCT TAAGCCATTT AATCTGCCCT GTATTGTGGT GGATGACGGC
AGCGATGCCA CCACACAACA GCAACTGGAC AGTCTGCTTG CCGAACAGCC TGGCGTGACC
TTAATTCGCC TGGCAGAAAA CGCAGGCAAA GGCGCGGCGG TGATGCGTGG CTTACAGGCA
GCGGCAGACG CAGGGTTCAG CCATGCGGTG CAGGTGGATG CTGACGGTCA GCACGCGATT
GAAGATATCC CTAAACTGCT GGCTCTCGCT GAACAACAAC CTGCGGCACT GATCTCCGGC
CAGCCAATTT ACGATGACTC CATCCCCCGC TCACGGCTTT ACGGGCGCTG GGTCACCCAC
GTCTGGGTAT GGATCGAAAC GCTCTCCCTG CAACTGAAAG ACAGCATGTG CGGTTTTCGC
GTTTATCCGG TTGCGCCAAC GCTGCAACTG GCAAAACACG CCACCATCGG CAAGCGGATG
GATTTCGACA CCGAAGTGAT GGTGCGCCTC TACTGGCAGG GAAATACCAG CTATTTCGTG
CCGACCCGCG TCACCTATCC ACTGGACGGG CTTTCGCATT TTGATGCCCT GAAAGATAAC
GTCCGCATCT CGCTCATGCA CACGCGTCTG TTTTTCGGCA TGTTGCCGCG TATTCCTTCA
CTGCTGATGC GCCGCTCTTC CTGCCACTGG GCGCGGCAGA GTGAAGTGAA AGGATTATGG
GGAATGCGCC TGATGCTGCT GGTCTGGCGT CTGCTGGGAA GAACGGCGTT TAGCGCGCTG
CTTTACCCGG TGGTGGGCGT CTACTGGCTC ACTGCTTCTC GTGCGCGCAA AGCGTCGCAA
GACTGGCTCG CCCGTGTACG ACAGCATCAA CCACAGGCGG CAAAACTCAA CAGCTATCAG
CACTTTCTAC GTTTCGGTAA TGCCATGCTC GACAAAATCG CCAGCTGGCG CGGCGAGCTA
CAACCAGGGC GTGATGTGCT GTTTGCGCCA GGCGCAGAAG CAGCGCTTGA CGTCCGCGAT
CCGCGCGGCA AATTGCTGCT GGCCTCGCAT CTTGGCGATG TGGAAGTGTG CCGGGCGCTG
GCAAAAATTC AGGGCTACAA AACCATTAAC GCGCTGGTGT TTAGCGAAAA CGCCCAACGC
TTTAAACAGA TAATGCAGGA GATGGCTCCT CAGGCAGGCA TTAACCTGAT GCCGGTAACA
GATATCGGCC CAGAAACCGC CATCCTGCTG AAAGAGAAGC TGGATAACGG CGAATGGGTG
GCGATTGTCG GTGACCGCAT CGCCGTCAAC CCGCAACGCG GCGGCGACTG GCGCGTCTGC
TGGAGTTCGT TTATGGGCCA GCCTGCGCCT TTCCCACAGG GGCCGTTTAT TCTCGCCTCT
ATTTTGCGCT GCCCGGTGAA TCTGATTTTC GCCCTGCGCC AGCACGGCAA GCTGCATATT
CACTGCGAAA GCTTTGCCGA CCCACTGCAG CTGCCGCGCG GCGAACGCCA ACAGGCGCTG
CAAAACGCTA TCGATCATTA CGCCGCGCGT CTGGAACATT ACGCGCTCCA GTCGCCTCTC
GACTGGTTTA ATTTTTTCGA TTTCTGGCAA CTGCCGGAAA TTCAGGACAA GGAGTAA
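
A GC content figure such as the one listed in the gene information is the percentage of G and C bases in the sequence. The sketch below shows one way to compute it from a block-formatted sequence like the one above. The function name `gc_percent` is hypothetical, and the example runs on the first 10-base block only, so its result is not the whole-gene value.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)

# First 10-base block of the gene sequence, used only as a small input.
first_block = "ATGTCGGTAA"
print(gc_percent(first_block))  # 40.0
```

Pasting the full gene sequence into the same function, spaces and line breaks included, would give the whole-gene percentage.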
 
Protein sequence
MSVNFSPCVL IPCYNHGAMM PGVLARLKPF NLPCIVVDDG SDATTQQQLD SLLAEQPGVT 
LIRLAENAGK GAAVMRGLQA AADAGFSHAV QVDADGQHAI EDIPKLLALA EQQPAALISG
QPIYDDSIPR SRLYGRWVTH VWVWIETLSL QLKDSMCGFR VYPVAPTLQL AKHATIGKRM
DFDTEVMVRL YWQGNTSYFV PTRVTYPLDG LSHFDALKDN VRISLMHTRL FFGMLPRIPS
LLMRRSSCHW ARQSEVKGLW GMRLMLLVWR LLGRTAFSAL LYPVVGVYWL TASRARKASQ
DWLARVRQHQ PQAAKLNSYQ HFLRFGNAML DKIASWRGEL QPGRDVLFAP GAEAALDVRD
PRGKLLLASH LGDVEVCRAL AKIQGYKTIN ALVFSENAQR FKQIMQEMAP QAGINLMPVT
DIGPETAILL KEKLDNGEWV AIVGDRIAVN PQRGGDWRVC WSSFMGQPAP FPQGPFILAS
ILRCPVNLIF ALRQHGKLHI HCESFADPLQ LPRGERQQAL QNAIDHYAAR LEHYALQSPL
DWFNFFDFWQ LPEIQDKE
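
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch of that mapping follows. It builds the codon table from the NCBI-style amino acid string (for table 11 the amino acid assignments are the same as in the standard code; the tables differ in permitted start codons), and it does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence listed in this record.
gene_start = "ATGTCGGTAA ACTTTTCTCC CTGCGTGTTG"
print(translate(gene_start))  # MSVNFSPCVL
```

The result matches the first ten residues of the protein sequence above.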