Gene ECH74115_3794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3794 
Symbol 
ID6969354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3520301 
End bp3521857 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content51% 
IMG OID643387579 
Productputative transglycosylase 
Protein accessionYP_002272032 
Protein GI209399097 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4623] Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAAAAT TAAAGATTAA TTATCTGTTC ATCGGCATTC TGGCACTGCT GCTCGCGGTC 
GCTCTCTGGC CATCCATTCC CTGGTTTGGT AAAGCCGACA ACCGTATCGC CGCCATTCAA
GCGCGGGGAG AGTTGCGTGT GAGCACCATT CATACTCCCC TGACGTATAA CGAAATCAAC
GGGAAACCTT TTGGCCTGGA TTACGAACTG GCGAAACAGT TTGCCGATTA CCTCGGCGTA
AAACTGAAAG TGACCGTGCG GCAGAATATC AGCCAGCTGT TTGACGACCT CGATAATGGT
AACGCCGACC TGCTGGCGGC AGGACTTGTC TATAACAGTG AGCGGGTAAA AAATTATCAG
CCTGGCCCTA CCTATTATTC CGTGTCACAA CAACTGGTTT ATAAAGTGGG TCAGTATCGC
CCACGTACGC TGGGCAACCT GACGGCGGAG CAACTCACCG TTGCACCGGG TCATGTGGTG
GTTAACGATC TCCAGACCCT GAAAGAAACA AAATTCCCGG AATTAAGCTG GAAGGTAGAC
GACAAAAAAG GCTCTGCGGA ATTAATGGAA GATGTCATCG AAGGAAAACT CGATTACACC
ATTGCTGATT CTGTCGCCAT CAGCCTGTTT CAGCGCGTTC ACCCGGAGCT CGCCGTAGCG
CTCGATATCA CCGATGAACA ACCGGTGACC TGGTTTAGCC CGTTAGATGG CGATAATACC
CTTTCCGCCG CCCTGCTCGA CTTCTTCAAC GAAATGAATG AAGACGGTAC GCTGGCACGC
ATTGAAGAGA AATACCTGGG GCATGGCGAT GATTTTGATT ACGTCGATAC GCGCACATTT
TTACGCGCCG TCGATGCGGT ACTGCCGCAG TTAAAGCCCC TGTTTGAGAA ATACGCCGAA
GAAATTGACT GGCGTTTGCT GGCCGCTATT GCTTATCAGG AATCGCACTG GGATGCACAG
GCCACTTCAC CGACGGGTGT GCGCGGCATG ATGATGTTAA CCAAAAATAC CGCGCAAAGC
CTCGGCATTA CGGATCGTAC CGATGCCGAA CAGAGCATCA GCGGTGGCGT GCGTTATTTG
CAGGATATGA TGAGTAAAGT GCCGGAAAGT GTGCCGGAGA ACGAGCGGAT CTGGTTTGCC
CTCGCTGCGT ACAATATGGG CTATGCGCAT ATGCAGGATG CCCGCGCCCT GACGGCAAAA
ACCAAAGGGA ATCCTGACAG TTGGGCTGAC GTAAAACAGC GTCTGCCTTT ACTTAGCCAG
AAACCCTATT ACAGCAAGCT GACTTACGGC TACGCTCGTG GGCATGAAGC CTACGCTTAT
GTCGAAAATA TTCGTAAGTA TCAGATTAGC CTGGTGGGTT ATCTGCAAGA GAAAGAGAAG
CAGGCTACAG AAGCGGCGAT GCAACTGGCG CAGGATTATC CGGCGGTATC GCCTACGGAG
TTGGGCAAAG AGAAATTTCC TTTTCTCTCG TTTCTTTCCC AGTCGTCATC AAACTATTTG
ACCCATTCTC CCTCTCTGCT GTTTTCCAGG AAAGGGAGTG AAGAGAAACA AAATTAA
 
Protein sequence
MKKLKINYLF IGILALLLAV ALWPSIPWFG KADNRIAAIQ ARGELRVSTI HTPLTYNEIN 
GKPFGLDYEL AKQFADYLGV KLKVTVRQNI SQLFDDLDNG NADLLAAGLV YNSERVKNYQ
PGPTYYSVSQ QLVYKVGQYR PRTLGNLTAE QLTVAPGHVV VNDLQTLKET KFPELSWKVD
DKKGSAELME DVIEGKLDYT IADSVAISLF QRVHPELAVA LDITDEQPVT WFSPLDGDNT
LSAALLDFFN EMNEDGTLAR IEEKYLGHGD DFDYVDTRTF LRAVDAVLPQ LKPLFEKYAE
EIDWRLLAAI AYQESHWDAQ ATSPTGVRGM MMLTKNTAQS LGITDRTDAE QSISGGVRYL
QDMMSKVPES VPENERIWFA LAAYNMGYAH MQDARALTAK TKGNPDSWAD VKQRLPLLSQ
KPYYSKLTYG YARGHEAYAY VENIRKYQIS LVGYLQEKEK QATEAAMQLA QDYPAVSPTE
LGKEKFPFLS FLSQSSSNYL THSPSLLFSR KGSEEKQN