Gene ECH74115_4561 details

Gene Information

Locus tag: ECH74115_4561
Symbol: tldD
ID: 6971208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4227584
End bp: 4229029
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 55%
IMG OID: 643388272
Product: protease TldD
Protein accession: YP_002272707
Protein GI: 209398830
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
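
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information table above.
start_bp = 4227584
end_bp = 4229029
gene_length_bp = 1446
protein_length_aa = 481


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1446, matches Gene Length
print(coding_codons(gene_length_bp))   # 481, matches Protein Length
```

Under these assumptions both derived values agree with the listed gene and protein lengths.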


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.657779
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTTA ACCTGGTAAG TGAACAATTG CTAGCGGCGA ACGGCCTGAA ACATCAGGAC 
TTGTTCGTGA TCCTCGGTCA ACTGGCCGAA CGTCGCCTTG ATTATGGCGA TCTCTATTTT
CAGTCGAGCT ATCACGAATC CTGGGTTTTA GAAGACCGCA TTATTAAAGA TGGTTCTTAC
AACATCGATC AGGGCGTTGG TGTGCGTGCA ATCAGCGGTG AAAAAACCGG ATTTGCTTAC
GCTGACCAAA TCAGCCTGCT GGCGCTGGAA CAGAGTGCGC AAGCGGCGCG CACCATCGCC
CGTGATAGCG GTGATGGCAA AGTACAGACG CTGGGCGCGG TAGAGCATAG CCCGTTGTAT
ACCTCGGTAG ATCCGCTGCA AAGCATGAGC CGTGAAGAGA AGCTGGATAT CCTGCGTCGC
GTCGATAAGG TTGCCCGCGA AGCGGACAAG CGCGTACAGG AAGTGACTGC CAGCCTCAGT
GGTGTCTATG AATTAATTCT GGTTGCGGCC ACCGACGGCA CGCTAGCGGC GGATGTCCGT
CCGCTGGTGC GTCTTTCCGT GAGCGTTCTG ATCGAAGAAG ATGGCAAACG CGAACGCGGT
GCCAGTGGCG GCGGCGGTCG TTTTGGTTAT GAGTTCTTCC TTGCCGATCT CGACGGCGAA
GTCCGTGCGG ATGCATGGGC AAAAGAAGCA GTACGTATGG CGCTGGTCAA TCTTTCTGCC
GTTGCTGCAC CAGCGGGCAC CATGCCGGTA GTACTTGGCG CAGGTTGGCC GGGCGTGCTG
TTGCATGAAG CGGTAGGTCA CGGTCTGGAA GGCGACTTCA ACCGCCGTGG CACTTCAGTA
TTTAGTGGAC AGGTCGGGGA GCTGGTGGCT TCAGAACTGT GTACCGTGGT TGATGACGGT
ACGATGGTCG ATCGCCGTGG TTCGGTGGCG ATTGATGACG AAGGTACGCC AGGCCAGTAC
AACGTGCTGA TTGAGAACGG CATTCTGAAA GGCTACATGC AGGATAAACT CAACGCGCGT
TTGATGGGGA TGACGCCGAC TGGCAACGGT CGCCGTGAAT CCTACGCCCA TCTGCCCATG
CCGCGTATGA CCAACACCTA TATGCTGCCG GGTAAATCGA CCCCGCAGGA AATTATTGAA
TCCGTTGAGT ACGGTATCTA TGCACCGAAC TTTGGTGGCG GTCAGGTGGA TATCACCTCC
GGCAAATTCG TTTTCTCCAC TTCAGAAGCA TATCTGATTG AAAACGGTAA AGTAACGAAG
CCGGTGAAAG GCGCAACGTT GATTGGTTCC GGTATCGAAA CCATGCAGCA GATTTCGATG
GTTGGCAACG ACCTGAAACT GGATAACGGC GTGGGTGTCT GCGGTAAAGA AGGGCAAAGT
TTGCCGGTTG GCGTGGGCCA GCCAACGCTG AAAGTTGATA ACCTGACTGT TGGCGGTACT
GCGTAA
 
Protein sequence
MSLNLVSEQL LAANGLKHQD LFVILGQLAE RRLDYGDLYF QSSYHESWVL EDRIIKDGSY 
NIDQGVGVRA ISGEKTGFAY ADQISLLALE QSAQAARTIA RDSGDGKVQT LGAVEHSPLY
TSVDPLQSMS REEKLDILRR VDKVAREADK RVQEVTASLS GVYELILVAA TDGTLAADVR
PLVRLSVSVL IEEDGKRERG ASGGGGRFGY EFFLADLDGE VRADAWAKEA VRMALVNLSA
VAAPAGTMPV VLGAGWPGVL LHEAVGHGLE GDFNRRGTSV FSGQVGELVA SELCTVVDDG
TMVDRRGSVA IDDEGTPGQY NVLIENGILK GYMQDKLNAR LMGMTPTGNG RRESYAHLPM
PRMTNTYMLP GKSTPQEIIE SVEYGIYAPN FGGGQVDITS GKFVFSTSEA YLIENGKVTK
PVKGATLIGS GIETMQQISM VGNDLKLDNG VGVCGKEGQS LPVGVGQPTL KVDNLTVGGT
A
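
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino-acid assignments of NCBI translation table 11 (the same as the standard code; alternative start codons are ignored because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last lines of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
# Codon table built from the NCBI base order T, C, A, G.
# The amino-acid string is the table 11 assignment (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence shown above.
first_line = "ATGAGTCTTA ACCTGGTAAG TGAACAATTG CTAGCGGCGA ACGGCCTGAA ACATCAGGAC"
last_line = "GCGTAA"

print(translate(first_line))  # MSLNLVSEQLLAANGLKHQD
print(translate(last_line))   # A
```

The two printed fragments correspond to the first twenty residues and the final residue of the protein sequence listed above. `gc_percent` can be applied to the full gene sequence to compare against the GC content field.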