Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3540 |
Symbol | tldD |
ID | 6143370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3617502 |
End bp | 3618947 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618369 |
Product | protease TldD |
Protein accession | YP_001745516 |
Protein GI | 170680441 |
COG category | [R] General function prediction only |
COG ID | [COG0312] Predicted Zn-dependent proteases and their inactivated homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTTA ACCTGGTAAG TGAACAATTG CTAGCGGCGA ACGGCCTGAA ACATCAGGAC TTGTTCGCGA TCCTCGGTCA ACTGGCCGAA CGTCGCCTTG ATTATGGCGA TCTCTATTTT CAGTCGAGCT ATCACGAATC CTGGGTTTTA GAAGACCGCA TTATTAAAGA TGGTTCTTAC AACATCGATC AGGGCGTTGG TGTGCGTGCA ATCAGCGGTG AAAAAACCGG ATTTGCTTAC GCTGACCAAA TCAGCCTGCT GGCGCTGGAA CAAAGTGCGC AAGCGGCGCG CACCATCGTC CGTGATAGTG GTGATGGCAA AGTACAGACG CTGGGCGCGG TAGAGCATAG CCCGTTGTAT ACCTCGGTAG ATCCGCTGCA AAGCATGAGC CGTGAAGAGA AGCTGGATAT CCTGCGTCGC GTCGATAAGG TTGCCCGCGA AGCGGACAAG CGCGTACAAG AAGTGACTGC CAGCCTCAGC GGCGTTTATG AATTAATTCT GGTTGCGGCC ACCGACGGCA CGCTGGCGGC GGATGTTCGT CCGCTGGTGC GTCTTTCCGT GAGCGTTCTG GTCGAAGAAG ATGGCAAACG CGAACGCGGT GCCAGTGGCG GCGGCGGTCG TTTTGGTTAT GAGTTCTTCC TTGCCGATCT CGACGGTGAA GTCCGTGCGG ATGCATGGGC AAAAGAAGCT GTGCGTATGG CGCTGGTCAA TCTTTCTGCC GTCGCCGCAC CAGCAGGCAC CATGCCGGTA GTACTTGGCG CAGGTTGGCC GGGCGTGCTG TTGCATGAAG CGGTGGGTCA CGGTCTGGAA GGCGACTTCA ACCGCCGTGG CACTTCAGTA TTTAGTGGAC AGGTCGGGGA GCTGGTGGCT TCAGAACTGT GTACCGTGGT TGATGACGGT ACGATGGTCG ATCGCCGTGG TTCGGTGGCG ATTGATGACG AAGGTACGCC AGGCCAGTAC AACGTGCTGA TTGAGAACGG CATTCTGAAA GGCTACATGC AGGATAAACT CAACGCGCGT TTGATGGGGA TGACGCCGAC TGGCAACGGT CGCCGTGAAT CCTACGCCCA TCTGCCCATG CCGCGTATGA CCAACACCTA TATGCTGCCG GGTAAATCGA CCCCGCAGGA AATTATTGAA TCCGTTGAGT ACGGTATCTA TGCACCGAAC TTTGGTGGCG GTCAGGTGGA TATCACCTCC GGCAAATTCG TTTTCTCCAC TTCAGAAGCG TATCTGATTG AAAACGGTAA AGTAACGAAG CCGGTGAAAG GCGCAACGTT GATTGGTTCC GGTATCGAAA CCATGCAGCA GATTTCGATG GTTGGCAACG ACCTGAAACT GGATAACGGC GTGGGTGTCT GCGGTAAAGA AGGGCAAAGT TTGCCGGTTG GCGTGGGCCA GCCAACGCTG AAAGTTGATA ACCTGACTGT TGGCGGTACT GCGTAA
|
Protein sequence | MSLNLVSEQL LAANGLKHQD LFAILGQLAE RRLDYGDLYF QSSYHESWVL EDRIIKDGSY NIDQGVGVRA ISGEKTGFAY ADQISLLALE QSAQAARTIV RDSGDGKVQT LGAVEHSPLY TSVDPLQSMS REEKLDILRR VDKVAREADK RVQEVTASLS GVYELILVAA TDGTLAADVR PLVRLSVSVL VEEDGKRERG ASGGGGRFGY EFFLADLDGE VRADAWAKEA VRMALVNLSA VAAPAGTMPV VLGAGWPGVL LHEAVGHGLE GDFNRRGTSV FSGQVGELVA SELCTVVDDG TMVDRRGSVA IDDEGTPGQY NVLIENGILK GYMQDKLNAR LMGMTPTGNG RRESYAHLPM PRMTNTYMLP GKSTPQEIIE SVEYGIYAPN FGGGQVDITS GKFVFSTSEA YLIENGKVTK PVKGATLIGS GIETMQQISM VGNDLKLDNG VGVCGKEGQS LPVGVGQPTL KVDNLTVGGT A
|
| |