Gene Information |
Locus tag | ECH74115_3782 |
Symbol | |
ID | 6967023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3504082 |
End bp | 3507456 |
Gene Length | 3375 bp |
Protein Length | 1124 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643387569 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_002272022 |
Protein GI | 209397136 |
COG category | |
COG ID | |
TIGRFAM ID | |
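The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the 3375 bp gene length includes the terminal stop codon. Variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 3504082
end_bp = 3507456
gene_length_bp = 3375
protein_length_aa = 1124

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Each codon is three bases. Assumption: the CDS includes the stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span (bp):", span)
print("codons:", codons, "remainder:", remainder)
print("expected protein length (aa):", expected_protein_length)
```

Under these assumptions the span equals the listed gene length, the length is a whole number of codons, and the codon count minus the stop codon equals the listed protein length.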
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGTGATT ATTCCGCGAA AACATTGCCA GATTGTTCAA TACACTGCCA CAAATCTTTT AATTCAGTAT GTCTTGTTAA TATTGAGGGC ACCATGACTC CAGTAAAAGT GTGGCAAGAG CGCGTTGAGA TCCCGACCTA TGAAACCGGG CCGCAGGATA TACATCCCAT GTTCCTGGAA AATCGCGTTT ATCAGGGATC GTCCGGCGCG GTTTATCCCT ACGGCGTGAC CGATACGCTG AGCGAGCAGA AAACCCTGAA ATCCTGGCAG GCGGTGTGGC TGGAAAACGA CTACATCAAA GTGATGATCC TGCCGGAATT GGGCGGTCGG GTGCATCGCG CATGGGATAA AGTGAAACAG CGCGATTTTG TCTATCACAA TGAAGTCATT AAACCTGCGC TGGTGGGGCT GCTGGGACCG TGGATCTCCG GCGGGATTGA GTTTAACTGG CCGCAACACC ATCGCCCGAC CACCTTTATG CCCGTTGATT TCACCCTCGA AGCCCATGAA GACGGCGCAC AGACGGTGTG GGTAGGCGAA ACGGAGCCGA TGCATGGTTT ACAGGTGATG ACAGGTTTCA CCCTGCGCCC TGACCGGGCG GCGCTGGAAA TCGCCAGCCG CGTCTATAAC GGCAACGCCA CGCCGCGTCA TTTCTTGTGG TGGGCCAACC CGGCAGTGAA AGGGGGTGAA GGGCATCAGA GCGTCTTTCC GCCGGATGTA ACGGCGGTGT TTGATCACGG CAAACGGGCC GTCTCCGCTT TCCCTATCGC CACCGGCACT TACTACAAAG TGGACTACTC CGCTGGAGTG GACATTTCTC GCTATAAAAA TGTGCCCGTT CCAACCTCAT ATATGGCTGA AAAATCACAG TACGATTTTG TTGGCGCGTG GTGTCACGAT GAAGATGGCG GTTTGCTACA CGTTGCCAAC CACCATATTG CGCCAGGTAA AAAGCAGTGG AGCTGGGGAC ACAGTGAATT TGGCCAGGCG TGGGATAAGA GCCTAACTGA TAATAACGGC CCGTATATCG AACTGATGAC CGGTATTTTT GCCGATAACC AGCCTGATTT TACCCGGCTT GATGCTTACG AAGAGAAGCG TTTTGAGCAG TTTTTCCTGC CTTACCATTC TCTGGGCATG GTGCAAAACG CCTCCCGCGA TGCGGTAATC AAACTCCAGC GTAGTGAGCG GGGGATTGAG TGGGGGCTGT ATGCCATCTC TCCGTTGAAC GGATACCGCC TGGCGATCCG CGAAATCGGC AAATGCGACG CGTTGCTCGA TGATGCCGTG GCCCTGACAC CAGCGACCGC CATCCAGGGC GTGTTACACG GTATCAATCC TGAAAGACTG ACCATTGAGC TCTCTGATGC CGACGGCAAT ATTGTACTGA GTTATCAGGA ACATCAGCCG CAAGAGTTGC CGTTGCCGGA CGTCGCCAAA GCGCCACTGT CAGCACAAGA CATTACCAGT ACAGATGAAG CCTGGTTTAT CGGTCAGCAT CTGGAGCAAT ATCATCACGC CAGCCGTTCA CCGTTCGATT ACTACCTGCG CGGCGTGGCG CTGGATCCGC TGGATTACCG CTGTAATCTG GCGCTGGCGA TGCTGGAGTA TAACCGCGCA GATTTCCCGC AAGCGGTGGC GTATGCCACT CAGGCTCTGA AACGCGCACA TGCGCTGAAC AAAAATCCGC AGTGCGGACA GGCGAGTTTG ATTCGCGCCA GTGCTTACGA ACGTCAGGGA CAATATCAAC AAGCCGAAGA GGATTTCTGG CGTGCGGTCT GGAGCGGCAA CAGTAAAGCC 
GGAGGCTATT ATGGTCTGGC ACGACTGGCG GCGCGTAATG GTAACTTCGA CGCGGGTCTG GATTTTTGCC AACAAAGTCT TCGCGCCTGC CCAATCAATC AGGAAGTGCT TTGCCTGCAT AATCTGCTGC TGGTGTTAAG TGGTCGTCAG GACAACGCGC GTTTGCAGCG CGAGAAACTG CTGCGCGATT ATCCGCTGAA CGCCACTCTG TGGTGCCTGA ACTGGTTCGA TGGTCGTAGC GAATCAGCTC TCGCGCAGTG GCGCGGTCTG TGTCAGGGAC GCGACGTTAA CGCCCTGATG ACCGCCGGGC AACTGATTAA CTGGGGAATG CCCACCCTCG CGGCAGAGAT GCTGAATGCA CTGGACTGCC AGCGCACGCT GCCGCTTTAC CTGCAAGCCA GCTTGCTGCC GAAAGCCGAA CGTGGCGAAC TGGTCGCAAA AGCCATTGAT GTCTTCCCGC AGTTTGTCCG TTTCCCGAAT ACGCTGGAAG AAGTGGCGGC GCTGGAGAGT ATTGAAGAGT GCTGGTTTGC TCGCCATTTA CTGGCCTGCT TCTACTACAA CAAACGTAGC TACAACGAAG CCATTGCCTT ATGGCAACGT TGCGTAGAGA TGTCGCCGGA GTTTGCCGAC GGCTGGCGCG GGTTAGCGAT CCATGCGTGG AATAAGCAAC ACGATTATGA GCTGGCCGCG CGTTATCTTG ATAATGCTTA TCAGCTTGCG CCGCAGGATG CACGTCTGCT TTTCGAACGG GATTTGCTTG ATAAGCTAAG TGGAACCACA CCGGAGAAAC GACTGGCGCG TCTGGAAAAT AATCTGGAAA TTGCGCTGAA ACGCGACGAC ATGACCGCAG AACTGCTCAA TTTGTGGCAT CTCACGGGGC AGGCAGACAA AGCGGCGGAC ATTCTCGCCA CGCGCAAATT CCACCCGTGG GAAGGCGGGG AAGGGAAGGT CACCAGTCAG TTTATCCTCA ACCAGTTATT ACGCGCCTGG CAGCATCTTG ATGCCAGAGA GCCGCAGCAG GCCAGCGAAC TGCTTCATGC CGCGCTGCAT TATCCGGAGA ATTTAAGCGA AGGCCGTTTA CCGGGGCAAA CTGATAACGA CATCTGGTTC TGGCAGGCGA TATGCGCCAA AGCCCAGGGC GATGAAACTG AAGCGACGCG CTGTTTACAT CTGGCGGCGA CCGGCGATCG CACCATTAAC ATCCACAGCT ATTACAACGA TCAGCCGGTT GATTACCTCT TCTGGCAAGG AATGGCGCTG CGATTACTGG GCGAACAACA CACCGCACAG CAACTGTTTA GTGAAATGAA ACAGTGGGCG CAAGAGATGG CGAAAACCAG TATCGAAGCG GATTTCTTTG CCGTCTCGCA GCCTGACTTG TTGTCGCTGT ATGGCGATTT ACAACAGCAG CATAAAGAAA AATGCCTGAT GGTGGCGATG CTGGCGGCCG CGGGATTAGG CGAGATTGCG CAATACGAAT CTGCTCGCGC TGAATTGACG GCGATTAATC CGGCCTGGCC GAAAGCGGCA TTATTCACCA CCGTGATGCC TTTTATTTTT AACTACGTTC ACTAA
|
Protein sequence | MRDYSAKTLP DCSIHCHKSF NSVCLVNIEG TMTPVKVWQE RVEIPTYETG PQDIHPMFLE NRVYQGSSGA VYPYGVTDTL SEQKTLKSWQ AVWLENDYIK VMILPELGGR VHRAWDKVKQ RDFVYHNEVI KPALVGLLGP WISGGIEFNW PQHHRPTTFM PVDFTLEAHE DGAQTVWVGE TEPMHGLQVM TGFTLRPDRA ALEIASRVYN GNATPRHFLW WANPAVKGGE GHQSVFPPDV TAVFDHGKRA VSAFPIATGT YYKVDYSAGV DISRYKNVPV PTSYMAEKSQ YDFVGAWCHD EDGGLLHVAN HHIAPGKKQW SWGHSEFGQA WDKSLTDNNG PYIELMTGIF ADNQPDFTRL DAYEEKRFEQ FFLPYHSLGM VQNASRDAVI KLQRSERGIE WGLYAISPLN GYRLAIREIG KCDALLDDAV ALTPATAIQG VLHGINPERL TIELSDADGN IVLSYQEHQP QELPLPDVAK APLSAQDITS TDEAWFIGQH LEQYHHASRS PFDYYLRGVA LDPLDYRCNL ALAMLEYNRA DFPQAVAYAT QALKRAHALN KNPQCGQASL IRASAYERQG QYQQAEEDFW RAVWSGNSKA GGYYGLARLA ARNGNFDAGL DFCQQSLRAC PINQEVLCLH NLLLVLSGRQ DNARLQREKL LRDYPLNATL WCLNWFDGRS ESALAQWRGL CQGRDVNALM TAGQLINWGM PTLAAEMLNA LDCQRTLPLY LQASLLPKAE RGELVAKAID VFPQFVRFPN TLEEVAALES IEECWFARHL LACFYYNKRS YNEAIALWQR CVEMSPEFAD GWRGLAIHAW NKQHDYELAA RYLDNAYQLA PQDARLLFER DLLDKLSGTT PEKRLARLEN NLEIALKRDD MTAELLNLWH LTGQADKAAD ILATRKFHPW EGGEGKVTSQ FILNQLLRAW QHLDAREPQQ ASELLHAALH YPENLSEGRL PGQTDNDIWF WQAICAKAQG DETEATRCLH LAATGDRTIN IHSYYNDQPV DYLFWQGMAL RLLGEQHTAQ QLFSEMKQWA QEMAKTSIEA DFFAVSQPDL LSLYGDLQQQ HKEKCLMVAM LAAAGLGEIA QYESARAELT AINPAWPKAA LFTTVMPFIF NYVH
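The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11, in which TTG is one of the permitted initiation codons and is read as methionine at the start position. The sketch below illustrates this on the first 30 bases and the last 3 bases of the gene sequence as listed. It is an illustration only: the codon dictionary is deliberately partial (only the codons needed here), and the helper name `translate_head` is hypothetical. The sequence appears to be given in coding orientation even though the strand is listed as "-".

```python
# First 30 bases and final 3 bases of the gene sequence, copied from the record.
head = "TTGCGTGATTATTCCGCGAAAACATTGCCA"
tail = "TAA"

# Initiation codons permitted by NCBI translation table 11.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOPS = {"TAA", "TAG", "TGA"}

# Partial standard codon table: only the codons that occur in `head`.
CODONS = {
    "CGT": "R", "GAT": "D", "TAT": "Y", "TCC": "S", "GCG": "A",
    "AAA": "K", "ACA": "T", "TTG": "L", "CCA": "P",
}

def translate_head(seq):
    """Translate a CDS prefix; a valid start codon is read as M."""
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    first = "M" if codons[0] in TABLE11_STARTS else CODONS[codons[0]]
    return first + "".join(CODONS[c] for c in codons[1:])

peptide = translate_head(head)
print(peptide)            # first ten residues
print(tail in STOPS)      # whether the sequence ends in a stop codon
```

Note that TTG occurs twice in these 30 bases: as the initiation codon it gives M, while at the ninth codon it gives the usual L.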