Gene ECH74115_3782 details


Gene Information

Locus tag: ECH74115_3782
Symbol:
ID: 6967023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3504082
End bp: 3507456
Gene Length: 3375 bp
Protein Length: 1124 aa
Translation table: 11
GC content: 54%
IMG OID: 643387569
Product: tetratricopeptide repeat protein
Protein accession: YP_002272022
Protein GI: 209397136
COG category:
COG ID:
TIGRFAM ID:
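
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 3504082
end_bp = 3507456
gene_length_bp = 3375
protein_length_aa = 1124

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not add a residue.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp, codon_count, expected_protein_length)
```

Under these assumptions the computed span matches the stated gene length, and the codon count minus the stop codon matches the stated protein length.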


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGTGATT ATTCCGCGAA AACATTGCCA GATTGTTCAA TACACTGCCA CAAATCTTTT 
AATTCAGTAT GTCTTGTTAA TATTGAGGGC ACCATGACTC CAGTAAAAGT GTGGCAAGAG
CGCGTTGAGA TCCCGACCTA TGAAACCGGG CCGCAGGATA TACATCCCAT GTTCCTGGAA
AATCGCGTTT ATCAGGGATC GTCCGGCGCG GTTTATCCCT ACGGCGTGAC CGATACGCTG
AGCGAGCAGA AAACCCTGAA ATCCTGGCAG GCGGTGTGGC TGGAAAACGA CTACATCAAA
GTGATGATCC TGCCGGAATT GGGCGGTCGG GTGCATCGCG CATGGGATAA AGTGAAACAG
CGCGATTTTG TCTATCACAA TGAAGTCATT AAACCTGCGC TGGTGGGGCT GCTGGGACCG
TGGATCTCCG GCGGGATTGA GTTTAACTGG CCGCAACACC ATCGCCCGAC CACCTTTATG
CCCGTTGATT TCACCCTCGA AGCCCATGAA GACGGCGCAC AGACGGTGTG GGTAGGCGAA
ACGGAGCCGA TGCATGGTTT ACAGGTGATG ACAGGTTTCA CCCTGCGCCC TGACCGGGCG
GCGCTGGAAA TCGCCAGCCG CGTCTATAAC GGCAACGCCA CGCCGCGTCA TTTCTTGTGG
TGGGCCAACC CGGCAGTGAA AGGGGGTGAA GGGCATCAGA GCGTCTTTCC GCCGGATGTA
ACGGCGGTGT TTGATCACGG CAAACGGGCC GTCTCCGCTT TCCCTATCGC CACCGGCACT
TACTACAAAG TGGACTACTC CGCTGGAGTG GACATTTCTC GCTATAAAAA TGTGCCCGTT
CCAACCTCAT ATATGGCTGA AAAATCACAG TACGATTTTG TTGGCGCGTG GTGTCACGAT
GAAGATGGCG GTTTGCTACA CGTTGCCAAC CACCATATTG CGCCAGGTAA AAAGCAGTGG
AGCTGGGGAC ACAGTGAATT TGGCCAGGCG TGGGATAAGA GCCTAACTGA TAATAACGGC
CCGTATATCG AACTGATGAC CGGTATTTTT GCCGATAACC AGCCTGATTT TACCCGGCTT
GATGCTTACG AAGAGAAGCG TTTTGAGCAG TTTTTCCTGC CTTACCATTC TCTGGGCATG
GTGCAAAACG CCTCCCGCGA TGCGGTAATC AAACTCCAGC GTAGTGAGCG GGGGATTGAG
TGGGGGCTGT ATGCCATCTC TCCGTTGAAC GGATACCGCC TGGCGATCCG CGAAATCGGC
AAATGCGACG CGTTGCTCGA TGATGCCGTG GCCCTGACAC CAGCGACCGC CATCCAGGGC
GTGTTACACG GTATCAATCC TGAAAGACTG ACCATTGAGC TCTCTGATGC CGACGGCAAT
ATTGTACTGA GTTATCAGGA ACATCAGCCG CAAGAGTTGC CGTTGCCGGA CGTCGCCAAA
GCGCCACTGT CAGCACAAGA CATTACCAGT ACAGATGAAG CCTGGTTTAT CGGTCAGCAT
CTGGAGCAAT ATCATCACGC CAGCCGTTCA CCGTTCGATT ACTACCTGCG CGGCGTGGCG
CTGGATCCGC TGGATTACCG CTGTAATCTG GCGCTGGCGA TGCTGGAGTA TAACCGCGCA
GATTTCCCGC AAGCGGTGGC GTATGCCACT CAGGCTCTGA AACGCGCACA TGCGCTGAAC
AAAAATCCGC AGTGCGGACA GGCGAGTTTG ATTCGCGCCA GTGCTTACGA ACGTCAGGGA
CAATATCAAC AAGCCGAAGA GGATTTCTGG CGTGCGGTCT GGAGCGGCAA CAGTAAAGCC
GGAGGCTATT ATGGTCTGGC ACGACTGGCG GCGCGTAATG GTAACTTCGA CGCGGGTCTG
GATTTTTGCC AACAAAGTCT TCGCGCCTGC CCAATCAATC AGGAAGTGCT TTGCCTGCAT
AATCTGCTGC TGGTGTTAAG TGGTCGTCAG GACAACGCGC GTTTGCAGCG CGAGAAACTG
CTGCGCGATT ATCCGCTGAA CGCCACTCTG TGGTGCCTGA ACTGGTTCGA TGGTCGTAGC
GAATCAGCTC TCGCGCAGTG GCGCGGTCTG TGTCAGGGAC GCGACGTTAA CGCCCTGATG
ACCGCCGGGC AACTGATTAA CTGGGGAATG CCCACCCTCG CGGCAGAGAT GCTGAATGCA
CTGGACTGCC AGCGCACGCT GCCGCTTTAC CTGCAAGCCA GCTTGCTGCC GAAAGCCGAA
CGTGGCGAAC TGGTCGCAAA AGCCATTGAT GTCTTCCCGC AGTTTGTCCG TTTCCCGAAT
ACGCTGGAAG AAGTGGCGGC GCTGGAGAGT ATTGAAGAGT GCTGGTTTGC TCGCCATTTA
CTGGCCTGCT TCTACTACAA CAAACGTAGC TACAACGAAG CCATTGCCTT ATGGCAACGT
TGCGTAGAGA TGTCGCCGGA GTTTGCCGAC GGCTGGCGCG GGTTAGCGAT CCATGCGTGG
AATAAGCAAC ACGATTATGA GCTGGCCGCG CGTTATCTTG ATAATGCTTA TCAGCTTGCG
CCGCAGGATG CACGTCTGCT TTTCGAACGG GATTTGCTTG ATAAGCTAAG TGGAACCACA
CCGGAGAAAC GACTGGCGCG TCTGGAAAAT AATCTGGAAA TTGCGCTGAA ACGCGACGAC
ATGACCGCAG AACTGCTCAA TTTGTGGCAT CTCACGGGGC AGGCAGACAA AGCGGCGGAC
ATTCTCGCCA CGCGCAAATT CCACCCGTGG GAAGGCGGGG AAGGGAAGGT CACCAGTCAG
TTTATCCTCA ACCAGTTATT ACGCGCCTGG CAGCATCTTG ATGCCAGAGA GCCGCAGCAG
GCCAGCGAAC TGCTTCATGC CGCGCTGCAT TATCCGGAGA ATTTAAGCGA AGGCCGTTTA
CCGGGGCAAA CTGATAACGA CATCTGGTTC TGGCAGGCGA TATGCGCCAA AGCCCAGGGC
GATGAAACTG AAGCGACGCG CTGTTTACAT CTGGCGGCGA CCGGCGATCG CACCATTAAC
ATCCACAGCT ATTACAACGA TCAGCCGGTT GATTACCTCT TCTGGCAAGG AATGGCGCTG
CGATTACTGG GCGAACAACA CACCGCACAG CAACTGTTTA GTGAAATGAA ACAGTGGGCG
CAAGAGATGG CGAAAACCAG TATCGAAGCG GATTTCTTTG CCGTCTCGCA GCCTGACTTG
TTGTCGCTGT ATGGCGATTT ACAACAGCAG CATAAAGAAA AATGCCTGAT GGTGGCGATG
CTGGCGGCCG CGGGATTAGG CGAGATTGCG CAATACGAAT CTGCTCGCGC TGAATTGACG
GCGATTAATC CGGCCTGGCC GAAAGCGGCA TTATTCACCA CCGTGATGCC TTTTATTTTT
AACTACGTTC ACTAA
 
Protein sequence
MRDYSAKTLP DCSIHCHKSF NSVCLVNIEG TMTPVKVWQE RVEIPTYETG PQDIHPMFLE 
NRVYQGSSGA VYPYGVTDTL SEQKTLKSWQ AVWLENDYIK VMILPELGGR VHRAWDKVKQ
RDFVYHNEVI KPALVGLLGP WISGGIEFNW PQHHRPTTFM PVDFTLEAHE DGAQTVWVGE
TEPMHGLQVM TGFTLRPDRA ALEIASRVYN GNATPRHFLW WANPAVKGGE GHQSVFPPDV
TAVFDHGKRA VSAFPIATGT YYKVDYSAGV DISRYKNVPV PTSYMAEKSQ YDFVGAWCHD
EDGGLLHVAN HHIAPGKKQW SWGHSEFGQA WDKSLTDNNG PYIELMTGIF ADNQPDFTRL
DAYEEKRFEQ FFLPYHSLGM VQNASRDAVI KLQRSERGIE WGLYAISPLN GYRLAIREIG
KCDALLDDAV ALTPATAIQG VLHGINPERL TIELSDADGN IVLSYQEHQP QELPLPDVAK
APLSAQDITS TDEAWFIGQH LEQYHHASRS PFDYYLRGVA LDPLDYRCNL ALAMLEYNRA
DFPQAVAYAT QALKRAHALN KNPQCGQASL IRASAYERQG QYQQAEEDFW RAVWSGNSKA
GGYYGLARLA ARNGNFDAGL DFCQQSLRAC PINQEVLCLH NLLLVLSGRQ DNARLQREKL
LRDYPLNATL WCLNWFDGRS ESALAQWRGL CQGRDVNALM TAGQLINWGM PTLAAEMLNA
LDCQRTLPLY LQASLLPKAE RGELVAKAID VFPQFVRFPN TLEEVAALES IEECWFARHL
LACFYYNKRS YNEAIALWQR CVEMSPEFAD GWRGLAIHAW NKQHDYELAA RYLDNAYQLA
PQDARLLFER DLLDKLSGTT PEKRLARLEN NLEIALKRDD MTAELLNLWH LTGQADKAAD
ILATRKFHPW EGGEGKVTSQ FILNQLLRAW QHLDAREPQQ ASELLHAALH YPENLSEGRL
PGQTDNDIWF WQAICAKAQG DETEATRCLH LAATGDRTIN IHSYYNDQPV DYLFWQGMAL
RLLGEQHTAQ QLFSEMKQWA QEMAKTSIEA DFFAVSQPDL LSLYGDLQQQ HKEKCLMVAM
LAAAGLGEIA QYESARAELT AINPAWPKAA LFTTVMPFIF NYVH
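
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes NCBI translation table 11 (the value in the record), whose amino acid assignments match the standard code and in which alternative start codons such as TTG are read as methionine at the first position. The function name `translate_cds` and the two fragments used below (the first 30 and last 15 bases of the gene sequence) are chosen for illustration only; this is not the pipeline that produced the record.

```python
# Codon table in NCBI "TCAG" order; amino acid assignments for table 11
# are the same as for the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, is_cds_start: bool = True) -> str:
    """Translate a DNA fragment, stopping at the first stop codon.

    Whitespace is ignored so the spaced 10-base blocks can be pasted as-is.
    If is_cds_start is True, a recognised start codon is read as 'M'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
head = translate_cds("TTGCGTGATT ATTCCGCGAA AACATTGCCA")
tail = translate_cds("AACTACGTTC ACTAA", is_cds_start=False)
print(head, tail)
```

The first fragment yields MRDYSAKTLP and the last yields NYVH, matching the first ten and last four residues of the protein sequence listed above.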