Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2702 |
Symbol | |
ID | 5594778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2717798 |
End bp | 2721172 |
Gene Length | 3375 bp |
Protein Length | 1124 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921820 |
Product | TPR repeat-containing protein |
Protein accession | YP_001459344 |
Protein GI | 157162026 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGTGATT ATTCCGCGAA AACATTGCCA GATTGTTCAA TACACTGCCA CAAATCTTTT AATTCAGTAT GTCTTGTTAA TATTGAGGGC ACCATGACTC CAGTAAAAGT GTGGCAAGAG CGCGTTGAGA TCCCGACCTA TGAAACCGGG CCGCAGGATA TACATCCCAT GTTCCTGGAA AATCGCGTTT ATCAGGGATC GTCCGGCGCG GTTTATCCCT ATGGCGTGAC CGATACGCTG AGCGAGCAGA AAACCCTGAA ATCCTGGCAG GCGGTGTGGC TGGAAAACGA CTACATCAAA GTGATGATCC TGCCGGAACT GGGCGGTCGT GTGCATCGCG CATGGGATAA AGTGAAACAA CGCGATTTTG TTTATCACAA TGAAGTCATT AAACCTGCGC TGGTGGGGCT GCTGGGACCG TGGATCTCCG GTGGGATTGA GTTTAACTGG CCGCAACACC ATCGCCCGAC CACCTTTATG CCCGTTGATT TCACCCTCGA AGCCCATGAA GACGGCGCAC AGACGGTGTG GGTCGGCGAA ACGGAGCCGA TGCATGGTTT ACAGGTGATG ACAGGTTTCA CCCTGCGCCC TGACCGGGCG GCGCTGGAAA TCGCCAGCCG CGTCTATAAC GGCAACGCCA CGCCGCGTCA TTTCTTGTGG TGGGCCAACC CGGCAGTGAA AGGGGGGGAA GGGCATCAGA GCGTTTTCCC GCCGGATGTA ACGGCGGTGT TTGATCACGG CAAACGGGCC GTCTCCGCTT TCCCCATCGC CACCGGCACT TACTACAAAG TGGACTACTC CGCTGGAGTG GACATTTCTC GCTATAAAAA TGTGCCCGTT CCAACCTCAT ATATGGCTGA AAAATCACAG TACGATTTTG TTGGCGCGTG GTGTCACGAT GAAGATGGCG GTTTGCTGCA CGTTGCCAAC CACCATATTG CGCCAGGTAA AAAACAGTGG AGTTGGGGAC ACAGTGAATT TGGCCAGGCG TGGGACAAGA GCCTGACCGA CAATAACGGC CCGTATATCG AACTGATGAC CGGTATTTTT GCCGATAACC AGCCTGATTT TACCTGGCTT GATGCTTACG AAGAGAAGCG TTTCGAGCAG TATTTCCTGC CTTATCATTC TCTGGGCATG GTGCAAAATG CCTCCCGCGA TGCGGTGATA AAACTCCAGC GTAGTGAGCG GGGGATTGAG TGGGGGCTGT ATGCCATCTC TCCGTTGAAC GGATACCGCC TGGCGATCCG CGAAATCGGC AAATGCAACG CGTTACTTGA TGATGCCGTG GCACTGATGC CTGCGACCGC CATCCAGGGC GTGTTGCACG GTATCAATCC TGAAAGGCTG ACCATTGAGC TCTCCGATGC CGACGGCAAT ATTGTACTGA GTTATCAGGA ACATCAGGCG CAAGAGTTGC CGTTGCCGGA CGTCGCCAAA GCGCCACTGG CAGCACAAGA CATTACCAGT ACAGATGAAG CCTGGTTTAT CGGTCAGCAT CTGGAGCAAT ATCATCACGC CAGCCGTTCA CCGTTCGATT ACTACCTGCG CGGCGTGGCG CTGGATCCGC TGGATTACCG CTGTAACCTG GCGCTGGCGA TGCTGGAATA TAACCGTGCC GATTTCCCGC AAGCGGTGGC GTATGCCACT CAGGCTCTGA AACGCGCACA TGCGCTGAAC AAAAATCCGC AGTGCGGACA GGCGAGTTTG ATTCGCGCCA GTGCTTACGA ACGTCAGGGA CAATATCAAC AAGCCGAAGA GGATTTCTGG CGGGCGGTCT GGAGCGGCAA CAGCAAAGCC GGTGGCTATT ATGGCCTGGC ACGACTGGCT GCGCGTAATG GTAACTTCGA CGCGGGTCTG GATTTTTGCC AACAAAGTCT TCGCGCCTGC CCAACCAATC AGGAAGTGCT TTGCCTGCAT AACCTGCTGC TGGTGTTAAG TGGTCGTCAG GACAACGCGC GTTTGCAGCG CGAGAAACTG CTGCGCGATT ATCCGCTGAA CGCCACTCTG TGGTGGCTGA ACTGGTTCGA TGGTCGTAGC GAATCAGCCC TCGCGCAGTG GCGCGGTCTG TGTCAGGGAC GCGACGTTAA CGCTCTGATG ACCGCCGGGC AACTGATTAA CTGGGGAATG CCCACCCTGG CGGCAGAGAT GCTGAACGCA CTGGACTGCC AGCGCACGCT GCCGCTTTAC CTGCAAGCCA GCTTGCTGCC GAAAGCCGAA CGTGGCGAAC TGGTCGCAAA AGCCATTGAT GTCTTCCCGC AGTTTGTCCG TTTCCCGAAT ACGCTGGAAG AAGTGGCGGC GCTGGAGAGT ATTGAAGAGT GCTGGTTTGC TCGCCATTTA CTGGCTTGCT TCTACTACAA CAAACGTAGC TACAACAAAG CCATTGCCTT TTGGCAACGT TGCGTAGAGA TGTCGCCGGA GTTTGACGAC GGCTGGCGCG GGTTAGCGAT CCATGCGTGG AATAAGCAAC ACGATTATGA GCTGACCGCG CGTTATCTTG ATAATGCTTA TCAGCTTGCG CCGCAGGATG CACGTCTGCT TTTCGAACGG GATTTGCTTG ATAAGTTAAG TGGAGCCACA CCGGAGAAAC GACTGGCGCG TCTGGAAAAT AATCAGGAAA TTGCGCTGAA ACGCGACGAC ATGACCGCAG AACTGCTCAA TTTGTGGCAT CTCACGGGTC AGGCAGACAA AGCGGCGGAC ATTCTCGCCA CGCGTAAATT CCACCCGTGG GAAGGCGGGG AAGGGAAGGT CACCAGTCAG TTTATCCTCA ACCAGTTATT ACGCGCCTGG CAGCATCTTG ATGCCAGAGA GCCGCAGCAG GCCAGCGAAC TGCTTCATGC CGCGCTGCAT TATCCGGAGA ATTTAAGCGA AGGCCGTTTA CCGGGGCAAA CTGATAACGA CATCTGGTTC TGGCAGGCGA TATGCGCCAA CGCGCAGGGC GATGAAACTG AAGCGATGCG TTGTTTACGT CTGGCGGCGA CCGGCGATCG CACCATTAAC ATCCACAGTT ATTACAACGA TCAGCCGGTT GATTATCTCT TCTGGCAAGG AATGGCGCTG CGACTGCTGG GTGAACAGCA AACCGCACAG CAACTGTTTA GTGAAATGAA ACAGTGGGCG CAAGAGATGG CGAAAACCAG TATTGAGGCG GATTTCTTTG CTGTTTCACA ACCTGACCTG TTGTCGCTGT ATGGCGATTT ACAACAGCAG CATAAAGAAA AATGCCTGAT GGTGGCGATG CTGGCGTCCG CGGGACTCGG GGAGGTTGCG CAATATGAAT CTGCTCGCGC TGAATTGACG GCGATTAATC CGGCCTGGCC GAAAGCGGCA TTATTCACCA CCGTGATGCC TTTTATTTTT AACTACGTTC ACTAA
|
Protein sequence | MRDYSAKTLP DCSIHCHKSF NSVCLVNIEG TMTPVKVWQE RVEIPTYETG PQDIHPMFLE NRVYQGSSGA VYPYGVTDTL SEQKTLKSWQ AVWLENDYIK VMILPELGGR VHRAWDKVKQ RDFVYHNEVI KPALVGLLGP WISGGIEFNW PQHHRPTTFM PVDFTLEAHE DGAQTVWVGE TEPMHGLQVM TGFTLRPDRA ALEIASRVYN GNATPRHFLW WANPAVKGGE GHQSVFPPDV TAVFDHGKRA VSAFPIATGT YYKVDYSAGV DISRYKNVPV PTSYMAEKSQ YDFVGAWCHD EDGGLLHVAN HHIAPGKKQW SWGHSEFGQA WDKSLTDNNG PYIELMTGIF ADNQPDFTWL DAYEEKRFEQ YFLPYHSLGM VQNASRDAVI KLQRSERGIE WGLYAISPLN GYRLAIREIG KCNALLDDAV ALMPATAIQG VLHGINPERL TIELSDADGN IVLSYQEHQA QELPLPDVAK APLAAQDITS TDEAWFIGQH LEQYHHASRS PFDYYLRGVA LDPLDYRCNL ALAMLEYNRA DFPQAVAYAT QALKRAHALN KNPQCGQASL IRASAYERQG QYQQAEEDFW RAVWSGNSKA GGYYGLARLA ARNGNFDAGL DFCQQSLRAC PTNQEVLCLH NLLLVLSGRQ DNARLQREKL LRDYPLNATL WWLNWFDGRS ESALAQWRGL CQGRDVNALM TAGQLINWGM PTLAAEMLNA LDCQRTLPLY LQASLLPKAE RGELVAKAID VFPQFVRFPN TLEEVAALES IEECWFARHL LACFYYNKRS YNKAIAFWQR CVEMSPEFDD GWRGLAIHAW NKQHDYELTA RYLDNAYQLA PQDARLLFER DLLDKLSGAT PEKRLARLEN NQEIALKRDD MTAELLNLWH LTGQADKAAD ILATRKFHPW EGGEGKVTSQ FILNQLLRAW QHLDAREPQQ ASELLHAALH YPENLSEGRL PGQTDNDIWF WQAICANAQG DETEAMRCLR LAATGDRTIN IHSYYNDQPV DYLFWQGMAL RLLGEQQTAQ QLFSEMKQWA QEMAKTSIEA DFFAVSQPDL LSLYGDLQQQ HKEKCLMVAM LASAGLGEVA QYESARAELT AINPAWPKAA LFTTVMPFIF NYVH
|
| |