Gene Information |
Locus tag | EcDH1_1119 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 1201017 |
End bp | 1204298 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | ACX38793 |
Protein GI | 260448371 |
COG category | |
COG ID | |
TIGRFAM ID | |
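
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration and not part of the database record. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields above.
START_BP = 1201017
END_BP = 1204298


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3282, matches "Gene Length"
print(protein_length(length_bp))  # 1093, matches "Protein Length"
```

Under these assumptions the record is internally consistent: the span from Start bp to End bp gives the listed gene length, and dropping the terminal stop codon gives the listed protein length.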
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCCAG TAAAAGTGTG GCAAGAGCGC GTTGAGATCC CGACCTATGA AACCGGGCCG CAGGATATAC ATCCCATGTT CCTGGAAAAT CGCGTTTATC AGGGATCGTC CGGCGCGGTT TATCCCTACG GCGTGACCGA TACGCTGAGC GAGCAGAAAA CCCTGAAATC CTGGCAGGCG GTGTGGCTGG AAAACGACTA CATCAAAGTG ATGATCCTGC CGGAACTGGG CGGTCGGGTG CATCGCGCGT GGGATAAAGT GAAACAGCGC GATTTTGTTT ATCACAATGA AGTCATTAAA CCTGCGCTGG TGGGGCTGCT GGGGCCGTGG ATTTCCGGTG GGATTGAGTT TAACTGGCCG CAACATCATC GCCCGACCAC CTTTATGCCC GTTGATTTCA CCCTCGAAGC CCATGAAGAC GGTGCACAGA CGGTGTGGGT AGGCGAAACG GAGCCGATGC ATGGTTTACA GGTGATGACA GGTTTCACCC TGCGCCCTGA CCGGGCGGCG CTGGAAATCG CCAGCCGCGT CTATAACGGG AACGCCACGC CGCGTCATTT CTTGTGGTGG GCCAACCCGG CAGTGAAAGG GGGGGAAGGG CATCAGAGCG TCTTCCCGCC GGATGTAACG GCGGTGTTTG ATCACGGCAA ACGGGCCGTC TCCGCTTTCC CCATCGCTAC CGGCACTTAC TACAAAGTGG ACTACTCCGC TGGAGTGGAC ATTTCTCGCT ATAAAAATGT GCCTGTTCCA ACCTCATATA TGGCCGAAAA ATCGCAGTAC GATTTTGTCG GCGCGTGGTG TCACGATGAA GATGGCGGTT TGCTGCACGT TGCCAACCAC CATATTGCGC CAGGTAAAAA ACAGTGGAGT TGGGGACACA GTGAATTTGG CCAGGCGTGG GATAAGAGTC TGACCGACAA TAACGGCCCG TATATCGAAC TGATGACCGG TATTTTTGCC GATAACCAGC CCGATTTTAC CTGGCTTGAT GCTTACGAAG AGAAGCGTTT TGAGCAGTAT TTCCTGCCTT ATCATTCTTT GGGCATGGTG CAAAATGCCT CCCGCGATGC GGTGATAAAA CTCCAGCGTA GTAAGCGGGG GATTGAGTGG GGGCTGTATG CCATCTCTCC GTTGAACGGA TACCGCCTGG CGATCCGCGA AATCGGTAAA TGCAACGCGT TACTTGATGA TGCCGTGGCA CTGATGCCTG CGACCGCCAT CCAGGGCGTG TTGCACGGTA TCAATCCTGA AAGGCTGACC ATTGAGCTCT CCGATGCCGA CGGCAATATT GTACTGAGTT ATCAGGAACA TCAGCCGCAA GCGTTGCCGT TACCGGACGT CGCCAAAGCG CCACTGGCAG CACAAGACAT TACCAGTACA GATGAAGCCT GGTTTATCGG TCAGCATCTG GAGCAATATC ATCACGCCAG CCGTTCACCG TTCGATTACT ACCTGCGCGG CGTGGCGCTG GACCCGCTGG ATTATCGCTG TAACCTGGCG CTGGCGATGC TGGAATATAA CCGCGCAGAT TTCCCGCAAG CGGTGGCGTA TGCCACTCAG GCTCTGAAAC GCGCACATGC GCTGAACAAA AATCCGCAGT GCGGACAGGC GAGTTTGATT CGCGCCAGTG CTTACGAACG TCAGGGACAA TATCAACAAG CCGAAGAGGA TTTCTGGCGT GCGGTCTGGA GCGGCAACAG CAAAGCCGGT GGCTATTATG GCCTGGCACG ACTGGCGGCG CGTAATGGTA ACTTCGACGC TGGTCTGGAT TTTTGCCAAC AAAGTCTTCG CGCCTGCCCA 
ACCAATCAGG AAGTGCTTTG CCTGCATAAT CTGCTGCTGG TGTTAAGTGG TCGTCAGGAC AACGCGCGTG TGCAGCGCGA GAAACTGCTG CGCGATTATC CGCTGAACGC CACTCTGTGG TGGCTGAACT GGTTCGATGG TCGTAGCGAA TCAGCTCTCG CGCAGTGGCG CGGTCTGTGT CAGGGACGCG ACGTTAACGC CCTGATGACC GCCGGGCAAC TGATTAACTG GGGAATGCCC ACCCTGGCGG CAGAGATGCT GAACGCACTG GACTGCCAGC GCACGCTGCC GCTTTACCTG CAAGCCAGCT TGCTGCCGAA AGCCGAACGT GGCGAACTGG TCGCAAAAGC CATTGATGTC TTCCCGCAGT TTGTCCGTTT CCCGAATACG CTGGAAGAAG TGGCGGCGCT GGAGAGTATT GAAGAGTGCT GGTTTGCTCG CCATTTACTA GCTTGCTTCT ACTACAACAA ACGTAGCTAC AACAAAGCCA TTGCCTTTTG GCAACGTTGC GTAGAGATGT CGCCGGAGTT TGCCGACGGC TGGCGCGGGT TAGCGATCCA TGCGTGGAAT AAGCAACACG ATTATGAGCT GGCCGCGCGT TATCTTGATA ATGCTTATCA GCTTGCGCCG CAGGATGCAC GTCTGCTTTT CGAACGGGAT TTGCTGGATA AGTTAAGTGG AGCCACACCG GAGAAACGAC TGGCGCGTCT GGAAAATAAT CTGGAAATTG CGCTGAAACG CGACGACATG ACCGCAGAAC TGCTCAATTT GTGGCATCTC ACGGGTCAGG CAGACAAAGC GGCGGACATT CTCGCCACGC GCAAATTCCA CCCGTGGGAA GGCGGGGAAG GGAAGGTCAC CAGTCAGTTT ATCCTCAACC AGTTATTACG CGCCTGGCAG CATCTTGATG CCAGACAGCC GCAGCAGGCC TGCGAACTGC TTCATGCCGC GCTGCATTAT CCGGAGAATT TAAGCGAAGG CCGTTTACCG GGGCAAACTG ATAACGACAT CTGGTTCTGG CAGGCGATAT GCGCCAACGC GCAGGGCGAT GAAACTGAAG CGACGCGTTG TTTACGTCTG GCGGCGACCG GCGATCGCAC TATTAACATT CACAGTTATT ACAACGATCA GCCGGTTGAT TATCTCTTCT GGCAAGGGAT GGCGCTGCGA CTGCTGGGCG AACAACAAAC CGCACAGCAA CTGTTTAGTG AAATGAAACA GTGGGCGCAA GAGATGGCGA AAACCAGTAT CGAAGCGGAT TTCTTTGCCG TCTCGCAACC TGACTTGTTG TCGCTGTATG GCGATTTACA ACAGCAGCAT AAAGAAAAAT GCCTGATGGT GGCGATGCTG GCGTCCGCGG GACTCGGGGA GGTTGCGCAA TATGAGTCTG CTCGCGCTGA ATTGACGGCG ATTAATCCGG CTTGGCCGAA AGCGGCATTA TTCACCACCG TGATGCCTTT TATTTTTAAC CGCGTTCACT AA
|
Protein sequence | MTPVKVWQER VEIPTYETGP QDIHPMFLEN RVYQGSSGAV YPYGVTDTLS EQKTLKSWQA VWLENDYIKV MILPELGGRV HRAWDKVKQR DFVYHNEVIK PALVGLLGPW ISGGIEFNWP QHHRPTTFMP VDFTLEAHED GAQTVWVGET EPMHGLQVMT GFTLRPDRAA LEIASRVYNG NATPRHFLWW ANPAVKGGEG HQSVFPPDVT AVFDHGKRAV SAFPIATGTY YKVDYSAGVD ISRYKNVPVP TSYMAEKSQY DFVGAWCHDE DGGLLHVANH HIAPGKKQWS WGHSEFGQAW DKSLTDNNGP YIELMTGIFA DNQPDFTWLD AYEEKRFEQY FLPYHSLGMV QNASRDAVIK LQRSKRGIEW GLYAISPLNG YRLAIREIGK CNALLDDAVA LMPATAIQGV LHGINPERLT IELSDADGNI VLSYQEHQPQ ALPLPDVAKA PLAAQDITST DEAWFIGQHL EQYHHASRSP FDYYLRGVAL DPLDYRCNLA LAMLEYNRAD FPQAVAYATQ ALKRAHALNK NPQCGQASLI RASAYERQGQ YQQAEEDFWR AVWSGNSKAG GYYGLARLAA RNGNFDAGLD FCQQSLRACP TNQEVLCLHN LLLVLSGRQD NARVQREKLL RDYPLNATLW WLNWFDGRSE SALAQWRGLC QGRDVNALMT AGQLINWGMP TLAAEMLNAL DCQRTLPLYL QASLLPKAER GELVAKAIDV FPQFVRFPNT LEEVAALESI EECWFARHLL ACFYYNKRSY NKAIAFWQRC VEMSPEFADG WRGLAIHAWN KQHDYELAAR YLDNAYQLAP QDARLLFERD LLDKLSGATP EKRLARLENN LEIALKRDDM TAELLNLWHL TGQADKAADI LATRKFHPWE GGEGKVTSQF ILNQLLRAWQ HLDARQPQQA CELLHAALHY PENLSEGRLP GQTDNDIWFW QAICANAQGD ETEATRCLRL AATGDRTINI HSYYNDQPVD YLFWQGMALR LLGEQQTAQQ LFSEMKQWAQ EMAKTSIEAD FFAVSQPDLL SLYGDLQQQH KEKCLMVAML ASAGLGEVAQ YESARAELTA INPAWPKAAL FTTVMPFIFN RVH