Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1128 |
Symbol | |
ID | 6068006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1230178 |
End bp | 1233459 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641600544 |
Product | TPR repeat-containing protein |
Protein accession | YP_001724122 |
Protein GI | 170019168 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCCAG TAAAAGTGTG GCAAGAGCGC GTTGAGATCC CGACCTATGA AACCGGGCCG CAGGATATAC ATCCCATGTT CCTGGAAAAT CGCGTTTATC AGGGATCGTC CGGCGCGGTT TATCCCTATG GCGTGACCGA TACGCTGAGC GAGCAGAAAA CCCTGAAATC CTGGCAGGCG GTGTGGCTGG AAAACGACTA CATCAAAGTG ATGATCCTGC CGGAACTGGG CGGTCGTGTG CATCGCGCAT GGGATAAAGT GAAACAACGC GATTTTGTTT ATCACAATGA AGTCATTAAA CCTGCGCTGG TGGGGCTGCT GGGACCGTGG ATCTCCGGTG GGATTGAGTT TAACTGGCCG CAACACCATC GCCCGACCAC CTTTATGCCC GTTGATTTCA CCCTCGAAGC CCATGAAGAC GGCGCACAGA CGGTGTGGGT CGGCGAAACG GAGCCGATGC ATGGTTTACA GGTGATGACA GGTTTCACCC TGCGCCCTGA CCGGGCGGCG CTGGAAATCG CCAGCCGCGT CTATAACGGC AACGCCACGC CGCGTCATTT CTTGTGGTGG GCCAACCCGG CAGTGAAAGG GGGGGAAGGG CATCAGAGCG TTTTCCCGCC GGATGTAACG GCGGTGTTTG ATCACGGCAA ACGGGCCGTC TCCGCTTTCC CCATCGCCAC CGGCACTTAC TACAAAGTGG ACTACTCCGC TGGAGTGGAC ATTTCTCGCT ATAAAAATGT GCCCGTTCCA ACCTCATATA TGGCTGAAAA ATCACAGTAC GATTTTGTTG GCGCGTGGTG TCACGATGAA GATGGCGGTT TGCTGCACGT TGCCAACCAC CATATTGCGC CAGGTAAAAA ACAGTGGAGT TGGGGACACA GTGAATTTGG CCAGGCGTGG GACAAGAGCC TGACCGACAA TAACGGCCCG TATATCGAAC TGATGACCGG TATTTTTGCC GATAACCAGC CTGATTTTAC CTGGCTTGAT GCTTACGAAG AGAAGCGTTT CGAGCAGTAT TTCCTGCCTT ATCATTCTCT GGGCATGGTG CAAAATGCCT CCCGCGATGC GGTGATAAAA CTCCAGCGTA GTGAGCGGGG GATTGAGTGG GGGCTGTATG CCATCTCTCC GTTGAACGGA TACCGCCTGG CGATCCGCGA AATCGGCAAA TGCAACGCGT TACTTGATGA TGCCGTGGCC TTGACACCAG CGACCGCCAT CCAGGGCGTG TTGCATGGTA TCAATCCTGA AAGGCTGACC ATTGAGCTCT CCGATGCCGA CGGCAATATT GTACTGAGTT ATCAGGAACA TCAGCCGCAA GAGTTGCCGT TGCCGGACGT CGCCAAAGCG CCACTGGCAG CACAAGACAT TACCAGTACA GATGAAGCCT GGTTTATCGG TCAGCATCTG GAGCAATATC ATCACGCCAG CCGTTCACCG TTCGATTACT ACCTGCGCGG TGTGGCGCTG GATCCGCTGG ATTACCGCTG TAACCTGGCG CTGGCGATGC TGGAATATAA CCGCGCAGAT TTCCCACAAG CGGTGGCGTA TGCCACTCAG GCGCTGAAAC GCGCACATGC GCTGAACAAA AATCCGCAGT GCGGACAGGC GAGTTTGATT CGCGCCAGTG CTTACGAACG TCAGGGACAA TATCAACAAG CCGAAGAGGA TTTCTGGCGT GCGGTCTGGA GCGGCAATAG TAAAGCAGGC GGCTATTATG GTCTGGCACG ACTGGCGGCG CGTAATGGAA ACTTCGACGC GGGTCTGGAT TTTTGCCAAC ACAGTCTTCG CACCTGCCCA ACCAATCAGG AAGTGCTTTG CCTGCATAAT CTGCTGCTGG TGTTAAGTGG TCGTCAGGAC AACGCGCGTT TGCAGCGCGA GAAACTGCTG CGCGATTATC CGCTGAACGC CACTCTGTGG TGGCTGAACT GGTTCGATGG TCGTAGCGAA TCAGCTCTCG CGCAGTGGCG CGGTCTGTGT CAGGGACGCG ACGTTAACGC CCTGATGACC GCCGGGCAAC TGATTAACTG GGGAATGCCC ACCCTGGCGG CAGAGATGCT GAACGCACTG GACTGCCAGC GCACGCTGCC GCTTTACCTG CAAGCCAGCT TGCTGCCGAA AGCCGAACGT GGCGAACTGG TCGCAAAAGC CATTGATGTC TTCCCGCAGT TTGTCCGTTT CCCGAATACG TTGGAAGAAG TGGCGGCGCT GGAGAGTATT GAAGAGTGCT GGTTTGCTCG CCATTTACTG GCCTGCTTCT ACTACAACAA ACGTAGCTAC AACAAAGCCA TTGCCTTATG GCAACGTTGC GTAGAGATGT CGCCGGAGTT TGCCGACGGC TGGCGCGGGT TAGCGATCCA TGCGTGGAAT AAGCAACACG ATTATGAGCT GGCCGCGCGT TATCTTGATA ATGCTTATCA GCTTGCGCCG CAGGATGCAC GTCTGCTTTT CGAACGGGAT TTGCTTGATA AGTTAAGTGG AGCCACACCG GAGAAACGAC TGGCGCGTCT GGAAAATAAT CAGGAAATTG CGCTGAAACG CGACGACATG ACCGCAGAAC TGCTCAATTT GTGGCATCTC ACGGGTCAGG CAGACAAAGC GGCGGACATT CTCGCCACGC GTAAATTCCA CCCGTGGGAA GGCGGGGAAG GAAAGGTCAC CAGTCAGTTT ATCCTCAACC AGTTATTACG CGCCTGGCAG CATCTTGATG CCAGACAGCC GCAGCAGGCC AGCGAACTGC TTCATGCCGC GCTGCATTAT CCGGAGAATT TAAGCGAAGG CCGTTTACCG GGGCAAACTG ATAACGACAT CTGGTTCTGG CAGGCGATAT GCGCCAACGC GCAGGGCGAT GAAACTGAAG CGATGCGTTG TTTACGTCTG GCGGCGACCG GCGATCGCAC CATTAACATC CACAGTTATT ACAACGATCA GCCGGTTGAT TATCTCTTCT GGCAAGGAAT GGCGCTGCGA CTGCTGGGTG AACAGCAAAC CGCACAGCAA CTGTTTAGTG AAATGAAACA GTGGGCGCAA GAGATGGCGA AAACCAGTAT TGAGGCGGAT TTCTTTGCTG TTTCACAACC TGACCTGTTG TCGCTGTATG GCGATTTACA ACAGCAGCAT AAAGAAAAAT GCCTGATGGT GGCGATGCTG GCGTCCGCGG GACTCGGGGA GGTTGCGCAA TATGAATCTG CTCGCGCTGA ATTGACGGCG ATTAATCCGG CCTGGCCGAA AGCGGCATTA TTCACCACCG TGATGCCTTT TATTTTTAAC TACGTTCACT AA
|
Protein sequence | MTPVKVWQER VEIPTYETGP QDIHPMFLEN RVYQGSSGAV YPYGVTDTLS EQKTLKSWQA VWLENDYIKV MILPELGGRV HRAWDKVKQR DFVYHNEVIK PALVGLLGPW ISGGIEFNWP QHHRPTTFMP VDFTLEAHED GAQTVWVGET EPMHGLQVMT GFTLRPDRAA LEIASRVYNG NATPRHFLWW ANPAVKGGEG HQSVFPPDVT AVFDHGKRAV SAFPIATGTY YKVDYSAGVD ISRYKNVPVP TSYMAEKSQY DFVGAWCHDE DGGLLHVANH HIAPGKKQWS WGHSEFGQAW DKSLTDNNGP YIELMTGIFA DNQPDFTWLD AYEEKRFEQY FLPYHSLGMV QNASRDAVIK LQRSERGIEW GLYAISPLNG YRLAIREIGK CNALLDDAVA LTPATAIQGV LHGINPERLT IELSDADGNI VLSYQEHQPQ ELPLPDVAKA PLAAQDITST DEAWFIGQHL EQYHHASRSP FDYYLRGVAL DPLDYRCNLA LAMLEYNRAD FPQAVAYATQ ALKRAHALNK NPQCGQASLI RASAYERQGQ YQQAEEDFWR AVWSGNSKAG GYYGLARLAA RNGNFDAGLD FCQHSLRTCP TNQEVLCLHN LLLVLSGRQD NARLQREKLL RDYPLNATLW WLNWFDGRSE SALAQWRGLC QGRDVNALMT AGQLINWGMP TLAAEMLNAL DCQRTLPLYL QASLLPKAER GELVAKAIDV FPQFVRFPNT LEEVAALESI EECWFARHLL ACFYYNKRSY NKAIALWQRC VEMSPEFADG WRGLAIHAWN KQHDYELAAR YLDNAYQLAP QDARLLFERD LLDKLSGATP EKRLARLENN QEIALKRDDM TAELLNLWHL TGQADKAADI LATRKFHPWE GGEGKVTSQF ILNQLLRAWQ HLDARQPQQA SELLHAALHY PENLSEGRLP GQTDNDIWFW QAICANAQGD ETEAMRCLRL AATGDRTINI HSYYNDQPVD YLFWQGMALR LLGEQQTAQQ LFSEMKQWAQ EMAKTSIEAD FFAVSQPDLL SLYGDLQQQH KEKCLMVAML ASAGLGEVAQ YESARAELTA INPAWPKAAL FTTVMPFIFN YVH
|
| |