Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2702 |
Symbol | |
ID | 6147137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2774798 |
End bp | 2778079 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617573 |
Product | TPR repeat-containing protein |
Protein accession | YP_001744738 |
Protein GI | 170682243 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCCAG TAAAAGTGTG GCAAGAGCGC GTTGAGATCC CGACCTATGA AACCGGGCCG CAGGATATAC ATCCCATGTT CCTGGAAAAT CGCGTTTATC AGGGATCGTC CGGCGCGGTT TATCCCTACG GCGTGACCGA TACGCTGAGC GAGCAGAAAA CCCTGAAATC CTGGCAGGCG GTGTGGCTGG AAAATGACTA CATCAAAGTG ATGATCCTGC CGGAACTGGG CGGTCGGGTG CATCGCGCAT GGGATAAAGT GAAACAGCGC GATTTTGTCT ATCACAATGA AGTCATTAAA CCTGCGCTGG TGGGGCTGCT GGGACCGTGG ATCTCCGGCG GGATTGAGTT TAACTGGCCG CAACACCATC GCCCGACCAC CTTTATGCCC GTTGATTTCA CCCTCGAAGC CCATGACGAC GGCGCACAGA CGGTGTGGGT CGGCGAAACG GAGCCGATGC ACGGTTTACA GGTGATGACA GGTTTCACCC TGCGTCCTGA CCGGGCGGCG CTGGAAATCG CCAGTCGCGT CTATAACGGC AACGCCACGC CGCGTCATTT CTTGTGGTGG GCCAACCCGG CAGTGAAAGG GGGTGAAGGG CATCAGAGCG TCTTCCCGCC GGATGTAACG GCGGTGTTTG ATCACGGCAA ACGGGCCGTC TCCGCTTTCC CCATCGCCAC CGGCACTTAC TACAAAGTGG ACTACTCCGC CGGAGTGGAC ATTTCTCGCT ATAAAAATGT GCCCGTTCCA ACCTCATATA TGGCTGAAAA ATCACAGTAC GATTTTGTTG GCGCGTGGTG TCACGATGAA GATGGTGGTT TGCTACACGT TGCCAACCAC CATATTGCGC CAGGTAAAAA ACAGTGGAGC TGGGGACACA GTGAATTTGG CCAGGCGTGG GATAAGAGCC TGACTGACAA TAACGGCCCG TATATCGAAC TGATGACCGG TATTTTTGCC GATAACCAGC CTGATTTTAC CTGGCTTGAT GCTTACGAGG AGAAGCGTTT CGAGCAGTAT TTCCTGCCTT ATCATTCTCT GGGCATGGTG CAAAATGCCT CCCGCGATGC GGTGATAAAA CTCCAGCGTA GTGAGCGGGG GATTGAGTGG GGGCTGTATG CCATCTCTCC GTTGAACGGA TACCGCCTGG CGATCCGCGA AATCGGCAAA TGCAACGCGT TGCTCGATGA TGCCGTGGCC CTGACACCAG CGACCGCCAT CCAGGGCGTG TTGCACGGTA TCAATCCTGA CAGACTGACC ATTGAGCTCT CCGATGCCGA CGGCAATATT GTACTGAGTT ATCATGAACA TCAGCCGCAA GCGTTGCCGT TGCCGGACGT CGCCAAAGCG CCACTGTCAG CACAAGACAT TACCAGTACA GATGAAGCCT GGTTTATCGG TCAGCATCTG GAGCAATATC ATCACGCCAG CCGTTCACCG TTCGATTACT ACCTGCGCGG CGTGGCGCTG GATCCGCTGG ATTATCGCTG TAACCTGGCG CTGGCGATGC TGGAATATAA CCGCGCCGAT TTCCCGCAAG CGGTGGCGTA TGCCACTCAG GCGCTGAAAC GCGCACATGC GCTGAACAAA AATCCGCAGT GCGGACAGGC CAGTTTGATT CGCGCCAGTG CTTACGAACG TCAGGGACAA TATCAACAAG CCGAAGAGGA TTTCTGGCGG GCGGTCTGGA GCGGCAATAG TAAAGCAGGT GGCTATTATG GTCTGGCACG ACTGGCGGCG CGTAATGGTC ACTTCGATGC TGGTCTGGAT TTTTGCCAAC AAAGTCTTCG CGCCTGCCCA ACCAATCAGG AAGTGCTTTG CCTGCATAAT CTGCTGCTGG TGTTAAGTGG TCGTCAGGAC AACGCGCGTT TGCAGCGCGA GAAACTGCTG CGCGATTATC CGCTGAACGC CACTCTGTGG TGGCTGAACT GGTTCGATGG TCGTAGCGAA TCAGCCCTCG CGCAGTGGCG CGGTCTGTGT CAGGGACGCG ACGTTAACGC CCTGATGACC GCCGGGCAAC TGATTAACTG GGGAATGCCT ACCCTGGCGG CAGAGATGCT GAACGCACTG GACTGCCAGC GCACGCTGCC GCTTTATCTG CAGGCGAGTT TATTGCCGAA AGCCGAACGC GGCGAACTGG TCGCAAAAGC CATTGATGTC TTCCCGCAGT TTGTCCGTTT CCCGAATACG CTGGAAGAAG TGGCGGCGCT GGAGAGTATT GAAGAGTGCT GGTTTGCCCG TCATTTATTG GCCTGCTTCT ACTACAACAA GCGTAGCTAC GGCAAAGCCA TTGCCTTATG GCAACGTTGC GTAGAGATGT CGCCGGAGTT TGCCGACGGC TGGCGCGGGT TAGCGATCCA TGCGTGGAAT AAGCAACACG ATTATGAGCT GGCCGCGCGT TATCTTGATA ATGCTTATCA GCTTGCGCCG CAGGATGCAC GTCTGCTTTT CGAACGGGAT TTGCTGGATA AGTTAAGTGG AGCCACACCG GAGAAACGAC TGGCGCGTCT GGAAAATAAT CTGGAAATTG CGCTGAAACG CGACGACATG ACCGCAGAAC TGCTCAATTT GTGGCATCTC ACGGGTCAGG CAGACAAAGC GGCGGACATT CTCGCCACGC GCAAATTCCA CCCGTGGGAA GGCGGGGAAG GGAAGATCAC CAGTCAGTTT ATCCTCAACC AGTTATTACG CGCCTGGCAG CATCTTGATG CCAGAGAGCC GCAGCAGGCC AGCGAACTGC TTCATGCCGC GCTGCATTAT CCGGAGAATT TAAGCGAAGG CCGTTTACCG GGGCAAACTG ATAACGACAT CTGGTTCTGG CAGGCGATAT GCGCCAACGC GCAGGGAGAT GAAACTGAAG CGATGCGTTG TTTACGTCTG GCGGCGACTG GCGATCGCAC CATTAACATT CACAGTTATT ACAACGATCA GCCGGTTGAT TATCTCTTCT GGCAAGGAAT GGCGCTGCGA CTGCTGGGCG AACAACAAAT CGCACAGCAA CTGTTTAGTG AAATGAAACA GTGGGCGCAA GAGATGGCGA AAACCAGTAT TGAGGCGGAT TTCTTTGCTG TTTCACAACC TGACCTGTTG TCGCTGTATG GCGATTTACA ACAGCAGCAT AAAGAAAAAT GCCTGATGGT GGCGATGCTG GCGTCCGCGG GACTCGGGGA GGTTGCGCAC TATGAATCTG CTCGCGCTGA ATTGACGGCG ATTAATCCGG CCTGGCCGAA AGCGGCATTA TTCACCACTG TGATGCCTTT TATTTTTAAC TACGTTCACT AA
|
Protein sequence | MTPVKVWQER VEIPTYETGP QDIHPMFLEN RVYQGSSGAV YPYGVTDTLS EQKTLKSWQA VWLENDYIKV MILPELGGRV HRAWDKVKQR DFVYHNEVIK PALVGLLGPW ISGGIEFNWP QHHRPTTFMP VDFTLEAHDD GAQTVWVGET EPMHGLQVMT GFTLRPDRAA LEIASRVYNG NATPRHFLWW ANPAVKGGEG HQSVFPPDVT AVFDHGKRAV SAFPIATGTY YKVDYSAGVD ISRYKNVPVP TSYMAEKSQY DFVGAWCHDE DGGLLHVANH HIAPGKKQWS WGHSEFGQAW DKSLTDNNGP YIELMTGIFA DNQPDFTWLD AYEEKRFEQY FLPYHSLGMV QNASRDAVIK LQRSERGIEW GLYAISPLNG YRLAIREIGK CNALLDDAVA LTPATAIQGV LHGINPDRLT IELSDADGNI VLSYHEHQPQ ALPLPDVAKA PLSAQDITST DEAWFIGQHL EQYHHASRSP FDYYLRGVAL DPLDYRCNLA LAMLEYNRAD FPQAVAYATQ ALKRAHALNK NPQCGQASLI RASAYERQGQ YQQAEEDFWR AVWSGNSKAG GYYGLARLAA RNGHFDAGLD FCQQSLRACP TNQEVLCLHN LLLVLSGRQD NARLQREKLL RDYPLNATLW WLNWFDGRSE SALAQWRGLC QGRDVNALMT AGQLINWGMP TLAAEMLNAL DCQRTLPLYL QASLLPKAER GELVAKAIDV FPQFVRFPNT LEEVAALESI EECWFARHLL ACFYYNKRSY GKAIALWQRC VEMSPEFADG WRGLAIHAWN KQHDYELAAR YLDNAYQLAP QDARLLFERD LLDKLSGATP EKRLARLENN LEIALKRDDM TAELLNLWHL TGQADKAADI LATRKFHPWE GGEGKITSQF ILNQLLRAWQ HLDAREPQQA SELLHAALHY PENLSEGRLP GQTDNDIWFW QAICANAQGD ETEAMRCLRL AATGDRTINI HSYYNDQPVD YLFWQGMALR LLGEQQIAQQ LFSEMKQWAQ EMAKTSIEAD FFAVSQPDLL SLYGDLQQQH KEKCLMVAML ASAGLGEVAH YESARAELTA INPAWPKAAL FTTVMPFIFN YVH
|
| |