Gene Information |
Locus tag | ECH74115_0028 |
Symbol | ileS |
ID | 6967877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 26803 |
End bp | 29619 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384109 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_002268632 |
Protein GI | 209400433 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
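The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the Gene Information table (assumed 1-based, inclusive).
START_BP = 26803
END_BP = 29619


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2817 938
```

Both values agree with the Gene Length (2817 bp) and Protein Length (938 aa) rows of the record.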
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0184357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACT ATAAATCAAC CCTGAATTTG CCGGAAACAG GGTTCCCGAT GCGTGGCGAT CTCGCCAAGC GCGAACCGGG AATGCTGGCG CGTTGGACTG ATGATGATCT GTACGGCATC ATTCGTGCGG CTAAAAAAGG CAAAAAAACC TTCATTCTGC ATGATGGCCC TCCTTATGCG AATGGCAGCA TTCATATTGG TCACTCGGTT AACAAGATTC TGAAAGACAT TATCGTGAAG TCCAAAGGGC TTTCCGGTTA TGACTCGCCG TATGTGCCTG GCTGGGACTG CCACGGTCTG CCGATCGAGC TGAAAGTAGA GCAAGAATAC GGTAAGCCGG ATGAGAAATT CACCGCCGCC GAGTTCCGCG CCAAGTGCCG CGAATACGCG GCGACCCAGG TTGACGGTCA ACGCAAAGAC TTTATCCGTC TGGGCGTGCT GGGCGACTGG TCGCACCCGT ACCTGACCAT GGACTTCAAA ACTGAAGCCA ATATCATCCG CGCGCTGGGC AAAATCATCG GCAATGGTCA CCTGCACAAA GGCGCGAAGC CGGTGCACTG GTGCGTAGAC TGCCGTTCTG CACTGGCAGA AGCGGAAGTT GAGTATTACG ACAAAACTTC TCCGTCCATT GACGTCGCTT TCCAGGCGGT CGATCAGGAT GCGCTGAAAG CGAAATTTGG CGTAAGCAAC GTTAACGGCC CAATCTCGCT GGTGATCTGG ACCACTACGC CGTGGACTCT GCCTGCGAAC CGCGCAATCT CTATTGCACC TGATTTCGAC TATGCGCTGG TGCAGATCGA CGGTCAGGCC GTGATTCTGG CGAAAGATCT GGTTGAAAGC GTAATGCAGC GTATCGGCGT GACCGATTAC ACCATTCTCG GCACGGTAAA AGGTGCGGAG CTTGAGCTGC TGCGCTTTGC CCATCCGTTT ATGGGCTTCG ACGTCCCGGC AATCCTCGGC GATCACGTTA CCCTGGATGC CGGTACCGGT GCCGTTCACA CCGCGCCTGG CCACGGCCCG GACGACTATG TGATCGGTCA GAAATACGGC CTGGAAACCG CTAACCCGGT TGGCCCGGAC GGCACTTATC TGCCGGGCAC TTATCCGACG CTGGATGGCG TGAACGTCTT CAAAGCGAAC GACATCGTCG TTGCGCTGCT GCAGGAAAAA GGCGCGCTGC TGCACGTTGA GAAAATGCAG CACAGCTATC CGTGCTGCTG GCGTCACAAA ACGCCGATCA TCTTCCGCGC AACGCCGCAG TGGTTCGTCA GTATGGATCA GAAAGGTCTG CGCGCGCAGT CTCTGAAAGA GATCAAAGGT GTGCAGTGGA TCCCGGACTG GGGCCAGGCG CGTATCGAGT CGATGGTCGC TAACCGTCCT GACTGGTGTA TCTCCCGTCA GCGTACCTGG GGCGTACCGA TGTCTCTGTT CGTGCACAAA GACACGGAAG AGCTGCATCC GCGTACCCTC GAACTGATGG AAGAAGTGGC TAAACGCGTT GAAGTTGATG GCATCCAGGC GTGGTGGGAT CTTGATGCGA AAGAGATCCT CGGCGATGAA GCTGATCAGT ATGTGAAAGT GCCGGATACG CTGGATGTAT GGTTTGACTC CGGCTCTACT CACTCTTCTG TTGTTGACGT GCGCCCGGAA TTTGCCGGTC ACGCTGCGGA CATGTATCTG GAAGGTTCTG ACCAACACCG TGGCTGGTTC ATGTCTTCTC TGATGATCTC CACCGCGATG AAGGGCAAAG CGCCGTATCG TCAGGTGCTG ACCCACGGCT TTACCGTAGA TGGTCAGGGC 
CGCAAGATGT CTAAATCCAT CGGCAACACC GTGTCGCCGC AGGATGTGAT GAACAAACTG GGCGCGGATA TTCTGCGTCT GTGGGTGGCA TCAACCGACT ACACCGGTGA AATGGCCGTT TCTGACGAGA TCCTGAAACG TGCTGCCGAC AGCTATCGTC GTATCCGTAA CACCGCGCGC TTCCTGCTGG CAAACCTGAA CGGTTTTGAT CCGGCAAAAG ATATGGTGAA ACCGGAAGAG ATGGTAGTAC TGGATCGCTG GGCCGTAGGT TGTGCGAAAG CGGCACAGGA AGACATCCTC AAGGCATACG AAGCATACGA TTTCCACGAA GTGGTACAGC GTCTGATGCG CTTCTGCTCC GTTGAGATGG GTTCCTTCTA CCTCGACATC ATCAAAGACC GTCAGTACAC CGCCAAAGCG GACAGCGTGG CGCGTCGTAG CTGCCAGACT GCGCTGTATC ACATCGCAGA AGCGCTGGTT CGCTGGATGG CACCAATCCT CTCCTTCACC GCTGATGAAG TGTGGGGCTA CCTGCCGGGC GAACGTGAAA AATACGTCTT CACCGGCGAG TGGTACGAAG GTCTGTTTGG TCTGGCAGAC AGTGAAGCAA TGAACGATGC GTTCTGGGAC GAGCTGTTGA AAGTGCGTGG CGAAGTGAAC AAAGTCATTG AGCAGGCGCG TGCCGACAAG AAAGTGGGTG GCTCGCTGGA AGCGGCAGTA ACCTTGTATA CAGAACCGGA ACTGGCGGCG AAACTGACCG CATTGGGCGA TGAATTACGA TTTGTCCTGT TGACCTCCGG CGCTACCGTT GCAGACTATA ACGACGCACC TGCTGATGCT CAGCAGAGCG AAGTGCTCAA AGGGCTGAAA GTCGCGTTGA GTAAAGCCGA AGGTGAGAAG TGCCCACGCT GCTGGCACTA CACCCAGGAT GTCGGCAAGG TGGCGGAACA CGCAGAAATC TGCGGCCGCT GTGTCAGCAA CGTCGCCGGT GACGGTGAAA AACGTAAGTT TGCCTGA
|
Protein sequence | MSDYKSTLNL PETGFPMRGD LAKREPGMLA RWTDDDLYGI IRAAKKGKKT FILHDGPPYA NGSIHIGHSV NKILKDIIVK SKGLSGYDSP YVPGWDCHGL PIELKVEQEY GKPDEKFTAA EFRAKCREYA ATQVDGQRKD FIRLGVLGDW SHPYLTMDFK TEANIIRALG KIIGNGHLHK GAKPVHWCVD CRSALAEAEV EYYDKTSPSI DVAFQAVDQD ALKAKFGVSN VNGPISLVIW TTTPWTLPAN RAISIAPDFD YALVQIDGQA VILAKDLVES VMQRIGVTDY TILGTVKGAE LELLRFAHPF MGFDVPAILG DHVTLDAGTG AVHTAPGHGP DDYVIGQKYG LETANPVGPD GTYLPGTYPT LDGVNVFKAN DIVVALLQEK GALLHVEKMQ HSYPCCWRHK TPIIFRATPQ WFVSMDQKGL RAQSLKEIKG VQWIPDWGQA RIESMVANRP DWCISRQRTW GVPMSLFVHK DTEELHPRTL ELMEEVAKRV EVDGIQAWWD LDAKEILGDE ADQYVKVPDT LDVWFDSGST HSSVVDVRPE FAGHAADMYL EGSDQHRGWF MSSLMISTAM KGKAPYRQVL THGFTVDGQG RKMSKSIGNT VSPQDVMNKL GADILRLWVA STDYTGEMAV SDEILKRAAD SYRRIRNTAR FLLANLNGFD PAKDMVKPEE MVVLDRWAVG CAKAAQEDIL KAYEAYDFHE VVQRLMRFCS VEMGSFYLDI IKDRQYTAKA DSVARRSCQT ALYHIAEALV RWMAPILSFT ADEVWGYLPG EREKYVFTGE WYEGLFGLAD SEAMNDAFWD ELLKVRGEVN KVIEQARADK KVGGSLEAAV TLYTEPELAA KLTALGDELR FVLLTSGATV ADYNDAPADA QQSEVLKGLK VALSKAEGEK CPRCWHYTQD VGKVAEHAEI CGRCVSNVAG DGEKRKFA
|
| |
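The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only short excerpts of the sequence are used as input.

```python
from itertools import product

# Codons enumerated with bases in T, C, A, G order (third position varies fastest).
# Amino acid assignments shared by NCBI tables 1 and 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a + strand CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last three codons of the gene sequence above.
first_blocks = "ATGAGTGACT ATAAATCAAC CCTGAATTTG"
last_codons = "TTTGCCTGA"
print(translate(first_blocks))  # MSDYKSTLNL
print(translate(last_codons))   # FA
```

The two outputs match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content row of the record.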