Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0024 |
Symbol | ileS |
ID | 6144335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 27575 |
End bp | 30391 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641614925 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_001742141 |
Protein GI | 170684022 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0199251 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACT ATAAATCAAC CCTGAATTTG CCGGAAACAG GGTTCCCGAT GCGTGGCGAT CTCGCCAAGC GCGAACCGGG AATGCTGGCG CGTTGGACTG ATGATGATCT GTACGGCATC ATCCGTGCGG CTAAAAAAGG CAAAAAAACC TTCATTCTGC ATGATGGCCC TCCTTATGCG AATGGCAGCA TTCATATTGG TCACTCGGTT AACAAGATTC TGAAAGACAT TATCGTGAAG TCCAAAGGGC TTTCCGGTTA TGACTCGCCG TATGTGCCTG GCTGGGACTG TCATGGTCTG CCGATCGAGC TGAAAGTAGA GCAAGAATAC GGTAAGCCGG GTGAGAAATT CACCGCTGCC GAGTTCCGCG CCAAGTGCCG CGAATACGCT GCGACCCAGG TTGACGGTCA ACGCAAAGAC TTTATCCGTC TGGGCGTGCT GGGCGACTGG TCGCACCCGT ACCTGACCAT GGACTTCAAA ACTGAAGCCA ATATCATCCG CGCGCTGGGC AAAATCATCG GCAACGGTCA CCTGCACAAA GGCGCGAAGC CGGTACACTG GTGCGTTGAC TGCCGTTCTG CGCTGGCGGA AGCGGAAGTT GAGTATTACG ACAAAACTTC TCCGTCCATC GACGTCGCTT TCCAGGCGGT CGATCAGGAT GCACTGAAAG CAAAATTTGC CGTAAGCAAC GTTAACGGCC CAATCTCGCT GGTGATCTGG ACCACTACGC CGTGGACTCT GCCTGCGAAC CGCGCAATCT CTATTGCACC TGATTTCGAC TATGTGCTGG TGCAGATCGA CGGTCAGGCC GTGATTCTGG CAAAAGATCT GGTTGAAAGC GTAATGCAGC GTATCGGCGT GACCGATTAC ACCATTCTCG GCACGGTAAA AGGTGCAGAG CTTGAGCTGC TGCGCTTTAC CCATCCGTTT ATGGGCTTCG ACGTTCCGGC AATCCTCGGC GATCACGTTA CCCTGGATGC CGGTACCGGT GCCGTTCACA CCGCGCCTGG CCACGGCCCG GACGACTATG TGATCGGTCA GAAATACGGC CTGGAAACCG CTAACCCGGT TGGCCCGGAC GGCACTTATC TGCCGGGTAC TTACCCGACG CTGGATGGCG TGAACGTTTT CAAAGCGAAC GACATCGTTG TTGCGCTGCT GCAGGAAAAA GGCGCGCTGT TGCACGTTGA GAAAATGCAG CACAGCTATC CGTGCTGCTG GCGTCACAAA ACGCCGATCA TCTTCCGTGC GACGCCGCAG TGGTTCGTCA GCATGGATCA GAAAGGTCTG CGTGCGCAGT CACTGAAAGA GATCAAAGGC GTGCAGTGGA TCCCGGACTG GGGCCAGGCG CGTATCGAGT CGATGGTTGC TAACCGTCCT GACTGGTGTA TCTCCCGTCA GCGCACCTGG GGCGTACCGA TGTCACTGTT CGTGCATAAA GACACGGAAG AGCTGCATCC GCGTACCCTC GAACTGATGG AAGAAGTGGC TAAACGCGTG GAAGTTGACG GCATCCAGGC GTGGTGGGAT CTCGATGCGA AAGAGATCCT CGGCGATGAA GCTGATCAGT ACGTGAAAGT GCCGGATACT CTCGACGTAT GGTTTGACTC CGGATCTACC CACTCTTCCG TTGTTGACGT GCGTCCGGAA TTTGCCGGTC ACGCTGCGGA CATGTATCTG GAAGGTTCAG ACCAGCACCG TGGCTGGTTC ATGTCCTCTC TGATGATCTC TACCGCGATG AAGGGCAAAG CACCGTATCG TCAGGTACTG ACCCACGGCT TTACCGTGGA TGGTCAGGGC CGCAAGATGT CTAAATCCAT CGGCAACACC GTTTCGCCGC AGGATGTGAT GAACAAACTG GGCGCGGATA TTCTGCGTCT GTGGGTGGCA TCAACTGACT ACACTGGTGA AATGGCCGTT TCTGACGAAA TCCTGAAACG TGCTGCCGAC AGCTATCGTC GTATCCGTAA CACCGCGCGC TTCCTGCTGG CAAACCTGAA CGGTTTTGAT CCGGCAAAAG ATATGGTGAA ACCGGAAGAG ATGGTGGTAC TGGATCGCTG GGCCGTAGGT TGTGCGAAAG CGGCACAGGA AGACATCCTC AAGGCGTACG AAGCATACGA TTTCCACGAA GTGGTACAGC GTCTGATGCG CTTCTGCTCC GTTGAGATGG GTTCATTCTA CCTCGACATC ATCAAAGACC GTCAGTATAC CGCCAAAGCT GACAGTGTGG CGCGTCGTAG CTGCCAGACT GCGCTGTATC ACATCGCAGA AGCGCTGGTG CGCTGGATGG CACCAATCCT CTCCTTCACC GCTGATGAAG TGTGGGGCTA CCTGCCGGGC GAACGTGAAA AATACGTCTT CACCGGCGAG TGGTACGAAG GCCTGTTTGG TCTGGCAGAC AGTGAAGCGA TGAACGATGC GTTCTGGGAC GAGCTGTTGA AAGTGCGTGG CGAAGTGAAC AAAGTCATTG AGCAAGCGCG TGCCGACAAG AAAGTGGGCG GCTCGCTGGA AGCGGCAGTA ACCTTGTATG CAGAACCGGA ACTGGCGGCG AAACTGACCG CGCTGGGCGA TGAATTACGA TTTGTCCTGT TGACCTCCGG CGCTACCGTT GCAGACTATA ACGATGCACC TGCTGATGCT CAGCAGAGCG AAGTACTCAA AGGGCTGAAA GTCGCGTTGA GTAAAGCCGA AGGTGAGAAG TGCCCACGCT GCTGGCACTA CACCCAGGAT GTCGGCAAGG TGGCGGAACA CGCAGAAATC TGCGGCCGCT GTGTCAGCAA CGTCGCCGGT GACGGTGAAA AACGTAAGTT TGCCTGA
|
Protein sequence | MSDYKSTLNL PETGFPMRGD LAKREPGMLA RWTDDDLYGI IRAAKKGKKT FILHDGPPYA NGSIHIGHSV NKILKDIIVK SKGLSGYDSP YVPGWDCHGL PIELKVEQEY GKPGEKFTAA EFRAKCREYA ATQVDGQRKD FIRLGVLGDW SHPYLTMDFK TEANIIRALG KIIGNGHLHK GAKPVHWCVD CRSALAEAEV EYYDKTSPSI DVAFQAVDQD ALKAKFAVSN VNGPISLVIW TTTPWTLPAN RAISIAPDFD YVLVQIDGQA VILAKDLVES VMQRIGVTDY TILGTVKGAE LELLRFTHPF MGFDVPAILG DHVTLDAGTG AVHTAPGHGP DDYVIGQKYG LETANPVGPD GTYLPGTYPT LDGVNVFKAN DIVVALLQEK GALLHVEKMQ HSYPCCWRHK TPIIFRATPQ WFVSMDQKGL RAQSLKEIKG VQWIPDWGQA RIESMVANRP DWCISRQRTW GVPMSLFVHK DTEELHPRTL ELMEEVAKRV EVDGIQAWWD LDAKEILGDE ADQYVKVPDT LDVWFDSGST HSSVVDVRPE FAGHAADMYL EGSDQHRGWF MSSLMISTAM KGKAPYRQVL THGFTVDGQG RKMSKSIGNT VSPQDVMNKL GADILRLWVA STDYTGEMAV SDEILKRAAD SYRRIRNTAR FLLANLNGFD PAKDMVKPEE MVVLDRWAVG CAKAAQEDIL KAYEAYDFHE VVQRLMRFCS VEMGSFYLDI IKDRQYTAKA DSVARRSCQT ALYHIAEALV RWMAPILSFT ADEVWGYLPG EREKYVFTGE WYEGLFGLAD SEAMNDAFWD ELLKVRGEVN KVIEQARADK KVGGSLEAAV TLYAEPELAA KLTALGDELR FVLLTSGATV ADYNDAPADA QQSEVLKGLK VALSKAEGEK CPRCWHYTQD VGKVAEHAEI CGRCVSNVAG DGEKRKFA
|
| |