Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0030 |
Symbol | ileS |
ID | 6268661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 25971 |
End bp | 28787 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641724290 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_001878850 |
Protein GI | 187730740 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000755713 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGACT ATAAATCAAC CCTGAATTTG CCGGAAACAG GGTTCCCGAT GCGTGGCGAT CTCGCCAAGC GCGAACCGGG AATGCTGGCG CGTTGGACTG ATGATGATCT GTACGGCATC ATCCGTGCGG CTAAAAAAGG CAAAAAAACC TTCATTCTGC ATGATGGCCC TCCTTATGCG AATGGCAGCA TTCATATTGG TCACTCGGTT AACAAGATTC TGAAAGACAT TATCGTGAAG TCCAAAGGGC TTTCCGGTTA TGACTCGCCG TATGTGCCTG GCTGGGACTG CCACGGTCTG CCGATCGAGC TGAAAGTAGA GCAAGAATAC GGTAAGCCGG GTGAGAAATT CACCGCCGCC GAGTTCCGCG CCAAGTGCCG CGAATACGCG GCGACCCAGG TTGACGGTCA ACGCAAAGAC TTTATCCGTC TGGGCGTGCT GGGCGACTGG TCGCACCCGT ACCTGACCAT GGACTTCAAA ACTGAAGCCA ACATCATCCG CGCGCTGGGC AAAATCATCG GCAACGGTCA CCTGCACAAA GGCGCGAAGC CGGTACACTG GTGCGTTGAC TGCCGTTCTG CGCTGGCGGA AGCGGAAGTT GAGTATTACG ACAAAACTTC TCCGTCCATC GACGTTGCTT TTCAGGCGGT CGATCAGGAT GCACTGAAAG CAAAATTTGC CGTAAGCAAC GTTAACGGCC CAATCTCGCT GGTGATCTGG ACCACTACGC CGTGGACTCT GCCTGCGAAC CGCGCAATCT CTATTGCACC TGATTTCGAC TATGCGCTGG TGCAGATCGA CGGTCAGGCC GTGATTCTGG CGAAAGATCT GGTTGAAAGC GTAATGCAGC GTATCGGCGT GACCGATTAC ACCATTCTCG GCACGGTAAA AGGTGCGGAG CTTGAGTTGC TGCGCTTTAC CCATCCGTTT ATGGGCTTCG ACGTTCCGGC AATCCTCGGC GATCACGTTA CCCTGGATGC CGGTACCGGT GCCGTTCACA CCGCGCCTGG CCACGGCCCG GACGACTATG TGATCGGTCA GAAATACGGC CTGGAAACCG CTAACCCGGT TGGCCCGGAC GGCACTTATC TGCCGGGCAC TTATCCGACG CTGGATGGCG TGAACGTCTT CAAAGCGAAC GACATCGTCG TTGCGCTGCT GCAGGAAAAA GGCGCGCTGC TGCACGTTGA GAAAATGCAG CACAGCTATC CGTGCTGCTG GCGTCACAAA ACGCCGATCA TCTTCCGCGC GACGCCGCAG TGGTTCGTCA GCATGGATCA GAAAGGTCTG CGTGCGCAGT CACTGAAAGA GATCAAAGGC GTGCAGTGGA TCCCGGACTG GGGCCAGGCG CGTATCGAGT CGATGGTTGC TAACCGTCCT GACTGGTGTA TCTCCCGTCA GCGCACCTGG GGCGTACCGA TGTCACTGTT CGTGCACAAA GACACGGAAG AGCTGCATCC GCGTACTCTC GAACTAATGG AAGAAGTGGC AAAACGCGTT GAAGTTGACG GCATCCAGGC GTGGTGGGAT CTTGATGCGA AAGAGATCCT CGGCGATGAA GCTGATCAGT ACGTGAAAGT GCCGGACACA TTGGATGTAT GGTTTGACTC CGGATCTACC CACTCTTCTG TTGTTGACGT GCGTCCGGAA TTTGCCGGTC ATGCAGCGGA CATGTATCTG GAAGGTTCTG ACCAGCACCG TGGTTGGTTC ATGTCTTCCC TAATGATCTC CACCGCGATG AAAGGCAAAG CGCCGTATCG TCAGGTACTG ACCCACGGCT TTACCGTGGA TGGTCAGGGC CGCAAGATGT CTAAATCCAT CGGCAATACC GTTTCGCCGC AGGATGTGAT GAACAAACTG GGCGCGGATA TTCTGCGTCT GTGGGTGGCA TCAACCGACT ACACCGGTGA AATGGCCGTT TCTGACGAGA TCCTGAAACG TGCTGCCGAT AGCTATCGTC GTATCCGTAA CACCGCGCGC TTCCTGCTGG CAAACCTGAA CGGTTTTGAT CCAGCAAAAG ATATGGTGAA ACCGGAAGAG ATGGTGGTAC TGGATCGCTG GGCCGTAGGT TGTGCGAAAG CGGCACAGGA AGACATCCTC AAGGCGTACG AAGCATACGA TTTCCACGAA GTGGTACAGC GTCTGATGCG CTTCTGCTCC GTTGAGATGG GTTCCTTCTA CCTCGACATC ATCAAAGACC GTCAGTACAC CGCCAAAGCG GACAGTGTGG CGCGTCGTAG CTGCCAGACT GCGCTGTATC ACATCGCAGA AGCGCTGGTG CGCTGGATGG CACCAATCCT CTCCTTCACC GCTGATGAAG TGTGGGGCTA CCTGCCGGGC GAACGTGAAA AATACGTCTT CACCGGTGAG TGGTACGAAG GCCTGTTTGG CCTGGCAGAC AGCGAAGCGA TGAACGATGC GTTCTGGGAC GAGCTGTTGA AAGTGCGTGG CGAAGTGAAC AAAGTCATTG AGCAAGCGCG TGCCGACAAG AAAGTGGGTG GCTCGCTGGA AGCGGCAGTA ACCTTGTATG CAGAACCGGA ACTGGCGGCG AAACTGACCG CGCTGGGCGA TGAATTACGA TTTGTCCTGT TGACCTCCGG CGCTACCGTT GCAGACTATC ACGACGCACC TGCTGATGCT CAGCAGAGCG AAGTACTCAA AGGGCTGAAA GTCGCGTTGA GTAAAGCCGA AGGTGAGAAG TGCCCACGCT GCTGGCACTA CACCCAGGAT GTCGGCAAGG TGGCGGAACA CGCAGAAATC TGCGGCCGCT GTGTCAGCAA CGTCGCCGGT GACGGTGAAA AACGTAAGTT TGCCTGA
|
Protein sequence | MSDYKSTLNL PETGFPMRGD LAKREPGMLA RWTDDDLYGI IRAAKKGKKT FILHDGPPYA NGSIHIGHSV NKILKDIIVK SKGLSGYDSP YVPGWDCHGL PIELKVEQEY GKPGEKFTAA EFRAKCREYA ATQVDGQRKD FIRLGVLGDW SHPYLTMDFK TEANIIRALG KIIGNGHLHK GAKPVHWCVD CRSALAEAEV EYYDKTSPSI DVAFQAVDQD ALKAKFAVSN VNGPISLVIW TTTPWTLPAN RAISIAPDFD YALVQIDGQA VILAKDLVES VMQRIGVTDY TILGTVKGAE LELLRFTHPF MGFDVPAILG DHVTLDAGTG AVHTAPGHGP DDYVIGQKYG LETANPVGPD GTYLPGTYPT LDGVNVFKAN DIVVALLQEK GALLHVEKMQ HSYPCCWRHK TPIIFRATPQ WFVSMDQKGL RAQSLKEIKG VQWIPDWGQA RIESMVANRP DWCISRQRTW GVPMSLFVHK DTEELHPRTL ELMEEVAKRV EVDGIQAWWD LDAKEILGDE ADQYVKVPDT LDVWFDSGST HSSVVDVRPE FAGHAADMYL EGSDQHRGWF MSSLMISTAM KGKAPYRQVL THGFTVDGQG RKMSKSIGNT VSPQDVMNKL GADILRLWVA STDYTGEMAV SDEILKRAAD SYRRIRNTAR FLLANLNGFD PAKDMVKPEE MVVLDRWAVG CAKAAQEDIL KAYEAYDFHE VVQRLMRFCS VEMGSFYLDI IKDRQYTAKA DSVARRSCQT ALYHIAEALV RWMAPILSFT ADEVWGYLPG EREKYVFTGE WYEGLFGLAD SEAMNDAFWD ELLKVRGEVN KVIEQARADK KVGGSLEAAV TLYAEPELAA KLTALGDELR FVLLTSGATV ADYHDAPADA QQSEVLKGLK VALSKAEGEK CPRCWHYTQD VGKVAEHAEI CGRCVSNVAG DGEKRKFA
|
| |