Gene ECH74115_0028 details


Gene Information

Locus tag: ECH74115_0028
Symbol: ileS
ID: 6967877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 26803
End bp: 29619
Gene Length: 2817 bp
Protein Length: 938 aa
Translation table: 11
GC content: 56%
IMG OID: 643384109
Product: isoleucyl-tRNA synthetase
Protein accession: YP_002268632
Protein GI: 209400433
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0060] Isoleucyl-tRNA synthetase
TIGRFAM ID: [TIGR00392] isoleucyl-tRNA synthetase
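
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive positions on the replicon (the record does not state this) and that the gene length includes the stop codon (the listed gene sequence ends in TGA). The function names are illustrative and are not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 26803
end_bp = 29619
gene_length_bp = 2817
protein_length_aa = 938


def span_length(start, end):
    """Length of a coordinate range, ASSUMING 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, ASSUMING the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 29619 - 26803 + 1 = 2817 bp
print(span_length(start_bp, end_bp) == gene_length_bp)  # True
# 2817 / 3 = 939 codons, minus one stop codon = 938 aa
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the start, end, gene length, and protein length fields agree with one another.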


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0184357
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACT ATAAATCAAC CCTGAATTTG CCGGAAACAG GGTTCCCGAT GCGTGGCGAT 
CTCGCCAAGC GCGAACCGGG AATGCTGGCG CGTTGGACTG ATGATGATCT GTACGGCATC
ATTCGTGCGG CTAAAAAAGG CAAAAAAACC TTCATTCTGC ATGATGGCCC TCCTTATGCG
AATGGCAGCA TTCATATTGG TCACTCGGTT AACAAGATTC TGAAAGACAT TATCGTGAAG
TCCAAAGGGC TTTCCGGTTA TGACTCGCCG TATGTGCCTG GCTGGGACTG CCACGGTCTG
CCGATCGAGC TGAAAGTAGA GCAAGAATAC GGTAAGCCGG ATGAGAAATT CACCGCCGCC
GAGTTCCGCG CCAAGTGCCG CGAATACGCG GCGACCCAGG TTGACGGTCA ACGCAAAGAC
TTTATCCGTC TGGGCGTGCT GGGCGACTGG TCGCACCCGT ACCTGACCAT GGACTTCAAA
ACTGAAGCCA ATATCATCCG CGCGCTGGGC AAAATCATCG GCAATGGTCA CCTGCACAAA
GGCGCGAAGC CGGTGCACTG GTGCGTAGAC TGCCGTTCTG CACTGGCAGA AGCGGAAGTT
GAGTATTACG ACAAAACTTC TCCGTCCATT GACGTCGCTT TCCAGGCGGT CGATCAGGAT
GCGCTGAAAG CGAAATTTGG CGTAAGCAAC GTTAACGGCC CAATCTCGCT GGTGATCTGG
ACCACTACGC CGTGGACTCT GCCTGCGAAC CGCGCAATCT CTATTGCACC TGATTTCGAC
TATGCGCTGG TGCAGATCGA CGGTCAGGCC GTGATTCTGG CGAAAGATCT GGTTGAAAGC
GTAATGCAGC GTATCGGCGT GACCGATTAC ACCATTCTCG GCACGGTAAA AGGTGCGGAG
CTTGAGCTGC TGCGCTTTGC CCATCCGTTT ATGGGCTTCG ACGTCCCGGC AATCCTCGGC
GATCACGTTA CCCTGGATGC CGGTACCGGT GCCGTTCACA CCGCGCCTGG CCACGGCCCG
GACGACTATG TGATCGGTCA GAAATACGGC CTGGAAACCG CTAACCCGGT TGGCCCGGAC
GGCACTTATC TGCCGGGCAC TTATCCGACG CTGGATGGCG TGAACGTCTT CAAAGCGAAC
GACATCGTCG TTGCGCTGCT GCAGGAAAAA GGCGCGCTGC TGCACGTTGA GAAAATGCAG
CACAGCTATC CGTGCTGCTG GCGTCACAAA ACGCCGATCA TCTTCCGCGC AACGCCGCAG
TGGTTCGTCA GTATGGATCA GAAAGGTCTG CGCGCGCAGT CTCTGAAAGA GATCAAAGGT
GTGCAGTGGA TCCCGGACTG GGGCCAGGCG CGTATCGAGT CGATGGTCGC TAACCGTCCT
GACTGGTGTA TCTCCCGTCA GCGTACCTGG GGCGTACCGA TGTCTCTGTT CGTGCACAAA
GACACGGAAG AGCTGCATCC GCGTACCCTC GAACTGATGG AAGAAGTGGC TAAACGCGTT
GAAGTTGATG GCATCCAGGC GTGGTGGGAT CTTGATGCGA AAGAGATCCT CGGCGATGAA
GCTGATCAGT ATGTGAAAGT GCCGGATACG CTGGATGTAT GGTTTGACTC CGGCTCTACT
CACTCTTCTG TTGTTGACGT GCGCCCGGAA TTTGCCGGTC ACGCTGCGGA CATGTATCTG
GAAGGTTCTG ACCAACACCG TGGCTGGTTC ATGTCTTCTC TGATGATCTC CACCGCGATG
AAGGGCAAAG CGCCGTATCG TCAGGTGCTG ACCCACGGCT TTACCGTAGA TGGTCAGGGC
CGCAAGATGT CTAAATCCAT CGGCAACACC GTGTCGCCGC AGGATGTGAT GAACAAACTG
GGCGCGGATA TTCTGCGTCT GTGGGTGGCA TCAACCGACT ACACCGGTGA AATGGCCGTT
TCTGACGAGA TCCTGAAACG TGCTGCCGAC AGCTATCGTC GTATCCGTAA CACCGCGCGC
TTCCTGCTGG CAAACCTGAA CGGTTTTGAT CCGGCAAAAG ATATGGTGAA ACCGGAAGAG
ATGGTAGTAC TGGATCGCTG GGCCGTAGGT TGTGCGAAAG CGGCACAGGA AGACATCCTC
AAGGCATACG AAGCATACGA TTTCCACGAA GTGGTACAGC GTCTGATGCG CTTCTGCTCC
GTTGAGATGG GTTCCTTCTA CCTCGACATC ATCAAAGACC GTCAGTACAC CGCCAAAGCG
GACAGCGTGG CGCGTCGTAG CTGCCAGACT GCGCTGTATC ACATCGCAGA AGCGCTGGTT
CGCTGGATGG CACCAATCCT CTCCTTCACC GCTGATGAAG TGTGGGGCTA CCTGCCGGGC
GAACGTGAAA AATACGTCTT CACCGGCGAG TGGTACGAAG GTCTGTTTGG TCTGGCAGAC
AGTGAAGCAA TGAACGATGC GTTCTGGGAC GAGCTGTTGA AAGTGCGTGG CGAAGTGAAC
AAAGTCATTG AGCAGGCGCG TGCCGACAAG AAAGTGGGTG GCTCGCTGGA AGCGGCAGTA
ACCTTGTATA CAGAACCGGA ACTGGCGGCG AAACTGACCG CATTGGGCGA TGAATTACGA
TTTGTCCTGT TGACCTCCGG CGCTACCGTT GCAGACTATA ACGACGCACC TGCTGATGCT
CAGCAGAGCG AAGTGCTCAA AGGGCTGAAA GTCGCGTTGA GTAAAGCCGA AGGTGAGAAG
TGCCCACGCT GCTGGCACTA CACCCAGGAT GTCGGCAAGG TGGCGGAACA CGCAGAAATC
TGCGGCCGCT GTGTCAGCAA CGTCGCCGGT GACGGTGAAA AACGTAAGTT TGCCTGA
 
Protein sequence
MSDYKSTLNL PETGFPMRGD LAKREPGMLA RWTDDDLYGI IRAAKKGKKT FILHDGPPYA 
NGSIHIGHSV NKILKDIIVK SKGLSGYDSP YVPGWDCHGL PIELKVEQEY GKPDEKFTAA
EFRAKCREYA ATQVDGQRKD FIRLGVLGDW SHPYLTMDFK TEANIIRALG KIIGNGHLHK
GAKPVHWCVD CRSALAEAEV EYYDKTSPSI DVAFQAVDQD ALKAKFGVSN VNGPISLVIW
TTTPWTLPAN RAISIAPDFD YALVQIDGQA VILAKDLVES VMQRIGVTDY TILGTVKGAE
LELLRFAHPF MGFDVPAILG DHVTLDAGTG AVHTAPGHGP DDYVIGQKYG LETANPVGPD
GTYLPGTYPT LDGVNVFKAN DIVVALLQEK GALLHVEKMQ HSYPCCWRHK TPIIFRATPQ
WFVSMDQKGL RAQSLKEIKG VQWIPDWGQA RIESMVANRP DWCISRQRTW GVPMSLFVHK
DTEELHPRTL ELMEEVAKRV EVDGIQAWWD LDAKEILGDE ADQYVKVPDT LDVWFDSGST
HSSVVDVRPE FAGHAADMYL EGSDQHRGWF MSSLMISTAM KGKAPYRQVL THGFTVDGQG
RKMSKSIGNT VSPQDVMNKL GADILRLWVA STDYTGEMAV SDEILKRAAD SYRRIRNTAR
FLLANLNGFD PAKDMVKPEE MVVLDRWAVG CAKAAQEDIL KAYEAYDFHE VVQRLMRFCS
VEMGSFYLDI IKDRQYTAKA DSVARRSCQT ALYHIAEALV RWMAPILSFT ADEVWGYLPG
EREKYVFTGE WYEGLFGLAD SEAMNDAFWD ELLKVRGEVN KVIEQARADK KVGGSLEAAV
TLYTEPELAA KLTALGDELR FVLLTSGATV ADYNDAPADA QQSEVLKGLK VALSKAEGEK
CPRCWHYTQD VGKVAEHAEI CGRCVSNVAG DGEKRKFA