Gene EcSMS35_0024 details

Gene Information

Locus tag: EcSMS35_0024
Symbol: ileS
ID: 6144335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 27575
End bp: 30391
Gene Length: 2817 bp
Protein Length: 938 aa
Translation table: 11
GC content: 55%
IMG OID: 641614925
Product: isoleucyl-tRNA synthetase
Protein accession: YP_001742141
Protein GI: 170684022
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0060] Isoleucyl-tRNA synthetase
TIGRFAM ID: [TIGR00392] isoleucyl-tRNA synthetase
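
The coordinate and length fields above are related by simple arithmetic. The following minimal Python sketch checks that relationship. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function name `cds_lengths` is hypothetical and not part of any database tooling.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene span that ends
    with a stop codon, which is not counted as an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
print(cds_lengths(27575, 30391))  # (2817, 938)
```

Under these assumptions the result agrees with the listed gene length of 2817 bp and protein length of 938 aa.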


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0199251
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACT ATAAATCAAC CCTGAATTTG CCGGAAACAG GGTTCCCGAT GCGTGGCGAT 
CTCGCCAAGC GCGAACCGGG AATGCTGGCG CGTTGGACTG ATGATGATCT GTACGGCATC
ATCCGTGCGG CTAAAAAAGG CAAAAAAACC TTCATTCTGC ATGATGGCCC TCCTTATGCG
AATGGCAGCA TTCATATTGG TCACTCGGTT AACAAGATTC TGAAAGACAT TATCGTGAAG
TCCAAAGGGC TTTCCGGTTA TGACTCGCCG TATGTGCCTG GCTGGGACTG TCATGGTCTG
CCGATCGAGC TGAAAGTAGA GCAAGAATAC GGTAAGCCGG GTGAGAAATT CACCGCTGCC
GAGTTCCGCG CCAAGTGCCG CGAATACGCT GCGACCCAGG TTGACGGTCA ACGCAAAGAC
TTTATCCGTC TGGGCGTGCT GGGCGACTGG TCGCACCCGT ACCTGACCAT GGACTTCAAA
ACTGAAGCCA ATATCATCCG CGCGCTGGGC AAAATCATCG GCAACGGTCA CCTGCACAAA
GGCGCGAAGC CGGTACACTG GTGCGTTGAC TGCCGTTCTG CGCTGGCGGA AGCGGAAGTT
GAGTATTACG ACAAAACTTC TCCGTCCATC GACGTCGCTT TCCAGGCGGT CGATCAGGAT
GCACTGAAAG CAAAATTTGC CGTAAGCAAC GTTAACGGCC CAATCTCGCT GGTGATCTGG
ACCACTACGC CGTGGACTCT GCCTGCGAAC CGCGCAATCT CTATTGCACC TGATTTCGAC
TATGTGCTGG TGCAGATCGA CGGTCAGGCC GTGATTCTGG CAAAAGATCT GGTTGAAAGC
GTAATGCAGC GTATCGGCGT GACCGATTAC ACCATTCTCG GCACGGTAAA AGGTGCAGAG
CTTGAGCTGC TGCGCTTTAC CCATCCGTTT ATGGGCTTCG ACGTTCCGGC AATCCTCGGC
GATCACGTTA CCCTGGATGC CGGTACCGGT GCCGTTCACA CCGCGCCTGG CCACGGCCCG
GACGACTATG TGATCGGTCA GAAATACGGC CTGGAAACCG CTAACCCGGT TGGCCCGGAC
GGCACTTATC TGCCGGGTAC TTACCCGACG CTGGATGGCG TGAACGTTTT CAAAGCGAAC
GACATCGTTG TTGCGCTGCT GCAGGAAAAA GGCGCGCTGT TGCACGTTGA GAAAATGCAG
CACAGCTATC CGTGCTGCTG GCGTCACAAA ACGCCGATCA TCTTCCGTGC GACGCCGCAG
TGGTTCGTCA GCATGGATCA GAAAGGTCTG CGTGCGCAGT CACTGAAAGA GATCAAAGGC
GTGCAGTGGA TCCCGGACTG GGGCCAGGCG CGTATCGAGT CGATGGTTGC TAACCGTCCT
GACTGGTGTA TCTCCCGTCA GCGCACCTGG GGCGTACCGA TGTCACTGTT CGTGCATAAA
GACACGGAAG AGCTGCATCC GCGTACCCTC GAACTGATGG AAGAAGTGGC TAAACGCGTG
GAAGTTGACG GCATCCAGGC GTGGTGGGAT CTCGATGCGA AAGAGATCCT CGGCGATGAA
GCTGATCAGT ACGTGAAAGT GCCGGATACT CTCGACGTAT GGTTTGACTC CGGATCTACC
CACTCTTCCG TTGTTGACGT GCGTCCGGAA TTTGCCGGTC ACGCTGCGGA CATGTATCTG
GAAGGTTCAG ACCAGCACCG TGGCTGGTTC ATGTCCTCTC TGATGATCTC TACCGCGATG
AAGGGCAAAG CACCGTATCG TCAGGTACTG ACCCACGGCT TTACCGTGGA TGGTCAGGGC
CGCAAGATGT CTAAATCCAT CGGCAACACC GTTTCGCCGC AGGATGTGAT GAACAAACTG
GGCGCGGATA TTCTGCGTCT GTGGGTGGCA TCAACTGACT ACACTGGTGA AATGGCCGTT
TCTGACGAAA TCCTGAAACG TGCTGCCGAC AGCTATCGTC GTATCCGTAA CACCGCGCGC
TTCCTGCTGG CAAACCTGAA CGGTTTTGAT CCGGCAAAAG ATATGGTGAA ACCGGAAGAG
ATGGTGGTAC TGGATCGCTG GGCCGTAGGT TGTGCGAAAG CGGCACAGGA AGACATCCTC
AAGGCGTACG AAGCATACGA TTTCCACGAA GTGGTACAGC GTCTGATGCG CTTCTGCTCC
GTTGAGATGG GTTCATTCTA CCTCGACATC ATCAAAGACC GTCAGTATAC CGCCAAAGCT
GACAGTGTGG CGCGTCGTAG CTGCCAGACT GCGCTGTATC ACATCGCAGA AGCGCTGGTG
CGCTGGATGG CACCAATCCT CTCCTTCACC GCTGATGAAG TGTGGGGCTA CCTGCCGGGC
GAACGTGAAA AATACGTCTT CACCGGCGAG TGGTACGAAG GCCTGTTTGG TCTGGCAGAC
AGTGAAGCGA TGAACGATGC GTTCTGGGAC GAGCTGTTGA AAGTGCGTGG CGAAGTGAAC
AAAGTCATTG AGCAAGCGCG TGCCGACAAG AAAGTGGGCG GCTCGCTGGA AGCGGCAGTA
ACCTTGTATG CAGAACCGGA ACTGGCGGCG AAACTGACCG CGCTGGGCGA TGAATTACGA
TTTGTCCTGT TGACCTCCGG CGCTACCGTT GCAGACTATA ACGATGCACC TGCTGATGCT
CAGCAGAGCG AAGTACTCAA AGGGCTGAAA GTCGCGTTGA GTAAAGCCGA AGGTGAGAAG
TGCCCACGCT GCTGGCACTA CACCCAGGAT GTCGGCAAGG TGGCGGAACA CGCAGAAATC
TGCGGCCGCT GTGTCAGCAA CGTCGCCGGT GACGGTGAAA AACGTAAGTT TGCCTGA
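
The gene sequence is displayed in space-separated blocks of 10 bases. As a rough illustration of how the GC content field could be recomputed from such a listing, here is a small Python sketch. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding the database uses for its reported percentage is not known, so only the raw percentage is computed.

```python
def clean_sequence(block):
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(block):
    """Percentage of G and C bases in a (possibly grouped) DNA sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above; paste the full listing to
# compare against the GC content field of the record.
first_row = "ATGAGTGACT ATAAATCAAC CCTGAATTTG CCGGAAACAG GGTTCCCGAT GCGTGGCGAT"
print(len(clean_sequence(first_row)), gc_percent(first_row))
```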
 
Protein sequence
MSDYKSTLNL PETGFPMRGD LAKREPGMLA RWTDDDLYGI IRAAKKGKKT FILHDGPPYA 
NGSIHIGHSV NKILKDIIVK SKGLSGYDSP YVPGWDCHGL PIELKVEQEY GKPGEKFTAA
EFRAKCREYA ATQVDGQRKD FIRLGVLGDW SHPYLTMDFK TEANIIRALG KIIGNGHLHK
GAKPVHWCVD CRSALAEAEV EYYDKTSPSI DVAFQAVDQD ALKAKFAVSN VNGPISLVIW
TTTPWTLPAN RAISIAPDFD YVLVQIDGQA VILAKDLVES VMQRIGVTDY TILGTVKGAE
LELLRFTHPF MGFDVPAILG DHVTLDAGTG AVHTAPGHGP DDYVIGQKYG LETANPVGPD
GTYLPGTYPT LDGVNVFKAN DIVVALLQEK GALLHVEKMQ HSYPCCWRHK TPIIFRATPQ
WFVSMDQKGL RAQSLKEIKG VQWIPDWGQA RIESMVANRP DWCISRQRTW GVPMSLFVHK
DTEELHPRTL ELMEEVAKRV EVDGIQAWWD LDAKEILGDE ADQYVKVPDT LDVWFDSGST
HSSVVDVRPE FAGHAADMYL EGSDQHRGWF MSSLMISTAM KGKAPYRQVL THGFTVDGQG
RKMSKSIGNT VSPQDVMNKL GADILRLWVA STDYTGEMAV SDEILKRAAD SYRRIRNTAR
FLLANLNGFD PAKDMVKPEE MVVLDRWAVG CAKAAQEDIL KAYEAYDFHE VVQRLMRFCS
VEMGSFYLDI IKDRQYTAKA DSVARRSCQT ALYHIAEALV RWMAPILSFT ADEVWGYLPG
EREKYVFTGE WYEGLFGLAD SEAMNDAFWD ELLKVRGEVN KVIEQARADK KVGGSLEAAV
TLYAEPELAA KLTALGDELR FVLLTSGATV ADYNDAPADA QQSEVLKGLK VALSKAEGEK
CPRCWHYTQD VGKVAEHAEI CGRCVSNVAG DGEKRKFA
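
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below shows one way to reproduce that in plain Python, without external libraries. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; table 11 differs in which codons may act as start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block):
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGTGACT ATAAATCAAC CCTGAATTTG"))  # MSDYKSTLNL
```

Pasting the full gene sequence into `translate` should yield a string that can be compared with the protein listing once its spaces and line breaks are removed.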