Gene EcHS_A0028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0028 
SymbolileS 
ID5594904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp25687 
End bp28503 
Gene Length2817 bp 
Protein Length938 aa 
Translation table11 
GC content56% 
IMG OID640919216 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_001456811 
Protein GI157159493 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00012642 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGACT ATAAATCAAC CCTGAATTTG CCGGAAACAG GGTTCCCGAT GCGTGGCGAT 
CTCGCCAAGC GCGAACCGGG AATGCTGGCG CGTTGGACTG ATGATGATCT GTACGGCATC
ATCCGTGCGG CTAAAAAAGG CAAAAAAACC TTCATTCTGC ATGATGGCCC TCCTTATGCG
AATGGCAGCA TTCATATTGG TCACTCGGTT AACAAGATTC TGAAAGACAT TATCGTGAAG
TCCAAAGGGC TTTCCGGTTA TGACTCGCCG TATGTGCCTG GCTGGGACTG CCACGGTCTG
CCGATCGAGC TGAAAGTAGA GCAAGAATAC GGTAAGCCGG GTGAGAAATT CACCGCCGCC
GAGTTCCGCG CCAAGTGCCG CGAATACGCG GCGACCCAGG TTGACGGTCA ACGCAAAGAC
TTTATCCGTC TGGGCGTGCT GGGCGACTGG TCGCACCCGT ACCTGACCAT GGACTTCAAA
ACTGAAGCCA ACATCATCCG CGCGCTGGGC AAAATCATCG GCAACGGTCA CCTGCACAAA
GGCGCGAAGC CAGTTCACTG GTGCGTTGAC TGCCGTTCTG CGCTGGCGGA AGCGGAAGTT
GAGTATTACG ACAAAACTTC TCTGTCCATC GACGTTGCTT TTCAGGCGGT CGATCAGGAT
GCACTGAAAG CAAAATTTGC CGTAAGCAAC GTTAACGGCC CAATCTCGCT GGTGATCTGG
ACCACTACGC CGTGGACTCT GCCTGCGAAC CGCGCAATCT CTATTGCACC TGATTTCGAC
TATGCGCTGG TGCAGATCGA CGGTCAGGCC GTGATTCTGG CGAAAGATCT GGTTGAAAGC
GTAATGCAGC GTATCGGCGT GACCGATTAC ACCATTCTCG GCACGGTAAA AGGTGCGGAG
CTTGAGTTGC TGCGCTTTAC CCATCCGTTT ATGGGCTTCG ACGTTCCGGC AATCCTCGGC
GATCACGTTA CCCTGGATGC CGGTACCGGT GCCGTTCACA CCGCGCCTGG CCACGGCCCG
GACGACTATG TGATCGGTCA GAAATACGGC CTGGAAACCG CTAACCCGGT TGGCCCGGAC
GGCACTTATC TGCCGGGCAC TTATCCGACG CTGGATGGCG TGAACGTCTT CAAAGCGAAC
GACATCGTCG TTGCGCTGCT GCAGGAAAAA GGCGCGCTGC TGCACGTTGA GAAAATGCAG
CACAGCTATC CGTGCTGCTG GCGTCACAAA ACGCCGATCA TCTTCCGCGC GACGCCGCAG
TGGTTCGTCA GCATGGATCA GAAAGGTCTG CGTGCGCAGT CACTGAAAGA GATCAAAGGC
GTGCAGTGGA TCCCGGACTG GGGCCAGGCG CGTATCGAGT CGATGGTTGC TAACCGTCCT
GACTGGTGTA TCTCCCGTCA GCGCACCTGG GGCGTACCGA TGTCACTGTT CGTGCACAAA
GACACGGAAG AGCTGCATCC GCGTACTCTC GAACTAATGG AAGAAGTGGC AAAACGCGTT
GAAGTTGACG GCATCCAGGC GTGGTGGGAT CTTGATGCGA AAGAGATCCT CGGCGATGAA
GCTGATCAGT ACGTGAAAGT GCCGGACACA TTGGATGTAT GGTTTGACTC CGGATCTACC
CACTCTTCTG TTGTTGACGT GCGTCCGGAA TTTGCCGGTC ACGCAGCGGA CATGTATCTG
GAAGGTTCTG ACCAGCACCG TGGTTGGTTC ATGTCTTCCC TAATGATCTC CACCGCGATG
AAAGGCAAAG CGCCGTATCG TCAGGTACTG ACCCACGGCT TTACCGTGGA TGGTCAGGGT
CGCAAGATGT CTAAATCCAT CGGCAATACC GTTTCGCCGC AGGATGTGAT GAACAAACTG
GGCGCGGATA TTCTGCGTCT GTGGGTGGCA TCAACCGACT ACACCGGTGA AATGGCCGTT
TCTGACGAGA TCCTGAAACG TGCTGCCGAT AGCTATCGTC GTATCCGTAA CACCGCGCGC
TTCCTGCTGG CAAACCTGAA CGGTTTTGAT CCAGCAAAAG ATATGGTGAA ACCGGAAGAG
ATGGTGGTAC TGGATCGCTG GGCCGTAGGT TGTGCGAAAG CGGCACAGGA AGACATCCTC
AAGGCGTACG AAGCATACGA TTTCCACGAA GTGGTACAGC GTCTGATGCG CTTCTGCTCC
GTTGAGATGG GTTCCTTCTA CCTCGACATC ATCAAAGACC GTCAGTACAC CGCCAAAGCG
GACAGTGTGG CGCGTCGTAG CTGCCAGACT GCGCTGTATC ACATCGCAGA AGCGCTGGTG
CGCTGGATGG CACCAATCCT CTCCTTCACC GCTGATGAAG TGTGGGGCTA CCTGCCGGGC
GAACGTGAAA AATACGTCTT CACCGGTGAG TGGTACGAAG GCCTGTTTGG CCTGGCAGAC
AGTGAAGCGA TGAACGATGC GTTCTGGGAC GAGCTGTTGA AAGTGCGTGG CGAAGTGAAC
AAAGTCATTG AGCAAGCGCG TGCCGACAAG AAAGTGGGTG GCTCGCTGGA AGCGGCAGTA
ACCTTGTATG CAGAACCGGA ACTGTCGGCG AAACTGACCG CGCTGGGCGA TGAATTACGA
TTTGTCCTGT TGACCTCCGG CGCTACCGTT GCAGACTATA ACGACGCACC TGCTGATGCT
CAGCAGAGCG AAGTACTCAA AGGGCTGAAA GTCGCGTTGA GTAAAGCCGA AGGTGAGAAG
TGCCCACGCT GCTGGCACTA CACCCAGGAT GTCGGCAAGG TGGCGGAACA CGCAGAAATC
TGCGGCCGCT GTGTCAGCAA CGTCGCCGGT GACGGTGAAA AACGTAAGTT TGCCTGA
 
Protein sequence
MSDYKSTLNL PETGFPMRGD LAKREPGMLA RWTDDDLYGI IRAAKKGKKT FILHDGPPYA 
NGSIHIGHSV NKILKDIIVK SKGLSGYDSP YVPGWDCHGL PIELKVEQEY GKPGEKFTAA
EFRAKCREYA ATQVDGQRKD FIRLGVLGDW SHPYLTMDFK TEANIIRALG KIIGNGHLHK
GAKPVHWCVD CRSALAEAEV EYYDKTSLSI DVAFQAVDQD ALKAKFAVSN VNGPISLVIW
TTTPWTLPAN RAISIAPDFD YALVQIDGQA VILAKDLVES VMQRIGVTDY TILGTVKGAE
LELLRFTHPF MGFDVPAILG DHVTLDAGTG AVHTAPGHGP DDYVIGQKYG LETANPVGPD
GTYLPGTYPT LDGVNVFKAN DIVVALLQEK GALLHVEKMQ HSYPCCWRHK TPIIFRATPQ
WFVSMDQKGL RAQSLKEIKG VQWIPDWGQA RIESMVANRP DWCISRQRTW GVPMSLFVHK
DTEELHPRTL ELMEEVAKRV EVDGIQAWWD LDAKEILGDE ADQYVKVPDT LDVWFDSGST
HSSVVDVRPE FAGHAADMYL EGSDQHRGWF MSSLMISTAM KGKAPYRQVL THGFTVDGQG
RKMSKSIGNT VSPQDVMNKL GADILRLWVA STDYTGEMAV SDEILKRAAD SYRRIRNTAR
FLLANLNGFD PAKDMVKPEE MVVLDRWAVG CAKAAQEDIL KAYEAYDFHE VVQRLMRFCS
VEMGSFYLDI IKDRQYTAKA DSVARRSCQT ALYHIAEALV RWMAPILSFT ADEVWGYLPG
EREKYVFTGE WYEGLFGLAD SEAMNDAFWD ELLKVRGEVN KVIEQARADK KVGGSLEAAV
TLYAEPELSA KLTALGDELR FVLLTSGATV ADYNDAPADA QQSEVLKGLK VALSKAEGEK
CPRCWHYTQD VGKVAEHAEI CGRCVSNVAG DGEKRKFA