Gene ECH74115_4180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4180 
SymbollysS 
ID6968074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3876203 
End bp3877720 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content52% 
IMG OID643387925 
Productlysyl-tRNA synthetase 
Protein accessionYP_002272364 
Protein GI209398148 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1190] Lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAC AACACGCACA GGGCGCTGAC GCGGTAGTCG ATCTTAACAA TGAACTGAAA 
ACGCGTCGTG AGAAGCTGGC GAACCTGCGT GAGCAGGGAA TTGCCTTCCC GAACGATTTC
CGTCGCGATC ATACCTCTGA CCAATTGCAC GCAGAATTCG ACGGTAAAGA GAACGAAGAA
CTTGAAGCGC TGAACATCGA AGTCGCCGTT GCTGGCCGCA TGATGACCCG CCGTATTATG
GGTAAAGCCT CTTTTGTTAC CCTGCAGGAC GTTGGCGGTC GTATTCAGCT GTACGTTGCC
CGTGACGATC TTCCGGAAGG CGTTTACAAC GAGCAGTTCA AAAAATGGGA CCTCGGCGAC
ATCCTCGGCG CGAAAGGTAA ACTGTTCAAA ACCAAAACCG GCGAACTGTC TATCCACTGT
ACCGAGTTGC GTCTGCTGAC CAAAGCACTG CGTCCGCTGC CGGATAAATT CCACGGCTTG
CAGGATCAGG AAGCGCGCTA TCGTCAGCGT TATCTCGATC TCATCTCCAA CGATGAATCC
CGCAACACCT TTAAAGTGCG CTCGCAGATC CTCTCTGGTA TTCGCCAGTT CATGGTGAAC
CGCGGCTTTA TGGAAGTTGA AACGCCGATG ATGCAGGTGA TCCCTGGCGG TGCCGCTGCG
CGTCCGTTTA TCACCCACCA TAACGCGCTG GATCTCGACA TGTACCTGCG TATCGCGCCG
GAACTGTACC TCAAGCGTCT GGTGGTTGGT GGCTTCGAGC GTGTATTCGA AATCAACCGT
AACTTCCGTA ACGAAGGTAT TTCCGTACGT CATAACCCAG AGTTCACCAT GATGGAACTC
TATATGGCTT ACGCAGATTA CAAAGATCTG ATCGAGCTGA CCGAATCGCT GTTCCGTACT
CTGGCACAGG ATATTCTCGG TAAGACGGAA GTGACCTACG GCGACGTGAC GCTGGATTTC
GGTAAACCGT TCGAAAAACT GACCATGCGT GAAGCGATTA AGAAATATCG CCCGGAAACC
GACATGGCGG ATCTGGACAA CTTCGACTCT GCAAAAGCGA TTGTTGAATC TATCGGTATC
CACGTTGAGA AGAGCTGGGG TCTGGGCCGT ATCGTTACCG AGATCTTCGA AGAAGTGGCA
GAAGCACATC TGATTCAGCC GACCTTCATT ACTGAATATC CGGCAGAAGT TTCTCCGCTG
GCGCGTCGTA ACGACGTTAA CCCGGAAATC ACAGACCGCT TTGAGTTCTT CATTGGTGGT
CGTGAAATCG GTAACGGCTT TAGCGAGCTG AATGATGCGG AAGATCAGGC GCAACGCTTC
CTGGATCAGG TTGCCGCGAA AGATGCAGGT GACGACGAAG CGATGTTCTA CGACGAAGAT
TACGTCACCG CACTGGAACA TGGCTTACCG CCGACAGCAG GTCTGGGGAT TGGTATCGAC
CGTATGGTAA TGCTGTTCAC CAACAGCCAT ACCATCCGCG ACGTTATTCT GTTCCCGGCG
ATGCGTCCGG TGAAATAA
 
Protein sequence
MSEQHAQGAD AVVDLNNELK TRREKLANLR EQGIAFPNDF RRDHTSDQLH AEFDGKENEE 
LEALNIEVAV AGRMMTRRIM GKASFVTLQD VGGRIQLYVA RDDLPEGVYN EQFKKWDLGD
ILGAKGKLFK TKTGELSIHC TELRLLTKAL RPLPDKFHGL QDQEARYRQR YLDLISNDES
RNTFKVRSQI LSGIRQFMVN RGFMEVETPM MQVIPGGAAA RPFITHHNAL DLDMYLRIAP
ELYLKRLVVG GFERVFEINR NFRNEGISVR HNPEFTMMEL YMAYADYKDL IELTESLFRT
LAQDILGKTE VTYGDVTLDF GKPFEKLTMR EAIKKYRPET DMADLDNFDS AKAIVESIGI
HVEKSWGLGR IVTEIFEEVA EAHLIQPTFI TEYPAEVSPL ARRNDVNPEI TDRFEFFIGG
REIGNGFSEL NDAEDQAQRF LDQVAAKDAG DDEAMFYDED YVTALEHGLP PTAGLGIGID
RMVMLFTNSH TIRDVILFPA MRPVK