Gene EcSMS35_3023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3023 
SymbollysS 
ID6143305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3112457 
End bp3113974 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content53% 
IMG OID641617892 
Productlysyl-tRNA synthetase 
Protein accessionYP_001745043 
Protein GI170682698 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1190] Lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAC AACACGCACA GGGCGCTGAC GCGGTAGTCG ATCTTAACAA TGAACTGAAA 
ACGCGTCGTG AGAAGCTGGC GAACCTGCGC GAGCAGGGGA TTGCCTTCCC GAACGATTTC
CGTCGCGATC ATACCTCTGA CCAATTGCAC GCAGAATTCG ACGGTAAAGA GAACGAAGAA
CTGGAAGCGC TGAACATCGA AGTTGCCGTT GCTGGCCGCA TGATGACCCG TCGTATTATG
GGTAAAGCGT CTTTCGTTAC CCTGCAGGAC GTGGGGGGCC GCATTCAGCT GTACGTTGCC
CGTGACGATC TGCCGGAAGG CATTTACAAC GAGCAGTTCA AAAAATGGGA CCTCGGCGAC
ATCCTCGGCG CGAAAGGTAA ACTGTTCAAA ACCAAAACTG GCGAACTGTC TATCCACTGC
ACCGAGCTGC GTCTGCTGAC CAAAGCACTG CGTCCGCTGC CGGACAAATT CCACGGCTTG
CAGGATCAGG AAGCGCGCTA TCGTCAGCGT TATTTGGATC TCATCTCTAA CGATGAATCC
CGCAACACTT TTAAAGTGCG CTCGCAGATC CTCTCTGGTA TTCGCCAGTT CATGGTGAAC
CGCGGCTTTA TGGAAGTTGA AACGCCGATG ATGCAGGTGA TCCCAGGCGG TGCCGCTGCG
CGTCCGTTTA TCACCCACCA TAATGCGCTG GATCTCGACA TGTACCTGCG TATCGCGCCG
GAACTGTACC TCAAGCGTCT GGTGGTCGGT GGCTTCGAGC GTGTATTCGA AATCAACCGT
AACTTCCGTA ACGAAGGTAT CTCCGTACGT CATAACCCAG AGTTCACCAT GATGGAACTC
TATATGGCTT ACGCGGATTA CAAAGATCTG ATCGAGCTGA CCGAATCGCT GTTCCGTACT
CTGGCACAGG ATATTCTCGG TAAGACGGAA GTGACCTACG GCGACGTGAC GCTGGACTTC
GGTAAACCGT TCGAAAAACT GACCATGCGT GAAGCGATCA AGAAATATCG CCCGGAAACC
GACATGGCAG ATCTGGACAA CTTCGACTCT GCGAAAGCGA TTGCTGAATC TATCGGTATC
CACGTTGAGA AGAGCTGGGG TCTGGGCCGT ATCGTTACCG AGATCTTCGA AGAAGTGGCA
GAAGCACATC TGATTCAGCC GACCTTCATT ACTGAATATC CGGCAGAAGT TTCTCCGCTG
GCGCGTCGTA ACGACGTTAA CCCGGAAATC ACAGACCGCT TTGAGTTCTT CATTGGTGGT
CGTGAAATCG GTAACGGCTT TAGCGAGCTG AATGACGCGG AAGATCAGGC GCAGCGCTTC
CTGGATCAGG TTGCCGCGAA AGACGCAGGC GACGACGAAG CGATGTTCTA CGACGAAGAT
TATGTCACCG CACTGGAACA TGGCTTACCG CCGACAGCAG GTCTGGGAAT TGGTATCGAC
CGTATGGTAA TGCTGTTCAC CAACAGCCAT ACCATCCGCG ACGTTATTCT GTTCCCGGCG
ATGCGTCCGG TGAAATAA
 
Protein sequence
MSEQHAQGAD AVVDLNNELK TRREKLANLR EQGIAFPNDF RRDHTSDQLH AEFDGKENEE 
LEALNIEVAV AGRMMTRRIM GKASFVTLQD VGGRIQLYVA RDDLPEGIYN EQFKKWDLGD
ILGAKGKLFK TKTGELSIHC TELRLLTKAL RPLPDKFHGL QDQEARYRQR YLDLISNDES
RNTFKVRSQI LSGIRQFMVN RGFMEVETPM MQVIPGGAAA RPFITHHNAL DLDMYLRIAP
ELYLKRLVVG GFERVFEINR NFRNEGISVR HNPEFTMMEL YMAYADYKDL IELTESLFRT
LAQDILGKTE VTYGDVTLDF GKPFEKLTMR EAIKKYRPET DMADLDNFDS AKAIAESIGI
HVEKSWGLGR IVTEIFEEVA EAHLIQPTFI TEYPAEVSPL ARRNDVNPEI TDRFEFFIGG
REIGNGFSEL NDAEDQAQRF LDQVAAKDAG DDEAMFYDED YVTALEHGLP PTAGLGIGID
RMVMLFTNSH TIRDVILFPA MRPVK