Gene EcSMS35_4738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4738 
SymbolvalS 
ID6146645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4835957 
End bp4838812 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content55% 
IMG OID641619553 
Productvalyl-tRNA synthetase 
Protein accessionYP_001746661 
Protein GI170683054 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0366213 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.55715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA CATATAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG 
CAGGGCTACT TTAAGCCTAA TGGCGATGAA AGCCAGGAAA GTTTCTGCAT CATGATCCCG
CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCACGCCT TCCAGCAAAC CATCATGGAT
ACCATGATCC GCTATCAGCG TATGCAGGGT AAAAACACCC TGTGGCAGGT CGGTACTGAC
CACGCCGGGA TCGCTACCCA GATGGTCGTT GAGCGCAAGA TTGCCGCAGA AGAAGGTAAA
ACCCGTCACG ATTACGGCCG CGAAGCTTTC ATCGACAAAA TCTGGGAATG GAAAGCGGAA
TCCGGCGGCA CCATTACCCG TCAGATGCGT CGTCTCGGCA ACTCCGTGGA CTGGGAGCGT
GAACGCTTCA CGATGGACGA AGGCCTGTCC AATGCAGTGA AAGAAGTTTT CGTTCGTCTG
TATAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTAA ACTGGGACCC GAAACTGCGC
ACCGCTATCT CCGACCTGGA AGTGGAAAAC CGCGAATCGA AAGGTTCGAT GTGGCACATC
CGCTATCCGC TGGCTGACGG CGCGAAAACC GCAGACGGTA AAGATTATCT GGTGGTTGCG
ACTACCCGTC CGGAAACCCT GCTGGGCGAT ACCGGCGTGG CCGTTAACCC GGAAGATCCG
CGTTATAAAG ATCTGATTGG CAAATATGTC ATTCTGCCGC TGGTTAACCG TCGTATTCCG
ATCGTTGGCG ACGAACACGC CGACATGGAA AAAGGCACTG GCTGCGTGAA GATCACCCCA
GCGCACGACT TTAACGACTA TGAAGTCGGT AAACGTCACG CCCTGCCGAT GATCAACATC
CTGACCTTTG ACGGTGATAT CCGTGAAAGC GCCCAGGTGT TCGATACCAA AGGTAACGAA
TCTGACGTTT ATTCCAGCGA AATCCCGGCA GAGTTCCAGA AACTCGAGCG TTTTGCTGCA
CGTAAAGCCG TCGTTGCTGC GGTTGACGCG CTCGGCCTGC TGGAAGAAAT TAAACCGCAC
GACCTGACCG TTCCTTACGG CGACCGTGGC GGCGTGGTTA TCGAACCGAT GCTGACCGAC
CAGTGGTACG TGCGTGCCGA TGTACTGGCG AAACCGGCAG TTGAAGCGGT TGAGAACGGC
GACATTCAGT TCGTACCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT
CAGGACTGGT GTATCTCTCG TCAGCTATGG TGGGGTCACC GTATCCCGGC ATGGTATGAC
GAAGCGGGTA ACGTTTATGT TGGCCGCAAC GAAGACGAAG TGCGTAAAGA AAATAACCTC
GGTGCTGATG TTGTCCTGCG TCAGGACGAA GACGTTCTCG ACACCTGGTT CTCTTCTGCG
CTGTGGACCT TCTCTACCCT TGGCTGGCCG GAAAATACCG ACGCCCTGCG TCAGTTCCAC
CCAACCAGCG TGATGGTATC TGGTTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC
ATGATGACCA TGCACTTCAT CAAAGATGAA AATGGCAAAC CGCAGGTGCC GTTCCACACT
GTTTACATGA CCGGTCTGAT TCGTGACGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT
AACGTTATCG ACCCGCTGGA TATGGTTGAC GGTATTTCGC TGCCAGAACT GCTGGAAAAA
CGTACCGGCA ATATGATGCA GCCGCAGCTG GCGGACAAAA TCCGTAAGCG CACCGAGAAA
CAGTTCCCGA ACGGCATTGA GCCGCACGGC ACCGATGCCC TGCGCTTCAC CCTGGCGGCG
CTGGCTTCTA CCGGTCGTGA CATCAACTGG GATATGAAGC GTCTGGAAGG CTACCGTAAC
TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACAGA AGGTCAGGAT
TGCGGCTTCA ACGGCGGTGA AATGACGCTG TCGCTGGCGG ACCGCTGGAT TCTGGCGGAG
TTCAACCAGA CCATCAAAGC GTACCGCGAA GCGCTGGACA GCTTCCGTTT CGATATTGCC
GCAGGCATTC TGTATGAATT CACCTGGAAC CAGTTCTGTG ACTGGTATCT GGAGCTGACC
AAGCCGGTAA TGAACGGTGG CACCGAAGCC GAACTGCGCG GTACTCGCCA TACGCTGGTG
ACTGTACTGG AAGGTCTGCT GCGCCTCGCG CATCCGATCA TTCCGTTCAT CACCGAAACC
ATCTGGCAGC GTGTGAAAGT ACTTTGCGGT ATCACTGCCG ACACCATCAT GCTGCAACCG
TTCCCGCAGT ACGATGCGTC TCAGGTCGAT GAAGCCGCAC TGGCCGACAC CGAGTGGCTG
AAGCAGGCGA TCGTTGCGGT ACGTAACATC CGTGCAGAAA TGAACATCGC ACCGGGTAAA
CCGCTGGAGC TGCTGCTGCG TGGTTGCAGC GCGGATGCAG AACGTCGCGT AAATGAAAAC
CGTGGCTTCC TGCAAACGCT GGCGCGTCTG GAAAGTATCA CCGTGCTGCC TGCCGATGAC
AAAGGTCCGG TTTCCGTTAC CAAGATCATC GACGGCGCAG AGCTGCTGAT CCCGATGGCT
GGCCTCATCA ACAAAGAAGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAGATTGAA
GGTGAAATCA GCCGTATCGA GAACAAACTG GCGAACGAAG GCTTTGTCGC CCGCGCACCG
GAAGCGGTCA TCGCGAAAGA GCGTGAGAAG CTGGAAGGCT ATGCAGAAGC GAAAGCGAAG
CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA
 
Protein sequence
MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD 
TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE
SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR
TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP
RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI
LTFDGDIRES AQVFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH
DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EDEVRKENNL GADVVLRQDE DVLDTWFSSA
LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT
VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK
QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD
CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT
KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP
FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS ADAERRVNEN
RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKEDELA RLAKEVAKIE
GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAEAKAK LIEQQAVIAA L