Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4738 |
Symbol | valS |
ID | 6146645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4835957 |
End bp | 4838812 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619553 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001746661 |
Protein GI | 170683054 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0366213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.55715 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGA CATATAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG CAGGGCTACT TTAAGCCTAA TGGCGATGAA AGCCAGGAAA GTTTCTGCAT CATGATCCCG CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCACGCCT TCCAGCAAAC CATCATGGAT ACCATGATCC GCTATCAGCG TATGCAGGGT AAAAACACCC TGTGGCAGGT CGGTACTGAC CACGCCGGGA TCGCTACCCA GATGGTCGTT GAGCGCAAGA TTGCCGCAGA AGAAGGTAAA ACCCGTCACG ATTACGGCCG CGAAGCTTTC ATCGACAAAA TCTGGGAATG GAAAGCGGAA TCCGGCGGCA CCATTACCCG TCAGATGCGT CGTCTCGGCA ACTCCGTGGA CTGGGAGCGT GAACGCTTCA CGATGGACGA AGGCCTGTCC AATGCAGTGA AAGAAGTTTT CGTTCGTCTG TATAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTAA ACTGGGACCC GAAACTGCGC ACCGCTATCT CCGACCTGGA AGTGGAAAAC CGCGAATCGA AAGGTTCGAT GTGGCACATC CGCTATCCGC TGGCTGACGG CGCGAAAACC GCAGACGGTA AAGATTATCT GGTGGTTGCG ACTACCCGTC CGGAAACCCT GCTGGGCGAT ACCGGCGTGG CCGTTAACCC GGAAGATCCG CGTTATAAAG ATCTGATTGG CAAATATGTC ATTCTGCCGC TGGTTAACCG TCGTATTCCG ATCGTTGGCG ACGAACACGC CGACATGGAA AAAGGCACTG GCTGCGTGAA GATCACCCCA GCGCACGACT TTAACGACTA TGAAGTCGGT AAACGTCACG CCCTGCCGAT GATCAACATC CTGACCTTTG ACGGTGATAT CCGTGAAAGC GCCCAGGTGT TCGATACCAA AGGTAACGAA TCTGACGTTT ATTCCAGCGA AATCCCGGCA GAGTTCCAGA AACTCGAGCG TTTTGCTGCA CGTAAAGCCG TCGTTGCTGC GGTTGACGCG CTCGGCCTGC TGGAAGAAAT TAAACCGCAC GACCTGACCG TTCCTTACGG CGACCGTGGC GGCGTGGTTA TCGAACCGAT GCTGACCGAC CAGTGGTACG TGCGTGCCGA TGTACTGGCG AAACCGGCAG TTGAAGCGGT TGAGAACGGC GACATTCAGT TCGTACCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT CAGGACTGGT GTATCTCTCG TCAGCTATGG TGGGGTCACC GTATCCCGGC ATGGTATGAC GAAGCGGGTA ACGTTTATGT TGGCCGCAAC GAAGACGAAG TGCGTAAAGA AAATAACCTC GGTGCTGATG TTGTCCTGCG TCAGGACGAA GACGTTCTCG ACACCTGGTT CTCTTCTGCG CTGTGGACCT TCTCTACCCT TGGCTGGCCG GAAAATACCG ACGCCCTGCG TCAGTTCCAC CCAACCAGCG TGATGGTATC TGGTTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC ATGATGACCA TGCACTTCAT CAAAGATGAA AATGGCAAAC CGCAGGTGCC GTTCCACACT GTTTACATGA CCGGTCTGAT TCGTGACGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT AACGTTATCG ACCCGCTGGA TATGGTTGAC GGTATTTCGC TGCCAGAACT GCTGGAAAAA CGTACCGGCA ATATGATGCA GCCGCAGCTG GCGGACAAAA TCCGTAAGCG CACCGAGAAA CAGTTCCCGA ACGGCATTGA GCCGCACGGC ACCGATGCCC TGCGCTTCAC CCTGGCGGCG CTGGCTTCTA CCGGTCGTGA CATCAACTGG GATATGAAGC GTCTGGAAGG CTACCGTAAC TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACAGA AGGTCAGGAT TGCGGCTTCA ACGGCGGTGA AATGACGCTG TCGCTGGCGG ACCGCTGGAT TCTGGCGGAG TTCAACCAGA CCATCAAAGC GTACCGCGAA GCGCTGGACA GCTTCCGTTT CGATATTGCC GCAGGCATTC TGTATGAATT CACCTGGAAC CAGTTCTGTG ACTGGTATCT GGAGCTGACC AAGCCGGTAA TGAACGGTGG CACCGAAGCC GAACTGCGCG GTACTCGCCA TACGCTGGTG ACTGTACTGG AAGGTCTGCT GCGCCTCGCG CATCCGATCA TTCCGTTCAT CACCGAAACC ATCTGGCAGC GTGTGAAAGT ACTTTGCGGT ATCACTGCCG ACACCATCAT GCTGCAACCG TTCCCGCAGT ACGATGCGTC TCAGGTCGAT GAAGCCGCAC TGGCCGACAC CGAGTGGCTG AAGCAGGCGA TCGTTGCGGT ACGTAACATC CGTGCAGAAA TGAACATCGC ACCGGGTAAA CCGCTGGAGC TGCTGCTGCG TGGTTGCAGC GCGGATGCAG AACGTCGCGT AAATGAAAAC CGTGGCTTCC TGCAAACGCT GGCGCGTCTG GAAAGTATCA CCGTGCTGCC TGCCGATGAC AAAGGTCCGG TTTCCGTTAC CAAGATCATC GACGGCGCAG AGCTGCTGAT CCCGATGGCT GGCCTCATCA ACAAAGAAGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAGATTGAA GGTGAAATCA GCCGTATCGA GAACAAACTG GCGAACGAAG GCTTTGTCGC CCGCGCACCG GAAGCGGTCA TCGCGAAAGA GCGTGAGAAG CTGGAAGGCT ATGCAGAAGC GAAAGCGAAG CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA
|
Protein sequence | MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI LTFDGDIRES AQVFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EDEVRKENNL GADVVLRQDE DVLDTWFSSA LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS ADAERRVNEN RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKEDELA RLAKEVAKIE GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAEAKAK LIEQQAVIAA L
|
| |