Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4514 |
Symbol | valS |
ID | 5593019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4517167 |
End bp | 4520022 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640923610 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001461051 |
Protein GI | 157163733 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.00256873 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGA CATATAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG CAGGGCTACT TTAAGCCTAA TGGCGATGAA AGCCAGGAAA GTTTCTGCAT CATGATCCCG CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCACGCCT TCCAGCAAAC CATCATGGAT ACCATGATCC GCTATCAGCG CATGCAGGGC AAAAACACCC TGTGGCAGGT CGGTACTGAC CACGCCGGGA TCGCTACCCA GATGGTCGTT GAGCGCAAGA TTGCCGCAGA AGAAGGTAAA ACCCGTCACG ACTACGGCCG CGAAGCTTTC ATCGACAAAA TCTGGGAATG GAAAGCGGAA TCTGGCGGCA CCATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTGGA CTGGGAGCGT GAACGCTTCA CCATGGACGA AGGCCTGTCC AATGCGGTGA AAGAAGTTTT CGTTCGTCTG TATAAAGAAG ACCTGATTTA CCGTGGCAAG CGCCTGGTAA ACTGGGACCC GAAACTGCGC ACCGCTATCT CTGACCTGGA AGTGGAAAAC CGCGAATCGA AAGGTTCGAT GTGGCACATC CGCTATCCGC TGGCTGACGG CGCGAAAACC GCAGACGGTA AAGATTATCT GGTGGTTGCG ACTACCCGTC CGGAAACCCT GCTGGGCGAT ACTGGCGTAG CCGTTAACCC GGAAGATCCG CGTTACAAAG ATCTGATTGG CAAATATGTC ATTCTGCCGC TGGTTAACCG TCGTATTCCG ATCGTTGGCG ACGAACACGC CGACATGGAA AAAGGCACCG GCTGCGTGAA GATCACCCCG GCGCACGACT TTAACGACTA TGAAGTCGGT AAACGTCACG CCCTGCCGAT GATCAACATC CTGACCTTTG ACGGCGATAT CCGTGAAAGT GCCCAGGTGT TCGATACCAA AGGTAACGAA TCTGACGTTT ATTCCAGCGA GATCCCGGCA GAATTCCAGA AACTGGAGCG TTTTGCTGCA CGTAAAGCGG TCGTTGCTGC GGTTGACGCA CTCGGCCTGC TGGAAGAAAT TAAACCACAT GACCTGACCG TTCCTTACGG CGACCGTGGC GGCGTGGTCA TCGAACCAAT GCTGACCGAC CAGTGGTACG TGCGTGCCGA TGTCCTGGCG AAACCGGCAG TTGAAGCGGT TGAGAACGGC GACATTCAGT TCGTACCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT CAGGACTGGT GTATCTCTCG TCAGTTGTGG TGGGGTCACC GTATCCCGGC ATGGTATGAC GAAGCGGGTA ACGTTTATGT TGGCCGCAAC GAAGACGAAG TGCGTAAAGA AAATAACCTC GGTGCTGATG TTGTCCTGCG TCAGGACGAA GACGTTCTCG ATACCTGGTT CTCTTCTGCG CTGTGGACCT TCTCTACCCT TGGCTGGCCG GAAAACACCG ACGCCCTGCG TCAGTTCCAC CCAACCAGCG TGATGGTATC TGGTTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC ATGATGACCA TGCACTTCAT CAAAGATGAA AATGGCAAAC CGCAGGTGCC GTTCCACACT GTTTACATGA CCGGTCTGAT TCGTGACGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT AACGTTATCG ACCCGCTGGA TATGGTTGAC GGTATTTCGC TGCCAGAACT GCTGGAAAAA CGTACCGGCA ATATGATGCA GCCGCAGCTG GCGGACAAAA TCCGTAAGCG CACCGAGAAA CAGTTCCCGA ACGGCATTGA GCCGCACGGC ACCGATGCCC TGCGCTTCAC CCTGGCGGCA CTGGCGTCTA CCGGTCGTGA CATCAACTGG GATATGAAGC GCCTGGAAGG TTACCGTAAC TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACAGA AGGTCAGGAT TGCGGCTTCA ACGGCGGCGA GATGACGCTG TCGCTGGCGG ATCGCTGGAT TCTGGCGGAG TTCAACCAGA CCATCAAAGC GTACCGCGAA GCGCTGGACA GCTTCCGCTT CGATATCGCC GCAGGCATTC TGTATGAGTT CACCTGGAAC CAGTTCTGTG ACTGGTATCT GGAGCTGACT AAGCCGGTAA TGAACGGTGG CACCGAAGCC GAACTGCGCG GTACTCGCCA TACGCTGGTG ACCGTGCTGG AAGGTCTGCT GCGCCTCGCG CATCCGATCA TTCCGTTCAT CACCGAAACC ATCTGGCAGC GTGTGAAAGT ACTTTGCGGT ATCACTGCCG ACACCATCAT GCTGCAGCCG TTCCCGCAGT ACGATGCATC TCAGGTTGAT GAAGCCGCAC TGGCCGACAC AGAGTGGCTG AAGCAGGCGA TCGTTGCGGT ACGTAACATC CGTGCAGAAA TGAACATCGC GCCGGGCAAA CCGCTGGAGC TGCTACTGCG TGGTTGCAGC GCGGATGCAG AACGTCGCGT AAATGAAAAC CGTGGCTTCC TGCAAACGCT GGCGCGTCTG GAAAGCATCA CCGTGCTGCC TGCCGATGAC AAAGGTCCGG TTTCCGTTAC TAAGATCATC GACGGTGCAG AGCTGCTGAT CCCGATGGCT GGCCTCATCA ACAAAGAAGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAGATTGAA GGTGAAATCA GCCGTATCGA GAACAAGCTG GCGAATGAAG GCTTTGTCGC CCGCGCACCG GAAGCGGTCA TCGCGAAAGA GCGTGAAAAG CTGGAAGGCT ATGCAGAAGC GAAAGCGAAG CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA
|
Protein sequence | MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI LTFDGDIRES AQVFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EDEVRKENNL GADVVLRQDE DVLDTWFSSA LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS ADAERRVNEN RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKEDELA RLAKEVAKIE GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAEAKAK LIEQQAVIAA L
|
| |