Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3754 |
Symbol | valS |
ID | 6068110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4107039 |
End bp | 4109894 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641603169 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001726688 |
Protein GI | 170021734 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.775633 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGA CATACAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG CAGGGCTACT TTAAGCCTAA TGGCGATGAA AGCCAGGAAA GTTTCTGCAT CATGATCCCG CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCACGCCT TCCAGCAAAC CATCATGGAT ACCATGATCC GCTATCAGCG CATGCAGGGC AAAAACACCC TGTGGCAGGT CGGTACTGAC CACGCCGGGA TCGCTACCCA GATGGTCGTT GAGCGCAAGA TTGCCGCAGA AGAAGGTAAA ACCCGTCACG ACTACGGCCG CGAAGCTTTC ATCGACAAAA TTTGGGAATG GAAAGCGGAA TCCGGCGGCA CGATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTCGA CTGGGAGCGT GAACGCTTCA CCATGGACGA AGGCCTGTCC AATGCGGTGA AAGAAGTTTT CGTTCGTCTG TATAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTAA ACTGGGATCC GAAACTGCGC ACCGCTATCT CTGACCTGGA AGTCGAAAAC CGCGAATCGA AAGGTTCGAT GTGGCACATC CGCTATCCGC TGGCTGACGG TGCGAAAACC GCAGACGGTA AAGATTATCT GGTGGTCGCG ACTACCCGTC CAGAAACCCT GCTGGGCGAT ACTGGCGTAG CCGTTAACCC GGAAGATCCG CGTTACAAAG ATCTGATTGG CAAATATGTC ATTCTGCCGC TGGTTAACCG TCGTATTCCG ATCGTTGGCG ACGAACACGC CGACATGGAA AAAGGCACCG GCTGCGTGAA AATCACTCCG GCGCACGACT TTAACGACTA TGAAGTGGGT AAACGTCACG CCCTGCCGAT GATCAACATC CTGACCTTTG ACGGCGATAT CCGTGAAAGC GCCCAGGTGT TCGATACCAA AGGTAACGAA TCTGACGTTT ATTCCAGCGA AATCCCTGCA GAGTTCCAGA AACTGGAGCG TTTTGCTGCA CGTAAAGCAG TCGTTGCCGC AGTTGACGCG CTTGGCCTGC TGGAAGAAAT TAAACCGCAC GACCTGACCG TTCCTTACGG CGACCGTGGC GGCGTAGTTA TCGAACCAAT GCTGACCGAC CAGTGGTACG TGCGTGCCGA TGTCCTGGCG AAACCGGCGG TTGAAGCGGT TGAGAACGGC GACATTCAGT TCGTACCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT CAGGACTGGT GTATCTCTCG TCAGTTGTGG TGGGGTCACC GTATCCCGGC ATGGTATGAC GAAGCGGGTA ACGTTTATGT TGGCCGCAAC GAAGACGAAG TGCGTAAAGA AAATAACCTC GGTGCTGATG TTGTCCTGCG TCAGGACGAA GACGTTCTCG ATACCTGGTT CTCTTCTGCG CTGTGGACCT TCTCTACCCT TGGCTGGCCG GAAAATACCG ACGCCCTGCG TCAGTTCCAC CCAACCAGCG TGATGGTATC TGGTTTCGAC ATCATTTTCT TCTGGATTGC CCGCATGATC ATGATGACCA TGCACTTCAT CAAAGATGAA AATGGCAAAC CGCAGGTGCC GTTCCACACC GTTTACATGA CCGGCCTGAT TCGTGATGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT AACGTTATCG ACCCACTGGA TATGGTTGAC GGTATTTCGC TGCCAGAACT GCTGGAAAAA CGTACCGGCA ATATGATGCA GCCGCAGCTG GCGGACAAAA TCCGTAAGCG CACCGAGAAG CAGTTCCCGA ACGGTATTGA GCCGCACGGT ACTGACGCGC TGCGCTTCAC CCTGGCGGCG CTGGCGTCTA CCGGTCGTGA CATCAACTGG GATATGAAGC GTCTGGAAGG TTACCGTAAC TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACAGA AGGTCAGGAT TGCGGCTTCA ACGGCGGCGA AATGACGCTG TCGCTGGCGG ACCGCTGGAT TCTGGCGGAG TTCAACCAGA CCATCAAAGC GTACCGCGAA GCGCTGGACA GCTTCCGCTT CGATATCGCC GCAGGCATTC TGTATGAGTT CACCTGGAAC CAGTTCTGTG ACTGGTATCT CGAGCTGACC AAGCCGGTAA TGAACGGTGG CACCGAAGCA GAACTGCGCG GTACTCGCCA TACGCTGGTG ACTGTACTGG AAGGTCTGCT GCGCCTCGCG CATCCGATCA TTCCGTTCAT CACCGAAACC ATCTGGCAGC GTGTGAAAGT ACTTTGCGGT ATCACTGCCG ACACCATCAT GCTGCAGCCG TTCCCGCAGT ACGATGCATC TCAGGTTGAT GAAGCCGCAC TGGCCGACAC CGAATGGCTG AAACAGGCGA TCGTTGCGGT ACGTAACATC CGTGCAGAAA TGAACATCGC GCCGGGCAAA CCGCTGGAGC TGCTGCTGCG TGGTTGCAGC GCGGATGCAG AACGTCGCGT AAATGAAAAC CGTGGCTTCC TGCAAACCCT GGCGCGTCTG GAAAGTATCA CCGTGCTGCC TGCCGATGAC AAAGGTCCGG TTTCCGTTAC GAAGATCATC GACGGTGCAG AGCTGCTGAT CCCGATGGCT GGCCTCATCA ACAAAGAAGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAGATTGAA GGTGAAATCA GCCGTATCGA GAACAAACTG GCGAACGAAG GCTTTGTCGC CCGCGCACCG GAAGCGGTCA TCGCGAAAGA GCGTGAGAAG CTGGAAGGCT ATGCGAAAGC GAAAGCGAAA CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA
|
Protein sequence | MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI LTFDGDIRES AQVFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EDEVRKENNL GADVVLRQDE DVLDTWFSSA LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS ADAERRVNEN RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKEDELA RLAKEVAKIE GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAKAKAK LIEQQAVIAA L
|
| |