Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4829 |
Symbol | valS |
ID | 5589285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4821236 |
End bp | 4824091 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640928438 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001465766 |
Protein GI | 157158681 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.204033 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGA CATACAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG CAGGGCTACT TTAAGCCTAA TGGCGATGAA AGCCAGGAAA GTTTCTGCAT CATGATCCCG CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCACGCCT TCCAGCAAAC CATCATGGAT ACCATGATCC GCTATCAGCG CATGCAGGGC AAAAACACCC TGTGGCAGGT CGGTACTGAC CACGCCGGGA TCGCTACCCA GATGGTCGTT GAGCGCAAGA TTGCCGCAGA AGAAGGTAAA ACCCGTCACG ACTACGGCCG CGAAGCTTTC ATCGACAAAA TTTGGGAATG GAAAGCGGAA TCCGGCGGCA CGATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTCGA CTGGGAGCGT GAACGCTTCA CCATGGACGA AGGCCTGTCC AATGCGGTGA AAGAAGTTTT CGTTCGTCTG TATAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTAA ACTGGGATCC GAAACTGCGC ACCGCTATCT CTGACCTGGA AGTCGAAAAC CGCGAATCGA AAGGTTCGAT GTGGCACATC CGCTATCCGC TGGCTGACGG TGCGAAAACC GCAGACGGTA AAGATTATCT GGTGGTCGCG ACTACCCGTC CAGAAACCCT GCTGGGCGAT ACTGGCGTAG CCGTTAACCC GGAAGATCCG CGTTACAAAG ATCTGATTGG CAAATATGTC ATTCTGCCGC TGGTTAACCG TCGTATTCCG ATCGTTGGCG ACGAACACGC CGACATGGAA AAAGGCACCG GCTGCGTGAA AATCACTCCG GCGCACGACT TTAACGACTA TGAAGTGGGT AAACGTCACG CCCTGCCGAT GATCAACATC CTGACCTTTG ACGGCGATAT CCGTGAAAGC GCCCAGGTGT TCGATACCAA AGGTAACGAA TCTGACGTTT ATTCCAGCGA AATCCCTGCA GAGTTCCAGA AACTGGAGCG TTTTGCTGCA CGTAAAGCAG TCGTTGCCGC AGTTGACGCG CTTGGCCTGC TGGAAGAAAT TAAACCGCAC GACCTGACCG TTCCTTACGG CGACCGTGGC GGCGTAGTTA TCGAACCAAT GCTGACCGAC CAGTGGTACG TGCGTGCCGA TGTACTGGCG AAACCGGCGG TTGAAGCGGT TGAGAACGGC GACATTCAGT TCGTACCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT CAGGACTGGT GTATCTCTCG TCAGCTGTGG TGGGGTCACC GTATCCCTGC ATGGTACGAC GAAGCAGGTA ACGTTTATGT TGGCCGCAAC GAAGAAGAAG TGCGCAAGGA AAATAACCTC GGTGCCGACG TTGCCCTGCG TCAGGACGAA GACGTTCTCG ACACCTGGTT CTCTTCTGCA CTGTGGACCT TCTCTACCCT TGGCTGGCCG GAAAACACCG ACGCCCTGCG TCAGTTCCAC CCAACCAGCG TGATGGTATC CGGCTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC ATGATGACCA TGCACTTCAT CAAAGATGAA AATGGCAAAC CGCAGGTGCC GTTCCACACT GTTTACATGA CCGGTCTGAT TCGTGACGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT AACGTTATCG ACCCGCTGGA TATGGTTGAC GGTATTTCGC TGCCAGAACT GCTGGAAAAA CGTACCGGCA ATATGATGCA GCCGCAGCTG GCGGACAAAA TCCGTAAGCG CACCGAGAAA CAGTTCCCGA ACGGCATTGA GCCGCACGGC ACCGATGCCC TGCGCTTCAC CCTGGCGGCG CTGGCGTCTA CCGGTCGTGA CATCAACTGG GATATGAAGC GCCTGGAAGG TTACCGTAAC TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACAGA AGGTCAGGAT TGCGGCTTCA ACGGCGGTGA AATGACGCTG TCGCTGGCGG ACCGCTGGAT TCTGGCAGAG TTCAACCAGA CCATCAAAGC GTACCGCGAA GCGCTGGACA GCTTCCGCTT CGATATCGCC GCAGGCATTC TGTATGAGTT CACCTGGAAC CAGTTCTGTG ACTGGTATCT GGAGCTGACT AAGCCGGTAA TGAACGGTGG CACCGAAGCC GAACTGCGCG GTACTCGCCA TACGCTGGTG ACCGTGCTGG AAGGTCTGCT GCGCCTCGCG CATCCGATCA TTCCGTTCAT CACCGAAACC ATCTGGCAGC GTGTGAAAGT ACTTTGCGGT ATCACTGCCG ACACCATCAT GCTGCAGCCG TTCCCGCAGT ACGATGCATC TCAGGTTGAT GAAGCCGCAC TGGCCGACAC AGAGTGGTTG AAGCAGGCGA TCGTTGCGGT ACGTAACATC CGTGCAGAAA TGAACATCGC GCCGGGCAAA CTGCTGGAGC TGCTGCTGCG TGGTTGCAGC GCGGATGCAG AACGTCGCGT AAATGAAAAC CGTGGCTTCC TGCAAACCCT GGCGCGTCTG GAAAGCATCA CCGTGCTGCC TGCCGATGAC AAAGGTCCGG TTTCCGTTAC GAAGATCATC GACGGCGCAG AGCTGCTGAT CCCGATGGCT GGCCTCATCA ACAAAGAAGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAGATCGAC GGCGAAATCA GCCGTATCGA GAACAAACTG GCGAACGAAG GCTTTGTAGC GCGCGCACCG GAAGCGGTCA TCGCGAAAGA GCGTGAGAAG CTGGAAGGCT ATGCAGAAGC GAAAGCGAAG CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA
|
Protein sequence | MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI LTFDGDIRES AQVFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EEEVRKENNL GADVALRQDE DVLDTWFSSA LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK LLELLLRGCS ADAERRVNEN RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKEDELA RLAKEVAKID GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAEAKAK LIEQQAVIAA L
|
| |