Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_04124 |
Symbol | valS |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | - |
Start bp | 4392380 |
End bp | 4395235 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | valyl-tRNA synthetase |
Protein accession | ACT45912 |
Protein GI | 253980242 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGA CATATAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG CAGGGCTACT TTAAGCCTAA TGGCGATGAA AGCCAGGAAA GTTTCTGCAT CATGATCCCG CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCACGCCT TCCAGCAAAC CATCATGGAT ACCATGATCC GCTATCAGCG CATGCAGGGC AAAAACACCC TGTGGCAGGT CGGTACTGAC CACGCCGGGA TCGCTACCCA GATGGTCGTT GAGCGCAAGA TTGCCGCAGA AGAAGGTAAA ACCCGTCACG ACTACGGCCG CGAAGCTTTC ATCGACAAAA TTTGGGAATG GAAAGCGGAA TCCGGCGGCA CCATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTCGA CTGGGAGCGT GAACGCTTCA CCATGGACGA AGGCCTGTCC AATGCGGTGA AAGAAGTTTT CGTTCGTCTA TATAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTAA ACTGGGACCC GAAACTGCGC ACCGCTATCT CTGACCTGGA AGTGGAAAAC CGCGAATCGA AAGGTTCGAT GTGGCACATC CGCTATCCGC TGGCTGACGG TGCGAAAACC GCAGACGGTA AAGATTATCT GGTGGTCGCG ACTACCCGTC CGGAAACCCT GCTGGGCGAT ACTGGCGTAG CCGTTAACCC GGAAGATCCG CGTTACAAAG ATCTGATTGG CAAATATGTC ATTCTGCCGC TGGTTAACCG TCGTATTCCG ATCGTTGGCG ACGAACACGC CGACATGGAA AAAGGCACCG GCTGCGTGAA AATCACTCCG GCGCACGACT TTAACGACTA TGAAGTGGGT AAACGTCACG CGCTGCCGAT GATCAACATC CTGACCTTTG ACGGCGATAT CCGTGAAAGC GCCCAGGTGT TCGATACCAA AGGTAACGAA TCTGACGTTT ATTCCAGCGA GATCCCGGCA GAATTCCAGA AACTGGAGCG TTTTGCCGCA CGTAAAGCAG TCGTTGCTGC GGTTGACGCA CTCGGCCTGC TGGAAGAAAT TAAACCGCAC GACCTGACCG TTCCTTACGG CGACCGTGGC GGCGTGGTTA TCGAACCAAT GCTGACCGAC CAGTGGTACG TACGTGCCGA TGTACTGGCG AAGCCGGCGG TTGAAGCGGT TGAGAACGGC GACATTCAGT TCGTACCGAA GCAGTACGAA AACATGTATT TCTCCTGGAT GCGCGATATT CAGGACTGGT GTATCTCTCG TCAGCTGTGG TGGGGTCACC GTATCCCGGC ATGGTATGAC GAAGCGGGTA ATGTTTATGT TGGCCGCAAC GAAGAAGAAG TGCGCAAGGA AAATAACCTC GGTGCCGACG TTGCCCTGCG TCAGGACGAA GACGTTCTCG ACACCTGGTT CTCTTCTGCG CTGTGGACCT TCTCTACCCT TGGCTGGCCG GAAAACACCG ACGCCCTGCG TCAGTTCCAC CCAACCAGCG TGATGGTATC CGGCTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC ATGATGACCA TGCACTTCAT CAAAGATGAA AATGGCAAAC CGCAGGTGCC GTTCCACACC GTTTACATGA CCGGCCTGAT TCGTGATGAC GAAGGCCAGA AGATGTCTAA ATCCAAGGGT AACGTTATTG ATCCGCTGGA TATGGTTGAC GGTATTTCGC TGCCAGAACT GCTGGAAAAA CGTACCGGCA ATATGATGCA GCCGCAGCTG GCGGACAAAA TCCGTAAGCG CACCGAGAAG CAGTTCCCGA ACGGTATTGA GCCGCACGGT ACTGACGCGC TGCGCTTCAC CCTGGCGGCG CTGGCGTCTA CCGGTCGTGA CATCAACTGG GATATGAAGC GTCTGGAAGG TTACCGTAAC TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACGGA AGGTCAGGAT TGCGGCTTTA ACGGCGGTGA AATGACGCTG TCGCTGGCGG ATCGCTGGAT TCTGGCTGAG TTCAACCAGA CCATCAAAGC GTACCGCGAA GCGCTGGACA GCTTCCGCTT CGATATCGCT GCAGGCATTC TGTATGAGTT CACCTGGAAC CAGTTCTGTG ACTGGTATCT GGAGCTGACT AAGCCGGTAA TGAACGGTGG AACCGAAGCC GAACTGCGCG GTACTCGCCA TACGCTGGTG ACCGTACTGG AAGGTCTGCT GCGCCTCGCG CATCCGATCA TTCCGTTCAT CACCGAAACC ATCTGGCAGC GTGTGAAAGT ACTTTGCGGT ATCACTGCCG ACACCATCAT GCTGCAGCCG TTCCCGCAGT ACGATGCATC TCAGGTTGAT GAAGCCGCAC TGGCCGACAC CGAATGGCTG AAACAGGCGA TCGTTGCGGT ACGTAACATC CGTGCAGAAA TGAATATCGC GCCGGGCAAA CCGCTGGAGC TGCTGCTGCG TGGTTGCAGC GCGGATGCAG AACGTCGCGT AAATGAAAAC CGTGGCTTCC TGCAAACCCT GGCGCGTCTG GAAAGCATCA CCGTGCTGCC TGCCGATGAC AAAGGTCCGG TTTCCGTTGC GAAGATCATC GACGGCGCAG AGCTGCTGAT CCCGATGGCT GGCCTCATCA ACAAAGAAGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAGATCGAA GGCGAAATCA GCCGTATCGA GAACAAGCTG GCGAATGAAG GCTTTGTCGC CCGCGCACCG GAAGCGGTCA TCGCGAAAGA GCGTGAGAAG CTGGAAGGCT ATGCAGAAGC GAAAGCTAAG CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA
|
Protein sequence | MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI LTFDGDIRES AQVFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EEEVRKENNL GADVALRQDE DVLDTWFSSA LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS ADAERRVNEN RGFLQTLARL ESITVLPADD KGPVSVAKII DGAELLIPMA GLINKEDELA RLAKEVAKIE GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAEAKAK LIEQQAVIAA L
|
| |