Gene EcolC_3754 details

Gene Information

Locus tag: EcolC_3754
Symbol: valS
ID: 6068110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4107039
End bp: 4109894
Gene Length: 2856 bp
Protein Length: 951 aa
Translation table: 11
GC content: 55%
IMG OID: 641603169
Product: valyl-tRNA synthetase
Protein accession: YP_001726688
Protein GI: 170021734
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
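
The coordinate and length fields above are consistent with one another, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for EcolC_3754 (valS).
length = gene_length_bp(4107039, 4109894)
print(length, "bp")                      # gene length in base pairs
print(protein_length_aa(length), "aa")   # protein length in residues
```

With the values from this record, the sketch gives 2856 bp and 951 aa, matching the Gene Length and Protein Length fields.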


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.775633
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAGA CATACAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG 
CAGGGCTACT TTAAGCCTAA TGGCGATGAA AGCCAGGAAA GTTTCTGCAT CATGATCCCG
CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCACGCCT TCCAGCAAAC CATCATGGAT
ACCATGATCC GCTATCAGCG CATGCAGGGC AAAAACACCC TGTGGCAGGT CGGTACTGAC
CACGCCGGGA TCGCTACCCA GATGGTCGTT GAGCGCAAGA TTGCCGCAGA AGAAGGTAAA
ACCCGTCACG ACTACGGCCG CGAAGCTTTC ATCGACAAAA TTTGGGAATG GAAAGCGGAA
TCCGGCGGCA CGATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTCGA CTGGGAGCGT
GAACGCTTCA CCATGGACGA AGGCCTGTCC AATGCGGTGA AAGAAGTTTT CGTTCGTCTG
TATAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTAA ACTGGGATCC GAAACTGCGC
ACCGCTATCT CTGACCTGGA AGTCGAAAAC CGCGAATCGA AAGGTTCGAT GTGGCACATC
CGCTATCCGC TGGCTGACGG TGCGAAAACC GCAGACGGTA AAGATTATCT GGTGGTCGCG
ACTACCCGTC CAGAAACCCT GCTGGGCGAT ACTGGCGTAG CCGTTAACCC GGAAGATCCG
CGTTACAAAG ATCTGATTGG CAAATATGTC ATTCTGCCGC TGGTTAACCG TCGTATTCCG
ATCGTTGGCG ACGAACACGC CGACATGGAA AAAGGCACCG GCTGCGTGAA AATCACTCCG
GCGCACGACT TTAACGACTA TGAAGTGGGT AAACGTCACG CCCTGCCGAT GATCAACATC
CTGACCTTTG ACGGCGATAT CCGTGAAAGC GCCCAGGTGT TCGATACCAA AGGTAACGAA
TCTGACGTTT ATTCCAGCGA AATCCCTGCA GAGTTCCAGA AACTGGAGCG TTTTGCTGCA
CGTAAAGCAG TCGTTGCCGC AGTTGACGCG CTTGGCCTGC TGGAAGAAAT TAAACCGCAC
GACCTGACCG TTCCTTACGG CGACCGTGGC GGCGTAGTTA TCGAACCAAT GCTGACCGAC
CAGTGGTACG TGCGTGCCGA TGTCCTGGCG AAACCGGCGG TTGAAGCGGT TGAGAACGGC
GACATTCAGT TCGTACCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT
CAGGACTGGT GTATCTCTCG TCAGTTGTGG TGGGGTCACC GTATCCCGGC ATGGTATGAC
GAAGCGGGTA ACGTTTATGT TGGCCGCAAC GAAGACGAAG TGCGTAAAGA AAATAACCTC
GGTGCTGATG TTGTCCTGCG TCAGGACGAA GACGTTCTCG ATACCTGGTT CTCTTCTGCG
CTGTGGACCT TCTCTACCCT TGGCTGGCCG GAAAATACCG ACGCCCTGCG TCAGTTCCAC
CCAACCAGCG TGATGGTATC TGGTTTCGAC ATCATTTTCT TCTGGATTGC CCGCATGATC
ATGATGACCA TGCACTTCAT CAAAGATGAA AATGGCAAAC CGCAGGTGCC GTTCCACACC
GTTTACATGA CCGGCCTGAT TCGTGATGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT
AACGTTATCG ACCCACTGGA TATGGTTGAC GGTATTTCGC TGCCAGAACT GCTGGAAAAA
CGTACCGGCA ATATGATGCA GCCGCAGCTG GCGGACAAAA TCCGTAAGCG CACCGAGAAG
CAGTTCCCGA ACGGTATTGA GCCGCACGGT ACTGACGCGC TGCGCTTCAC CCTGGCGGCG
CTGGCGTCTA CCGGTCGTGA CATCAACTGG GATATGAAGC GTCTGGAAGG TTACCGTAAC
TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACAGA AGGTCAGGAT
TGCGGCTTCA ACGGCGGCGA AATGACGCTG TCGCTGGCGG ACCGCTGGAT TCTGGCGGAG
TTCAACCAGA CCATCAAAGC GTACCGCGAA GCGCTGGACA GCTTCCGCTT CGATATCGCC
GCAGGCATTC TGTATGAGTT CACCTGGAAC CAGTTCTGTG ACTGGTATCT CGAGCTGACC
AAGCCGGTAA TGAACGGTGG CACCGAAGCA GAACTGCGCG GTACTCGCCA TACGCTGGTG
ACTGTACTGG AAGGTCTGCT GCGCCTCGCG CATCCGATCA TTCCGTTCAT CACCGAAACC
ATCTGGCAGC GTGTGAAAGT ACTTTGCGGT ATCACTGCCG ACACCATCAT GCTGCAGCCG
TTCCCGCAGT ACGATGCATC TCAGGTTGAT GAAGCCGCAC TGGCCGACAC CGAATGGCTG
AAACAGGCGA TCGTTGCGGT ACGTAACATC CGTGCAGAAA TGAACATCGC GCCGGGCAAA
CCGCTGGAGC TGCTGCTGCG TGGTTGCAGC GCGGATGCAG AACGTCGCGT AAATGAAAAC
CGTGGCTTCC TGCAAACCCT GGCGCGTCTG GAAAGTATCA CCGTGCTGCC TGCCGATGAC
AAAGGTCCGG TTTCCGTTAC GAAGATCATC GACGGTGCAG AGCTGCTGAT CCCGATGGCT
GGCCTCATCA ACAAAGAAGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAGATTGAA
GGTGAAATCA GCCGTATCGA GAACAAACTG GCGAACGAAG GCTTTGTCGC CCGCGCACCG
GAAGCGGTCA TCGCGAAAGA GCGTGAGAAG CTGGAAGGCT ATGCGAAAGC GAAAGCGAAA
CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA
 
Protein sequence
MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD 
TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE
SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR
TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP
RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI
LTFDGDIRES AQVFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH
DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EDEVRKENNL GADVVLRQDE DVLDTWFSSA
LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT
VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK
QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD
CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT
KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP
FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS ADAERRVNEN
RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKEDELA RLAKEVAKIE
GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAKAKAK LIEQQAVIAA L
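
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It is a simplified illustration, not the pipeline that produced this record: it builds the codon assignments used by translation table 11 (which assigns amino acids to internal codons the same way as the standard code), strips the spacing used in the listing, and translates until the first stop codon. Alternative start codons are not handled, and the function name `translate` is hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence listed in this record.
first_row = "ATGGAAAAGA CATACAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG"
last_row = "CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA"

print(translate(first_row))  # MEKTYNPQDIEQPLYEHWEK
print(translate(last_row))   # LIEQQAVIAAL
```

The two printed strings correspond to the first 20 and last 11 residues of the protein sequence listed above.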