Gene EcHS_A4514 details

Gene Information

Locus tag: EcHS_A4514
Symbol: valS
ID: 5593019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4517167
End bp: 4520022
Gene Length: 2856 bp
Protein Length: 951 aa
Translation table: 11
GC content: 55%
IMG OID: 640923610
Product: valyl-tRNA synthetase
Protein accession: YP_001461051
Protein GI: 157163733
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
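
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4517167
end_bp = 4520022

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 2856 bp
print(protein_length, "aa")  # 951 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.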


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.00256873
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAGA CATATAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG 
CAGGGCTACT TTAAGCCTAA TGGCGATGAA AGCCAGGAAA GTTTCTGCAT CATGATCCCG
CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCACGCCT TCCAGCAAAC CATCATGGAT
ACCATGATCC GCTATCAGCG CATGCAGGGC AAAAACACCC TGTGGCAGGT CGGTACTGAC
CACGCCGGGA TCGCTACCCA GATGGTCGTT GAGCGCAAGA TTGCCGCAGA AGAAGGTAAA
ACCCGTCACG ACTACGGCCG CGAAGCTTTC ATCGACAAAA TCTGGGAATG GAAAGCGGAA
TCTGGCGGCA CCATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTGGA CTGGGAGCGT
GAACGCTTCA CCATGGACGA AGGCCTGTCC AATGCGGTGA AAGAAGTTTT CGTTCGTCTG
TATAAAGAAG ACCTGATTTA CCGTGGCAAG CGCCTGGTAA ACTGGGACCC GAAACTGCGC
ACCGCTATCT CTGACCTGGA AGTGGAAAAC CGCGAATCGA AAGGTTCGAT GTGGCACATC
CGCTATCCGC TGGCTGACGG CGCGAAAACC GCAGACGGTA AAGATTATCT GGTGGTTGCG
ACTACCCGTC CGGAAACCCT GCTGGGCGAT ACTGGCGTAG CCGTTAACCC GGAAGATCCG
CGTTACAAAG ATCTGATTGG CAAATATGTC ATTCTGCCGC TGGTTAACCG TCGTATTCCG
ATCGTTGGCG ACGAACACGC CGACATGGAA AAAGGCACCG GCTGCGTGAA GATCACCCCG
GCGCACGACT TTAACGACTA TGAAGTCGGT AAACGTCACG CCCTGCCGAT GATCAACATC
CTGACCTTTG ACGGCGATAT CCGTGAAAGT GCCCAGGTGT TCGATACCAA AGGTAACGAA
TCTGACGTTT ATTCCAGCGA GATCCCGGCA GAATTCCAGA AACTGGAGCG TTTTGCTGCA
CGTAAAGCGG TCGTTGCTGC GGTTGACGCA CTCGGCCTGC TGGAAGAAAT TAAACCACAT
GACCTGACCG TTCCTTACGG CGACCGTGGC GGCGTGGTCA TCGAACCAAT GCTGACCGAC
CAGTGGTACG TGCGTGCCGA TGTCCTGGCG AAACCGGCAG TTGAAGCGGT TGAGAACGGC
GACATTCAGT TCGTACCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT
CAGGACTGGT GTATCTCTCG TCAGTTGTGG TGGGGTCACC GTATCCCGGC ATGGTATGAC
GAAGCGGGTA ACGTTTATGT TGGCCGCAAC GAAGACGAAG TGCGTAAAGA AAATAACCTC
GGTGCTGATG TTGTCCTGCG TCAGGACGAA GACGTTCTCG ATACCTGGTT CTCTTCTGCG
CTGTGGACCT TCTCTACCCT TGGCTGGCCG GAAAACACCG ACGCCCTGCG TCAGTTCCAC
CCAACCAGCG TGATGGTATC TGGTTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC
ATGATGACCA TGCACTTCAT CAAAGATGAA AATGGCAAAC CGCAGGTGCC GTTCCACACT
GTTTACATGA CCGGTCTGAT TCGTGACGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT
AACGTTATCG ACCCGCTGGA TATGGTTGAC GGTATTTCGC TGCCAGAACT GCTGGAAAAA
CGTACCGGCA ATATGATGCA GCCGCAGCTG GCGGACAAAA TCCGTAAGCG CACCGAGAAA
CAGTTCCCGA ACGGCATTGA GCCGCACGGC ACCGATGCCC TGCGCTTCAC CCTGGCGGCA
CTGGCGTCTA CCGGTCGTGA CATCAACTGG GATATGAAGC GCCTGGAAGG TTACCGTAAC
TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACAGA AGGTCAGGAT
TGCGGCTTCA ACGGCGGCGA GATGACGCTG TCGCTGGCGG ATCGCTGGAT TCTGGCGGAG
TTCAACCAGA CCATCAAAGC GTACCGCGAA GCGCTGGACA GCTTCCGCTT CGATATCGCC
GCAGGCATTC TGTATGAGTT CACCTGGAAC CAGTTCTGTG ACTGGTATCT GGAGCTGACT
AAGCCGGTAA TGAACGGTGG CACCGAAGCC GAACTGCGCG GTACTCGCCA TACGCTGGTG
ACCGTGCTGG AAGGTCTGCT GCGCCTCGCG CATCCGATCA TTCCGTTCAT CACCGAAACC
ATCTGGCAGC GTGTGAAAGT ACTTTGCGGT ATCACTGCCG ACACCATCAT GCTGCAGCCG
TTCCCGCAGT ACGATGCATC TCAGGTTGAT GAAGCCGCAC TGGCCGACAC AGAGTGGCTG
AAGCAGGCGA TCGTTGCGGT ACGTAACATC CGTGCAGAAA TGAACATCGC GCCGGGCAAA
CCGCTGGAGC TGCTACTGCG TGGTTGCAGC GCGGATGCAG AACGTCGCGT AAATGAAAAC
CGTGGCTTCC TGCAAACGCT GGCGCGTCTG GAAAGCATCA CCGTGCTGCC TGCCGATGAC
AAAGGTCCGG TTTCCGTTAC TAAGATCATC GACGGTGCAG AGCTGCTGAT CCCGATGGCT
GGCCTCATCA ACAAAGAAGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAGATTGAA
GGTGAAATCA GCCGTATCGA GAACAAGCTG GCGAATGAAG GCTTTGTCGC CCGCGCACCG
GAAGCGGTCA TCGCGAAAGA GCGTGAAAAG CTGGAAGGCT ATGCAGAAGC GAAAGCGAAG
CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA
 
Protein sequence
MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD 
TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE
SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR
TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP
RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI
LTFDGDIRES AQVFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH
DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EDEVRKENNL GADVVLRQDE DVLDTWFSSA
LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT
VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK
QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD
CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT
KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP
FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS ADAERRVNEN
RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKEDELA RLAKEVAKIE
GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAEAKAK LIEQQAVIAA L
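
The gene and protein sequences above are related by translation, and the GC content field is a simple base count. The following is a minimal sketch of both, not the tool that produced this record. It assumes that translation table 11 assigns the same amino acid to each codon as the standard code (the two tables differ only in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first line of the gene sequence is used as input here.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown in this record.
first_line = "ATGGAAAAGA CATATAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG"
print(translate(first_line))  # MEKTYNPQDIEQPLYEHWEK
```

The printed peptide matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.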