Gene EcE24377A_4829 details


Gene Information

Locus tag: EcE24377A_4829
Symbol: valS
ID: 5589285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4821236
End bp: 4824091
Gene Length: 2856 bp
Protein Length: 951 aa
Translation table: 11
GC content: 55%
IMG OID: 640928438
Product: valyl-tRNA synthetase
Protein accession: YP_001465766
Protein GI: 157158681
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
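
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based, inclusive positions, and that the gene length includes the terminal stop codon. The function names `span_length` and `coding_length_aa` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 4821236
end_bp = 4824091
gene_length_bp = 2856
protein_length_aa = 951


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(start_bp, end_bp))     # 2856
print(coding_length_aa(gene_length_bp))  # 951
```

Under those assumptions both values agree with the Gene Length and Protein Length fields.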


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.204033
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAGA CATACAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG 
CAGGGCTACT TTAAGCCTAA TGGCGATGAA AGCCAGGAAA GTTTCTGCAT CATGATCCCG
CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCACGCCT TCCAGCAAAC CATCATGGAT
ACCATGATCC GCTATCAGCG CATGCAGGGC AAAAACACCC TGTGGCAGGT CGGTACTGAC
CACGCCGGGA TCGCTACCCA GATGGTCGTT GAGCGCAAGA TTGCCGCAGA AGAAGGTAAA
ACCCGTCACG ACTACGGCCG CGAAGCTTTC ATCGACAAAA TTTGGGAATG GAAAGCGGAA
TCCGGCGGCA CGATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTCGA CTGGGAGCGT
GAACGCTTCA CCATGGACGA AGGCCTGTCC AATGCGGTGA AAGAAGTTTT CGTTCGTCTG
TATAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTAA ACTGGGATCC GAAACTGCGC
ACCGCTATCT CTGACCTGGA AGTCGAAAAC CGCGAATCGA AAGGTTCGAT GTGGCACATC
CGCTATCCGC TGGCTGACGG TGCGAAAACC GCAGACGGTA AAGATTATCT GGTGGTCGCG
ACTACCCGTC CAGAAACCCT GCTGGGCGAT ACTGGCGTAG CCGTTAACCC GGAAGATCCG
CGTTACAAAG ATCTGATTGG CAAATATGTC ATTCTGCCGC TGGTTAACCG TCGTATTCCG
ATCGTTGGCG ACGAACACGC CGACATGGAA AAAGGCACCG GCTGCGTGAA AATCACTCCG
GCGCACGACT TTAACGACTA TGAAGTGGGT AAACGTCACG CCCTGCCGAT GATCAACATC
CTGACCTTTG ACGGCGATAT CCGTGAAAGC GCCCAGGTGT TCGATACCAA AGGTAACGAA
TCTGACGTTT ATTCCAGCGA AATCCCTGCA GAGTTCCAGA AACTGGAGCG TTTTGCTGCA
CGTAAAGCAG TCGTTGCCGC AGTTGACGCG CTTGGCCTGC TGGAAGAAAT TAAACCGCAC
GACCTGACCG TTCCTTACGG CGACCGTGGC GGCGTAGTTA TCGAACCAAT GCTGACCGAC
CAGTGGTACG TGCGTGCCGA TGTACTGGCG AAACCGGCGG TTGAAGCGGT TGAGAACGGC
GACATTCAGT TCGTACCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT
CAGGACTGGT GTATCTCTCG TCAGCTGTGG TGGGGTCACC GTATCCCTGC ATGGTACGAC
GAAGCAGGTA ACGTTTATGT TGGCCGCAAC GAAGAAGAAG TGCGCAAGGA AAATAACCTC
GGTGCCGACG TTGCCCTGCG TCAGGACGAA GACGTTCTCG ACACCTGGTT CTCTTCTGCA
CTGTGGACCT TCTCTACCCT TGGCTGGCCG GAAAACACCG ACGCCCTGCG TCAGTTCCAC
CCAACCAGCG TGATGGTATC CGGCTTCGAC ATCATCTTCT TCTGGATTGC CCGCATGATC
ATGATGACCA TGCACTTCAT CAAAGATGAA AATGGCAAAC CGCAGGTGCC GTTCCACACT
GTTTACATGA CCGGTCTGAT TCGTGACGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT
AACGTTATCG ACCCGCTGGA TATGGTTGAC GGTATTTCGC TGCCAGAACT GCTGGAAAAA
CGTACCGGCA ATATGATGCA GCCGCAGCTG GCGGACAAAA TCCGTAAGCG CACCGAGAAA
CAGTTCCCGA ACGGCATTGA GCCGCACGGC ACCGATGCCC TGCGCTTCAC CCTGGCGGCG
CTGGCGTCTA CCGGTCGTGA CATCAACTGG GATATGAAGC GCCTGGAAGG TTACCGTAAC
TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACAGA AGGTCAGGAT
TGCGGCTTCA ACGGCGGTGA AATGACGCTG TCGCTGGCGG ACCGCTGGAT TCTGGCAGAG
TTCAACCAGA CCATCAAAGC GTACCGCGAA GCGCTGGACA GCTTCCGCTT CGATATCGCC
GCAGGCATTC TGTATGAGTT CACCTGGAAC CAGTTCTGTG ACTGGTATCT GGAGCTGACT
AAGCCGGTAA TGAACGGTGG CACCGAAGCC GAACTGCGCG GTACTCGCCA TACGCTGGTG
ACCGTGCTGG AAGGTCTGCT GCGCCTCGCG CATCCGATCA TTCCGTTCAT CACCGAAACC
ATCTGGCAGC GTGTGAAAGT ACTTTGCGGT ATCACTGCCG ACACCATCAT GCTGCAGCCG
TTCCCGCAGT ACGATGCATC TCAGGTTGAT GAAGCCGCAC TGGCCGACAC AGAGTGGTTG
AAGCAGGCGA TCGTTGCGGT ACGTAACATC CGTGCAGAAA TGAACATCGC GCCGGGCAAA
CTGCTGGAGC TGCTGCTGCG TGGTTGCAGC GCGGATGCAG AACGTCGCGT AAATGAAAAC
CGTGGCTTCC TGCAAACCCT GGCGCGTCTG GAAAGCATCA CCGTGCTGCC TGCCGATGAC
AAAGGTCCGG TTTCCGTTAC GAAGATCATC GACGGCGCAG AGCTGCTGAT CCCGATGGCT
GGCCTCATCA ACAAAGAAGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAGATCGAC
GGCGAAATCA GCCGTATCGA GAACAAACTG GCGAACGAAG GCTTTGTAGC GCGCGCACCG
GAAGCGGTCA TCGCGAAAGA GCGTGAGAAG CTGGAAGGCT ATGCAGAAGC GAAAGCGAAG
CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA
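
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. A minimal sketch follows, assuming the sequence is pasted in as plain text; `clean_sequence` and `gc_fraction` are hypothetical helper names. It shows how a GC fraction could be computed, for example to compare against the GC content field of the record. Only the first sequence line is used here as a small illustration, so the result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (six blocks of 10 bases).
first_line = "ATGGAAAAGA CATACAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(gc_fraction(seq))
```

Passing the full gene sequence to `clean_sequence` and then `gc_fraction` would give the whole-gene figure.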
 
Protein sequence
MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD 
TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE
SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR
TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP
RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI
LTFDGDIRES AQVFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH
DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EEEVRKENNL GADVALRQDE DVLDTWFSSA
LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT
VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK
QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD
CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT
KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP
FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK LLELLLRGCS ADAERRVNEN
RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKEDELA RLAKEVAKID
GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAEAKAK LIEQQAVIAA L