Gene EcDH1_3739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3739 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4029968 
End bp4032823 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content55% 
IMG OID 
Productvalyl-tRNA synthetase 
Protein accessionACX41344 
Protein GI260450922 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAGA CATATAACCC ACAAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAAAG 
CAGGGCTACT TTAAGCCTAA TGGCGATGAA AGCCAGGAAA GTTTCTGCAT CATGATCCCG
CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCACGCCT TCCAGCAAAC CATCATGGAT
ACCATGATCC GCTATCAGCG CATGCAGGGC AAAAACACCC TGTGGCAGGT CGGTACTGAC
CACGCCGGGA TCGCTACCCA GATGGTCGTT GAGCGCAAGA TTGCCGCAGA AGAAGGTAAA
ACCCGTCACG ACTACGGCCG CGAAGCTTTC ATCGACAAAA TCTGGGAATG GAAAGCGGAA
TCTGGCGGCA CCATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTCGA CTGGGAGCGT
GAACGCTTCA CCATGGACGA AGGCCTGTCC AATGCGGTGA AAGAAGTTTT CGTTCGTCTG
TATAAAGAAG ACCTGATTTA CCGTGGCAAA CGCCTGGTAA ACTGGGATCC GAAACTGCGC
ACCGCTATCT CTGACCTGGA AGTCGAAAAC CGCGAATCGA AAGGTTCGAT GTGGCACATC
CGCTATCCGC TGGCTGACGG TGCGAAAACC GCAGACGGTA AAGATTATCT GGTGGTCGCG
ACTACCCGTC CAGAAACCCT GCTGGGCGAT ACTGGCGTAG CCGTTAACCC GGAAGATCCG
CGTTACAAAG ATCTGATTGG CAAATATGTC ATTCTGCCGC TGGTTAACCG TCGTATTCCG
ATCGTTGGCG ACGAACACGC CGACATGGAA AAAGGCACCG GCTGCGTGAA AATCACTCCG
GCGCACGACT TTAACGACTA TGAAGTGGGT AAACGTCACG CCCTGCCGAT GATCAACATC
CTGACCTTTG ACGGCGATAT CCGTGAAAGC GCCCAGATGT TCGATACCAA AGGTAACGAA
TCTGACGTTT ATTCCAGCGA AATCCCTGCA GAGTTCCAGA AACTGGAGCG TTTTGCTGCA
CGTAAAGCAG TCGTTGCCGC AGTTGACGCG CTTGGCCTGC TGGAAGAAAT TAAACCGCAC
GACCTGACCG TTCCTTACGG CGACCGTGGC GGCGTAGTTA TCGAACCAAT GCTGACCGAC
CAGTGGTACG TGCGTGCCGA TGTCCTGGCG AAACCGGCGG TTGAAGCGGT TGAGAACGGC
GACATTCAGT TCGTACCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGCGATATT
CAGGACTGGT GTATCTCTCG TCAGTTGTGG TGGGGTCACC GTATCCCGGC ATGGTATGAC
GAAGCGGGTA ACGTTTATGT TGGCCGCAAC GAAGACGAAG TGCGTAAAGA AAATAACCTC
GGTGCTGATG TTGTCCTGCG TCAGGACGAA GACGTTCTCG ATACCTGGTT CTCTTCTGCG
CTGTGGACCT TCTCTACCCT TGGCTGGCCG GAAAATACCG ACGCCCTGCG TCAGTTCCAC
CCAACCAGCG TGATGGTATC TGGTTTCGAC ATCATTTTCT TCTGGATTGC CCGCATGATC
ATGATGACCA TGCACTTCAT CAAAGATGAA AATGGCAAAC CGCAGGTGCC GTTCCACACC
GTTTACATGA CCGGCCTGAT TCGTGATGAC GAAGGCCAGA AGATGTCCAA ATCCAAGGGT
AACGTTATCG ACCCACTGGA TATGGTTGAC GGTATTTCGC TGCCAGAACT GCTGGAAAAA
CGTACCGGCA ATATGATGCA GCCGCAGCTG GCGGACAAAA TCCGTAAGCG CACCGAGAAG
CAGTTCCCGA ACGGTATTGA GCCGCACGGT ACTGACGCGC TGCGCTTCAC CCTGGCGGCG
CTGGCGTCTA CCGGTCGTGA CATCAACTGG GATATGAAGC GTCTGGAAGG TTACCGTAAC
TTCTGTAACA AGCTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACAGA AGGTCAGGAT
TGCGGCTTCA ACGGCGGCGA AATGACGCTG TCGCTGGCGG ACCGCTGGAT TCTGGCGGAG
TTCAACCAGA CCATCAAAGC GTACCGCGAA GCGCTGGACA GCTTCCGCTT CGATATCGCC
GCAGGCATTC TGTATGAGTT CACCTGGAAC CAGTTCTGTG ACTGGTATCT CGAGCTGACC
AAGCCGGTAA TGAACGGTGG CACCGAAGCA GAACTGCGCG GTACTCGCCA TACGCTGGTG
ACTGTACTGG AAGGTCTGCT GCGCCTCGCG CATCCGATCA TTCCGTTCAT CACCGAAACC
ATCTGGCAGC GTGTGAAAGT ACTTTGCGGT ATCACTGCCG ACACCATCAT GCTGCAGCCG
TTCCCGCAGT ACGATGCATC TCAGGTTGAT GAAGCCGCAC TGGCCGACAC CGAATGGCTG
AAACAGGCGA TCGTTGCGGT ACGTAACATC CGTGCAGAAA TGAACATCGC GCCGGGCAAA
CCGCTGGAGC TGCTGCTGCG TGGTTGCAGC GCGGATGCAG AACGTCGCGT AAATGAAAAC
CGTGGCTTCC TGCAAACCCT GGCGCGTCTG GAAAGTATCA CCGTGCTGCC TGCCGATGAC
AAAGGTCCGG TTTCCGTTAC GAAGATCATC GACGGTGCAG AGCTGCTGAT CCCGATGGCT
GGCCTCATCA ACAAAGAAGA TGAGCTGGCG CGTCTGGCGA AAGAAGTGGC GAAGATTGAA
GGTGAAATCA GCCGTATCGA GAACAAACTG GCGAACGAAG GCTTTGTCGC CCGCGCACCG
GAAGCGGTCA TCGCGAAAGA GCGTGAGAAG CTGGAAGGCT ATGCGGAAGC GAAAGCGAAA
CTGATTGAAC AGCAGGCTGT TATCGCCGCG CTGTAA
 
Protein sequence
MEKTYNPQDI EQPLYEHWEK QGYFKPNGDE SQESFCIMIP PPNVTGSLHM GHAFQQTIMD 
TMIRYQRMQG KNTLWQVGTD HAGIATQMVV ERKIAAEEGK TRHDYGREAF IDKIWEWKAE
SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR
TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP
RYKDLIGKYV ILPLVNRRIP IVGDEHADME KGTGCVKITP AHDFNDYEVG KRHALPMINI
LTFDGDIRES AQMFDTKGNE SDVYSSEIPA EFQKLERFAA RKAVVAAVDA LGLLEEIKPH
DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD EAGNVYVGRN EDEVRKENNL GADVVLRQDE DVLDTWFSSA
LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFHT
VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLPELLEK RTGNMMQPQL ADKIRKRTEK
QFPNGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEGQD
CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDSFRFDIA AGILYEFTWN QFCDWYLELT
KPVMNGGTEA ELRGTRHTLV TVLEGLLRLA HPIIPFITET IWQRVKVLCG ITADTIMLQP
FPQYDASQVD EAALADTEWL KQAIVAVRNI RAEMNIAPGK PLELLLRGCS ADAERRVNEN
RGFLQTLARL ESITVLPADD KGPVSVTKII DGAELLIPMA GLINKEDELA RLAKEVAKIE
GEISRIENKL ANEGFVARAP EAVIAKEREK LEGYAEAKAK LIEQQAVIAA L