Gene EcDH1_1543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1543 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1680526 
End bp1682559 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content53% 
IMG OID 
Productmethionyl-tRNA synthetase 
Protein accessionACX39211 
Protein GI260448789 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.164847 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAG TCGCGAAGAA AATTCTGGTG ACGTGCGCAC TGCCGTACGC TAACGGCTCA 
ATCCACCTCG GCCATATGCT GGAGCACATC CAGGCTGATG TCTGGGTCCG TTACCAGCGA
ATGCGCGGCC ACGAGGTCAA CTTCATCTGC GCCGACGATG CCCACGGTAC ACCGATCATG
CTGAAAGCTC AGCAGCTTGG TATCACCCCG GAGCAGATGA TTGGCGAAAT GAGTCAGGAG
CATCAGACTG ATTTCGCAGG CTTTAACATC AGCTATGACA ACTATCACTC GACGCACAGC
GAAGAGAACC GCCAGTTGTC AGAACTTATC TACTCTCGCC TGAAAGAAAA CGGTTTTATT
AAAAACCGCA CCATCTCTCA GCTGTACGAT CCGGAAAAAG GCATGTTCCT GCCGGACCGT
TTTGTGAAAG GCACCTGCCC GAAATGTAAA TCCCCGGATC AATACGGCGA TAACTGCGAA
GTCTGCGGCG CGACCTACAG CCCGACTGAA CTGATCGAGC CGAAATCGGT GGTTTCTGGC
GCTACGCCGG TAATGCGTGA TTCTGAACAC TTCTTCTTTG ATCTGCCCTC TTTCAGCGAA
ATGTTGCAGG CATGGACCCG CAGCGGTGCG TTGCAGGAGC AGGTGGCAAA TAAAATGCAG
GAGTGGTTTG AATCTGGCCT GCAACAGTGG GATATCTCCC GCGACGCCCC TTACTTCGGT
TTTGAAATTC CGAACGCGCC GGGCAAATAT TTCTACGTCT GGCTGGACGC ACCGATTGGC
TACATGGGTT CTTTCAAGAA TCTGTGCGAC AAGCGCGGCG ACAGCGTAAG CTTCGATGAA
TACTGGAAGA AAGACTCCAC CGCCGAGCTG TACCACTTCA TCGGTAAAGA TATTGTTTAC
TTCCACAGCC TGTTCTGGCC TGCCATGCTG GAAGGCAGCA ACTTCCGCAA GCCGTCCAAC
CTGTTTGTTC ATGGCTATGT GACGGTGAAC GGCGCAAAGA TGTCCAAGTC TCGCGGCACC
TTTATTAAAG CCAGCACCTG GCTGAATCAT TTTGACGCAG ACAGCCTGCG TTACTACTAC
ACTGCGAAAC TCTCTTCGCG CATTGATGAT ATCGATCTCA ACCTGGAAGA TTTCGTTCAG
CGTGTGAATG CCGATATCGT TAACAAAGTG GTTAACCTGG CCTCCCGTAA TGCGGGCTTT
ATCAACAAGC GTTTTGACGG CGTGCTGGCA AGCGAACTGG CTGACCCGCA GTTGTACAAA
ACCTTCACTG ATGCCGCTGA AGTGATTGGT GAAGCGTGGG AAAGCCGTGA ATTTGGTAAA
GCCGTGCGCG AAATCATGGC GCTGGCTGAT CTGGCTAACC GCTATGTCGA TGAACAGGCT
CCGTGGGTGG TGGCGAAACA GGAAGGCCGC GATGCCGACC TGCAGGCAAT TTGCTCAATG
GGCATCAACC TGTTCCGCGT GCTGATGACT TACCTGAAGC CGGTACTGCC GAAACTGACC
GAGCGTGCAG AAGCATTCCT CAATACGGAA CTGACCTGGG ATGGTATCCA GCAACCGCTG
CTGGGCCACA AAGTGAATCC GTTCAAGGCG CTGTATAACC GCATCGATAT GAGGCAGGTT
GAAGCACTGG TGGAAGCCTC TAAAGAAGAA GTAAAAGCCG CTGCCGCGCC GGTAACTGGC
CCGCTGGCAG ATGATCCGAT TCAGGAAACC ATCACCTTTG ACGACTTCGC TAAAGTTGAC
CTGCGCGTGG CGCTGATTGA AAACGCAGAG TTTGTTGAAG GTTCTGACAA ACTGCTGCGC
CTGACGCTGG ATCTCGGCGG TGAAAAACGC AATGTCTTCT CCGGTATTCG TTCTGCTTAC
CCGGATCCGC AGGCACTGAT TGGTCGTCAC ACCATTATGG TGGCTAACCT GGCACCACGT
AAAATGCGCT TCGGTATCTC TGAAGGCATG GTGATGGCTG CCGGTCCTGG CGGGAAAGAT
ATTTTCCTGC TAAGCCCGGA TGCCGGTGCT AAACCGGGTC ATCAGGTGAA ATAA
 
Protein sequence
MTQVAKKILV TCALPYANGS IHLGHMLEHI QADVWVRYQR MRGHEVNFIC ADDAHGTPIM 
LKAQQLGITP EQMIGEMSQE HQTDFAGFNI SYDNYHSTHS EENRQLSELI YSRLKENGFI
KNRTISQLYD PEKGMFLPDR FVKGTCPKCK SPDQYGDNCE VCGATYSPTE LIEPKSVVSG
ATPVMRDSEH FFFDLPSFSE MLQAWTRSGA LQEQVANKMQ EWFESGLQQW DISRDAPYFG
FEIPNAPGKY FYVWLDAPIG YMGSFKNLCD KRGDSVSFDE YWKKDSTAEL YHFIGKDIVY
FHSLFWPAML EGSNFRKPSN LFVHGYVTVN GAKMSKSRGT FIKASTWLNH FDADSLRYYY
TAKLSSRIDD IDLNLEDFVQ RVNADIVNKV VNLASRNAGF INKRFDGVLA SELADPQLYK
TFTDAAEVIG EAWESREFGK AVREIMALAD LANRYVDEQA PWVVAKQEGR DADLQAICSM
GINLFRVLMT YLKPVLPKLT ERAEAFLNTE LTWDGIQQPL LGHKVNPFKA LYNRIDMRQV
EALVEASKEE VKAAAAPVTG PLADDPIQET ITFDDFAKVD LRVALIENAE FVEGSDKLLR
LTLDLGGEKR NVFSGIRSAY PDPQALIGRH TIMVANLAPR KMRFGISEGM VMAAGPGGKD
IFLLSPDAGA KPGHQVK