Gene EcDH1_1250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1250 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1346796 
End bp1348811 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content54% 
IMG OID 
ProductDNA ligase, NAD-dependent 
Protein accessionACX38924 
Protein GI260448502 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000262994 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCAA TCGAACAACA ACTGACAGAA CTGCGAACGA CGCTTCGCCA TCATGAATAT 
CTTTATCATG TGATGGATGC GCCGGAAATT CCCGACGCTG AATACGACAG GCTGATGCGC
GAACTGCGCG AGCTGGAAAC CAAACATCCA GAACTGATTA CGCCTGATTC GCCTACTCAA
CGTGTAGGCG CTGCGCCGCT GGCGGCTTTC AGCCAGATAC GCCATGAAGT ACCAATGCTG
TCACTGGATA ACGTTTTTGA TGAAGAAAGC TTTCTTGCTT TCAACAAACG TGTGCAGGAC
CGTCTGAAAA ACAACGAGAA AGTCACCTGG TGCTGTGAGC TGAAGCTGGA TGGTCTTGCC
GTCAGTATTC TGTATGAAAA TGGCGTTTTA GTCAGTGCCG CGACCCGTGG CGATGGCACC
ACCGGGGAAG ATATCACGTC TAATGTGCGT ACTATTCGCG CCATTCCGCT GAAGCTGCAC
GGAGAGAATA TCCCGGCGCG TCTGGAAGTG CGTGGTGAAG TGTTCCTGCC GCAGGCGGGG
TTCGAAAAGA TTAACGAAGA TGCGCGACGC ACGGGCGGGA AAGTGTTTGC TAACCCACGT
AATGCGGCAG CTGGTTCACT GCGTCAGCTT GATCCGCGTA TTACAGCGAA GCGACCGCTC
ACTTTTTTCT GCTATGGCGT TGGTGTTCTG GAAGGTGGCG AGCTGCCGGA TACTCATCTT
GGCCGTTTAC TGCAATTTAA AAAGTGGGGG TTGCCGGTCA GCGATCGGGT AACGCTTTGT
GAATCGGCGG AAGAAGTGCT GGCGTTCTAT CACAAAGTGG AAGAAGACCG CCCGACGCTG
GGCTTTGATA TCGACGGCGT GGTGATTAAG GTCAACTCAC TGGCACAGCA GGAGCAGCTT
GGCTTTGTCG CGCGTGCCCC GCGCTGGGCG GTAGCGTTTA AATTCCCGGC GCAGGAGCAG
ATGACCTTTG TGCGTGACGT CGAGTTTCAG GTTGGGCGTA CTGGCGCGAT TACGCCTGTT
GCGCGTCTGG AACCTGTCCA TGTTGCAGGC GTGCTGGTGA GTAACGCAAC CTTACACAAT
GCGGATGAAA TCGAACGTCT TGGTTTACGC ATTGGCGATA AAGTGGTGAT TCGCCGCGCT
GGCGACGTGA TCCCGCAGGT GGTTAACGTC GTGCTTTCTG AACGCCCGGA AGATACCCGT
GAGGTTGTAT TCCCGACGCA TTGTCCGGTA TGTGGTTCTG ACGTTGAGCG TGTGGAAGGT
GAAGCGGTTG CCCGCTGTAC CGGTGGCCTG ATTTGCGGTG CGCAGCGTAA AGAGTCGCTG
AAACACTTTG TTTCCCGCCG TGCGATGGAT GTTGACGGAA TGGGCGACAA AATCATCGAT
CAGCTGGTTG AAAAAGAATA TGTCCATACT CCGGCAGATC TGTTCAAACT CACCGCAGGC
AAACTGACCG GACTGGAGCG TATGGGGCCA AAATCGGCAC AAAACGTGGT TAACGCGCTG
GAAAAAGCGA AAGAAACCAC CTTTGCTCGC TTCCTCTATG CACTTGGCAT CCGTGAAGTC
GGCGAGGCCA CCGCAGCAGG TCTGGCGGCA TATTTCGGCA CGCTGGAAGC GCTGGAAGCC
GCTTCGATTG AAGAGCTGCA AAAGGTGCCT GATGTTGGCA TTGTCGTTGC ATCCCACGTT
CACAACTTCT TTGCCGAAGA AAGCAACCGC AATGTCATCA GCGAGCTGTT GGCGGAAGGT
GTTCACTGGC CTGCGCCGAT CGTTATCAAC GCGGAAGAGA TTGACAGCCC GTTTGCTGGT
AAAACCGTGG TGCTTACGGG CAGCTTAAGC CAGATGTCGC GTGATGACGC TAAAGCTCGA
CTGGTCGAAC TGGGCGCGAA AGTCGCGGGC AGCGTGTCGA AGAAAACCGA TCTGGTGATA
GCGGGTGAAG CTGCAGGATC TAAACTGGCG AAGGCGCAGG AACTGGGCAT TGAAGTCATC
GACGAAGCGG AAATGCTGCG TTTGCTGGGT AGCTGA
 
Protein sequence
MESIEQQLTE LRTTLRHHEY LYHVMDAPEI PDAEYDRLMR ELRELETKHP ELITPDSPTQ 
RVGAAPLAAF SQIRHEVPML SLDNVFDEES FLAFNKRVQD RLKNNEKVTW CCELKLDGLA
VSILYENGVL VSAATRGDGT TGEDITSNVR TIRAIPLKLH GENIPARLEV RGEVFLPQAG
FEKINEDARR TGGKVFANPR NAAAGSLRQL DPRITAKRPL TFFCYGVGVL EGGELPDTHL
GRLLQFKKWG LPVSDRVTLC ESAEEVLAFY HKVEEDRPTL GFDIDGVVIK VNSLAQQEQL
GFVARAPRWA VAFKFPAQEQ MTFVRDVEFQ VGRTGAITPV ARLEPVHVAG VLVSNATLHN
ADEIERLGLR IGDKVVIRRA GDVIPQVVNV VLSERPEDTR EVVFPTHCPV CGSDVERVEG
EAVARCTGGL ICGAQRKESL KHFVSRRAMD VDGMGDKIID QLVEKEYVHT PADLFKLTAG
KLTGLERMGP KSAQNVVNAL EKAKETTFAR FLYALGIREV GEATAAGLAA YFGTLEALEA
ASIEELQKVP DVGIVVASHV HNFFAEESNR NVISELLAEG VHWPAPIVIN AEEIDSPFAG
KTVVLTGSLS QMSRDDAKAR LVELGAKVAG SVSKKTDLVI AGEAAGSKLA KAQELGIEVI
DEAEMLRLLG S