Gene EcDH1_3040 details


Gene Information

Locus tag: EcDH1_3040
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3263081
End bp: 3266962
Gene Length: 3882 bp
Protein Length: 1293 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: amino acid adenylation domain protein
Protein accession: ACX40668
Protein GI: 260450246
COG category:
COG ID:
TIGRFAM ID:
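
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3263081
END_BP = 3266962
GENE_LENGTH_BP = 3882
PROTEIN_LENGTH_AA = 1293


def span_length(start, end):
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3266962 - 3263081 + 1 = 3882 bp, and 3882 / 3 - 1 = 1293 aa, which agrees with the listed values.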


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGC ATTTACCTTT GGTCGCCGCA CAGCCCGGCA TCTGGATGGC AGAAAAACTG 
TCAGAATTAC CCTCCGCCTG GAGCGTGGCG CATTACGTTG AGTTAACCGG AGAGGTTGAT
TCGCCATTAC TGGCCCGCGC GGTGGTTGCC GGACTAGCGC AAGCAGATAC GCTGCGGATG
CGTTTTACGG AAGATAACGG CGAAGTCTGG CAGTGGGTCG ATGATGCGCT GACGTTCGAA
CTGCCAGAAA TTATCGACCT ACGAACCAAC ATTGATCCGC ACGGTACTGC GCAGGCATTA
ATGCAGGCGG ATTTGCAACA AGATCTGCGC GTCGATAGCG GTAAACCACT GGTCTTTCAT
CAGCTGATAC AGGTGGCGGA TAACCGCTGG TACTGGTATC AGCGTTATCA CCATTTGCTG
GTCGATGGCT TTAGTTTCCC GGCCATTACC CGCCAGATCG CCAATATTTA CTGCACATGG
CTGCGTGGCG AACCAACGCC TGCTTCGCCA TTTACGCCTT TCGCTGATGT AGTGGAAGAG
TACCAGCAAT ACCGCGAAAG CGAAGCCTGG CAGCGTGATG CGGCATTCTG GGCAGAACAG
CGTCGTCAAC TGCCGCCGCC CGCGTCACTT TCTCCGGCAC CTTTACCGGG GCGCAGCGCC
TCGGCAGATA TTCTGCGCCT GAAACTGGAA TTTACCGACG GGGAATTCCG CCAGCTGGCT
ACGCAACTTT CAGGTGTGCA GCGTACCGAT TTAGCCCTTG CGCTGGCAGC CTTGTGGCTG
GGGCGATTGT GCAATCGTAT GGACTACGCC GCCGGATTTA TCTTTATGCG TCGACTGGGC
TCGGCGGCGC TGACGGCTAC CGGACCCGTG CTCAACGTTT TGCCGTTGGG TATTCACATT
GCGGCGCAAG AAACGCTGCC GGAACTGGCA ACCCGACTGG CAGCACAACT GAAAAAAATG
CGTCGTCATC AACGTTACGA TGCCGAACAA ATTGTCCGTG ACAGCGGGCG AGCGGCAGGT
GATGAACCGC TGTTTGGTCC GGTACTCAAT ATCAAGGTAT TTGATTACCA ACTGGATATT
CCTGATGTTC AGGCGCAAAC CCATACCCTG GCAACCGGTC CGGTTAATGA CCTTGAACTG
GCCCTGTTCC CGGATGTACA CGGTGATTTG AGTATTGAGA TCCTCGCCAA TAAACAGCGT
TACGATGAGC CAACGTTAAT CCAGCATGCT GAACGCCTGA AAATGCTGAT TGCCCAGTTC
GCCGCGGATC CGGCGCTGTT GTGCGGCGAT GTCGATATTA TGCTGCCAGG TGAGTATGCG
CAGCTGGCGC AGCTCAACGC CACTCAGGTT GAGATTCCAG AAACCACGCT TAGCGCGCTG
GTGGCAGAAC AAGCGGCAAA AACACCGGAT GCTCCGGCGC TGGCAGATGC GCGTTACCTG
TTCAGCTATC GGGAAATGCG CGAGCAGGTG GTGGCGCTGG CGAATCTGCT GCGTGAGCGC
GGCGTTAAAC CAGGGGACAG CGTGGCGGTG GCACTACCGC GCTCGGTCTT TTTGACCCTG
GCACTCCATG CGATAGTTGA AGCTGGAGCG GCCTGGCTAC CGCTGGATAC CGGCTATCCG
GACGATCGCC TGAAAATGAT GCTGGAAGAT GCGCGTCCGT CGCTGTTAAT TACCACCGAC
GATCAACTGC CGCGCTTTAG CGATGTTCCC AATTTAACAA GCCTTTGCTA TAACGCCCCG
CTTACACCGC AGGGCAGTGC GCCGCTGCAA CTTTCACAAC CGCATCACAC GGCTTATATC
ATCTTTACCT CTGGCTCCAC CGGCAGGCCG AAAGGGGTAA TGGTCGGGCA GACGGCTATC
GTCAACCGCC TGCTTTGGAT GCAAAATCAT TATCCGCTTA CAGGCGAAGA TGTCGTTGCC
CAAAAAACGC CGTGCAGTTT TGATGTCTCG GTGTGGGAGT TTTTCTGGCC GTTTATCGCA
GGGGCAAAAC TGGTGATGGC TGAACCGGAA GCGCACCGCG ACCCGCTCGC TATGCAGCAA
TTCTTTGCCG AATATGGCGT AACGACCACG CACTTTGTGC CGTCGATGCT GGCGGCATTT
GTTGCCTCGC TGACGCCGCA AACCGCTCGC CAGAGTTGCG CGACGTTGAA ACAGGTTTTC
TGTAGTGGTG AGGCCTTACC GGCTGATTTA TGCCGCGAAT GGCAACAGTT AACTGGCGCG
CCGTTGCATA ATCTATATGG CCCGACGGAA GCGGCGGTAG ATGTCAGCTG GTATCCGGCT
TTTGGCGAGG AACTGGCACA GGTGCGCGGC AGCAGTGTGC CGATTGGTTA TCCGGTATGG
AATACGGGTC TGCGTATTCT TGATGCGATG ATGCATCCGG TGCCGCCGGG TGTGGCGGGT
GATCTCTATC TCACTGGCAT TCAACTGGCG CAGGGCTATC TCGGACGCCC CGATCTGACC
GCCAGCCGCT TTATTGCCGA TCCTTTTGCC CCAGGTGAAC GGATGTACCG TACCGGAGAC
GTTGCCCGCT GGCTGGATAA CGGCGCGGTG GAGTACCTCG GGCGCAGTGA TGATCAGCTA
AAAATTCGCG GGCAGCGTAT CGAACTGGGC GAAATCGATC GCGTGATGCA GGCGCTGCCG
GATGTCGAAC AAGCCGTTAC CCACGCCTGT GTGATTAACC AGGCGGCTGC CACCGGTGGT
GATGCGCGTC AATTGGTGGG CTATCTGGTG TCGCAATCGG GCCTGCCGTT GGATACCAGC
GCATTGCAGG CGCAGCTTCG TGAAACATTG CCACCACATA TGGTACCGGT GGTTCTGCTG
CAACTTCCAC AGTTACCACT TAGCGCCAAC GGCAAGCTGG ATCGCAAAGC CTTACCGTTG
CCTGAACTGA AGGCACAAGC GCCAGGGCGT GCGCCGAAAG CGGGCAGTGA AACGATTATC
GCCGCGGCAT TCTCGTCGTT GCTGGGGTGT GACGTGCAGG ATGCCGATGC TGATTTCTTC
GCGCTTGGCG GTCATTCGCT ACTGGCAATG AAACTGGCAG CGCAGTTAAG TCGGCAGGTT
GCCCGCCAGG TGACGCCGGG GCAAGTGATG GTCGCGTCAA CTGTCGCCAA ACTGGCAACG
ATTATTGATG CTGAAGAAGA CAGCACCCGG CGTATGGGAT TCGAAACCAT TCTGCCGTTG
CGTGAAGGTA ATGGCCCGAC GCTGTTTTGT TTCCATCCTG CGTCCGGTTT TGCCTGGCAG
TTCAGCGTGC TCTCGCGTTA TCTCGATCCA CAATGGTCGA TTATCGGCAT TCAGTCACCG
CGCCCCAATG GCCCCATGCA GACGGCGGCA AACCTGGATG AAGTCTGCGA AGCGCATCTG
GCAACGTTAC TTGAACAACA ACCGCATGGC CCTTATTACC TGCTGGGGTA TTCCCTTGGC
GGTACGCTGG CGCAGGGTAT TGCGGCGCGA CTGCGTGCCC GTGGCGAACA GGTGGCATTT
CTTGGCTTGC TGGATACCTG GCCGCCAGAA ACGCAAAACT GGCAGGAAAA AGAAGCTAAT
GGTCTGGACC CGGAAGTGCT GGCGGAGATT AACCGCGAAC GCGAGGCCTT CCTGGCAGCA
CAGCAGGGAA GTACTTCAAC GGAGTTGTTT ACCACCATTG AAGGCAACTA CGCTGATGCT
GTGCGCCTGC TGACGACTGC TCATAGCGTA CCGTTTGACG GTAAAGCGAC GCTGTTTGTT
GCTGAACGCA CACTTCAGGA AGGTATGAGT CCCGAACGCG CCTGGTCGCC GTGGATAGCG
GAGCTGGATA TCTATCGTCA GGATTGTGCG CATGTGGATA TTATCTCTCC AGGGACGTTT
GAAAAAATTG GGCCGATTAT TCGCGCAACG CTAAACAGGT AA
 
Protein sequence
MSQHLPLVAA QPGIWMAEKL SELPSAWSVA HYVELTGEVD SPLLARAVVA GLAQADTLRM 
RFTEDNGEVW QWVDDALTFE LPEIIDLRTN IDPHGTAQAL MQADLQQDLR VDSGKPLVFH
QLIQVADNRW YWYQRYHHLL VDGFSFPAIT RQIANIYCTW LRGEPTPASP FTPFADVVEE
YQQYRESEAW QRDAAFWAEQ RRQLPPPASL SPAPLPGRSA SADILRLKLE FTDGEFRQLA
TQLSGVQRTD LALALAALWL GRLCNRMDYA AGFIFMRRLG SAALTATGPV LNVLPLGIHI
AAQETLPELA TRLAAQLKKM RRHQRYDAEQ IVRDSGRAAG DEPLFGPVLN IKVFDYQLDI
PDVQAQTHTL ATGPVNDLEL ALFPDVHGDL SIEILANKQR YDEPTLIQHA ERLKMLIAQF
AADPALLCGD VDIMLPGEYA QLAQLNATQV EIPETTLSAL VAEQAAKTPD APALADARYL
FSYREMREQV VALANLLRER GVKPGDSVAV ALPRSVFLTL ALHAIVEAGA AWLPLDTGYP
DDRLKMMLED ARPSLLITTD DQLPRFSDVP NLTSLCYNAP LTPQGSAPLQ LSQPHHTAYI
IFTSGSTGRP KGVMVGQTAI VNRLLWMQNH YPLTGEDVVA QKTPCSFDVS VWEFFWPFIA
GAKLVMAEPE AHRDPLAMQQ FFAEYGVTTT HFVPSMLAAF VASLTPQTAR QSCATLKQVF
CSGEALPADL CREWQQLTGA PLHNLYGPTE AAVDVSWYPA FGEELAQVRG SSVPIGYPVW
NTGLRILDAM MHPVPPGVAG DLYLTGIQLA QGYLGRPDLT ASRFIADPFA PGERMYRTGD
VARWLDNGAV EYLGRSDDQL KIRGQRIELG EIDRVMQALP DVEQAVTHAC VINQAAATGG
DARQLVGYLV SQSGLPLDTS ALQAQLRETL PPHMVPVVLL QLPQLPLSAN GKLDRKALPL
PELKAQAPGR APKAGSETII AAAFSSLLGC DVQDADADFF ALGGHSLLAM KLAAQLSRQV
ARQVTPGQVM VASTVAKLAT IIDAEEDSTR RMGFETILPL REGNGPTLFC FHPASGFAWQ
FSVLSRYLDP QWSIIGIQSP RPNGPMQTAA NLDEVCEAHL ATLLEQQPHG PYYLLGYSLG
GTLAQGIAAR LRARGEQVAF LGLLDTWPPE TQNWQEKEAN GLDPEVLAEI NREREAFLAA
QQGSTSTELF TTIEGNYADA VRLLTTAHSV PFDGKATLFV AERTLQEGMS PERAWSPWIA
ELDIYRQDCA HVDIISPGTF EKIGPIIRAT LNR
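
The gene and protein sequences above are shown in blocks of 10 characters. The following is a minimal sketch of how the nucleotide sequence could be translated and its GC content computed once the spacing is removed. It uses the standard codon assignments, which translation table 11 shares for amino acids; table 11 also permits alternative start codons, which this sketch does not handle (the gene here begins with ATG, so that is not needed). The helper names are hypothetical.

```python
# Standard genetic code in NCBI order (first, second, third base each over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence.
first_blocks = "ATGAGCCAGC ATTTACCTTT GGTCGCCGCA"
print(translate(first_blocks))  # MSQHLPLVAA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to translate and gc_percent would allow the listed protein length and GC content to be checked in the same way.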