Gene EcDH1_3941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3941 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4248741 
End bp4250156 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content54% 
IMG OID 
Productreplicative DNA helicase 
Protein accessionACX41541 
Protein GI260451119 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGAA ATAAACCCTT CAACAAACAG CAGGCTGAAC CCCGCGAACG CGATCCACAA 
GTTGCCGGGC TGAAAGTGCC TCCGCACTCG ATCGAAGCGG AGCAGTCGGT GTTGGGCGGT
TTAATGCTAG ATAACGAACG CTGGGATGAT GTAGCCGAGC GTGTGGTAGC AGACGATTTT
TACACCCGCC CACACCGTCA TATCTTTACT GAAATGGCGC GTTTGCAGGA AAGCGGTAGC
CCTATCGATC TGATTACTCT TGCGGAATCG CTGGAACGCC AGGGGCAACT CGATAGCGTC
GGTGGTTTTG CTTATCTGGC AGAGCTGTCA AAAAATACGC CAAGTGCGGC TAACATCAGT
GCCTATGCGG ACATCGTGCG TGAACGTGCC GTTGTCCGTG AGATGATCTC GGTTGCGAAT
GAGATTGCCG AAGCTGGTTT TGATCCGCAG GGGCGTACCA GCGAAGATCT GCTGGATCTG
GCTGAATCCC GCGTCTTTAA AATTGCCGAA AGTCGTGCGA ACAAAGACGA AGGGCCGAAG
AACATCGCCG ATGTGCTCGA CGCAACCGTG GCGCGTATTG AGCAGTTGTT TCAGCAGCCA
CACGATGGCG TTACCGGGGT AAACACCGGT TATGACGATC TCAACAAAAA AACCGCTGGC
TTGCAGCCGT CGGATTTGAT CATCGTCGCC GCGCGTCCGT CGATGGGTAA AACAACATTT
GCGATGAACC TCGTCGAAAA CGCGGCGATG TTGCAGGATA AACCGGTACT TATCTTCTCG
CTGGAGATGC CATCAGAACA GATCATGATG CGTTCTCTGG CGTCGCTGTC GCGCGTTGAC
CAGACTAAAA TCCGTACCGG GCAGCTCGAT GACGAAGACT GGGCGCGCAT TTCCGGCACC
ATGGGTATTT TGCTCGAAAA ACGCAATATC TATATCGATG ACTCCTCCGG CCTGACGCCA
ACGGAAGTGC GTTCCCGCGC ACGCCGTATT GCCCGTGAAC ACGGCGGCAT CGGGCTTATC
ATGATCGACT ACCTGCAACT GATGCGCGTA CCGGCGCTTT CCGATAACCG TACGCTGGAA
ATTGCAGAAA TCTCTCGCTC GCTGAAAGCA CTGGCGAAAG AACTGAACGT GCCGGTGGTG
GCGCTGTCCC AGTTGAACCG TTCTCTGGAA CAACGTGCCG ACAAACGCCC GGTCAACTCC
GACCTGCGTG AATCTGGCTC TATCGAGCAG GATGCGGACT TGATCATGTT TATCTATCGT
GATGAGGTGT ATCACGAAAA CAGTGATTTA AAAGGCATCG CGGAAATTAT TATCGGTAAA
CAACGTAACG GCCCAATCGG GACGGTACGC CTGATCTTTA ACGGTCAATG GTCGCGCTTC
GACAACTATG CGGGGCCGCA GTACGACGAC GAATAA
 
Protein sequence
MAGNKPFNKQ QAEPRERDPQ VAGLKVPPHS IEAEQSVLGG LMLDNERWDD VAERVVADDF 
YTRPHRHIFT EMARLQESGS PIDLITLAES LERQGQLDSV GGFAYLAELS KNTPSAANIS
AYADIVRERA VVREMISVAN EIAEAGFDPQ GRTSEDLLDL AESRVFKIAE SRANKDEGPK
NIADVLDATV ARIEQLFQQP HDGVTGVNTG YDDLNKKTAG LQPSDLIIVA ARPSMGKTTF
AMNLVENAAM LQDKPVLIFS LEMPSEQIMM RSLASLSRVD QTKIRTGQLD DEDWARISGT
MGILLEKRNI YIDDSSGLTP TEVRSRARRI AREHGGIGLI MIDYLQLMRV PALSDNRTLE
IAEISRSLKA LAKELNVPVV ALSQLNRSLE QRADKRPVNS DLRESGSIEQ DADLIMFIYR
DEVYHENSDL KGIAEIIIGK QRNGPIGTVR LIFNGQWSRF DNYAGPQYDD E