Gene EcDH1_4166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4166 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4514325 
End bp4516487 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content58% 
IMG OID 
ProductDNA helicase II 
Protein accessionACX41766 
Protein GI260451344 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTTT CTTACCTGCT CGACAGCCTT AATGACAAAC AGCGCGAAGC GGTGGCCGCG 
CCACGCAGCA ACCTTCTGGT GCTGGCGGGC GCGGGCAGTG GTAAGACGCG CGTACTGGTG
CATCGTATCG CCTGGTTGAT GAGCGTGGAA AACTGCTCGC CATACTCGAT TATGGCGGTG
ACGTTTACCA ACAAAGCGGC GGCGGAGATG CGTCATCGTA TCGGGCAACT GATGGGCACG
AGCCAGGGCG GTATGTGGGT CGGCACCTTC CACGGGCTGG CGCACCGTTT GCTGCGTGCG
CACCATATGG ACGCCAATCT GCCGCAGGAT TTCCAGATCC TCGACAGTGA AGACCAGCTA
CGCCTGCTTA AGCGTCTGAT CAAAGCCATG AACCTCGACG AGAAGCAGTG GCCGCCGCGG
CAGGCAATGT GGTACATCAA CAGCCAGAAA GATGAAGGCC TGCGTCCGCA TCATATTCAA
AGCTACGGTA ATCCGGTGGA GCAGACCTGG CAGAAGGTGT ATCAGGCGTA TCAGGAAGCG
TGTGACCGCG CGGGCCTGGT GGACTTCGCC GAGCTGCTGC TGCGCGCTCA CGAGTTGTGG
CTTAACAAGC CGCATATCCT GCAACACTAC CGCGAACGTT TTACCAATAT CCTGGTGGAC
GAATTCCAGG ATACCAACAA CATTCAGTAC GCGTGGATCC GCCTGCTGGC GGGCGACACC
GGCAAAGTGA TGATCGTCGG TGATGACGAC CAGTCAATCT ACGGCTGGCG CGGGGCGCAG
GTGGAGAATA TTCAGCGTTT CCTTAATGAT TTCCCCGGTG CCGAAACTAT TCGTCTGGAG
CAAAACTACC GCTCTACCAG CAATATTCTG AGCGCCGCTA ACGCCCTGAT TGAAAACAAT
AACGGGCGTC TGGGTAAAAA ACTGTGGACC GATGGCGCGG ACGGTGAGCC TATTTCCCTC
TATTGCGCTT TTAACGAACT CGATGAAGCG CGTTTTGTGG TTAACCGCAT CAAAACCTGG
CAGGACAACG GCGGAGCGCT TGCCGAGTGC GCCATTCTCT ACCGCAGCAA CGCCCAGTCG
CGGGTGCTCG AAGAGGCGTT ATTGCAGGCC AGTATGCCGT ACCGTATTTA CGGCGGGATG
CGCTTCTTCG AACGCCAGGA AATCAAAGAT GCGCTCTCGT ATCTGCGCCT GATTGCCAAC
CGCAACGACG ACGCGGCCTT TGAGCGTGTG GTGAATACGC CAACGCGGGG TATTGGTGAC
CGGACGCTGG ACGTGGTACG TCAGACATCG CGCGATCGCC AGTTAACACT CTGGCAGGCA
TGTCGTGAGC TGTTGCAGGA AAAAGCCCTC GCCGGGCGAG CTGCCAGCGC CTTGCAGCGA
TTTATGGAAT TAATCGACGC CTTAGCGCAG GAAACTGCCG ATATGCCGCT GCATGTACAG
ACTGACCGGG TAATTAAAGA CTCCGGCCTG CGTACCATGT ATGAGCAGGA GAAGGGCGAA
AAAGGTCAGA CGCGTATCGA AAACTTAGAG GAACTGGTGA CGGCAACGCG CCAGTTCAGC
TACAACGAAG AAGACGAAGA TTTAATGCCG CTGCAGGCGT TCCTCTCCCA TGCGGCACTG
GAAGCAGGTG AAGGGCAGGC GGATACCTGG CAGGATGCGG TGCAGTTGAT GACGCTACAC
TCGGCGAAAG GCCTGGAGTT CCCGCAGGTG TTTATCGTTG GTATGGAAGA GGGCATGTTC
CCAAGCCAGA TGTCGCTGGA TGAAGGCGGG CGTCTGGAAG AAGAACGCCG TCTGGCCTAC
GTTGGCGTAA CCCGCGCGAT GCAGAAACTG ACGCTGACCT ACGCGGAAAC CCGCCGTCTG
TATGGTAAAG AGGTTTACCA TCGCCCGTCG CGCTTTATCG GCGAGCTGCC GGAAGAGTGT
GTGGAAGAGG TGCGCCTGCG CGCCACGGTA AGCCGCCCGG TCAGCCATCA GCGGATGGGT
ACGCCGATGG TCGAGAACGA CAGCGGCTAC AAGCTCGGCC AGCGCGTACG CCACGCTAAG
TTTGGTGAAG GCACCATTGT CAATATGGAA GGCAGCGGTG AGCATAGCCG TTTGCAGGTG
GCATTTCAGG GCCAGGGTAT TAAATGGCTG GTGGCGGCAT ACGCCCGGCT GGAGTCGGTG
TAA
 
Protein sequence
MDVSYLLDSL NDKQREAVAA PRSNLLVLAG AGSGKTRVLV HRIAWLMSVE NCSPYSIMAV 
TFTNKAAAEM RHRIGQLMGT SQGGMWVGTF HGLAHRLLRA HHMDANLPQD FQILDSEDQL
RLLKRLIKAM NLDEKQWPPR QAMWYINSQK DEGLRPHHIQ SYGNPVEQTW QKVYQAYQEA
CDRAGLVDFA ELLLRAHELW LNKPHILQHY RERFTNILVD EFQDTNNIQY AWIRLLAGDT
GKVMIVGDDD QSIYGWRGAQ VENIQRFLND FPGAETIRLE QNYRSTSNIL SAANALIENN
NGRLGKKLWT DGADGEPISL YCAFNELDEA RFVVNRIKTW QDNGGALAEC AILYRSNAQS
RVLEEALLQA SMPYRIYGGM RFFERQEIKD ALSYLRLIAN RNDDAAFERV VNTPTRGIGD
RTLDVVRQTS RDRQLTLWQA CRELLQEKAL AGRAASALQR FMELIDALAQ ETADMPLHVQ
TDRVIKDSGL RTMYEQEKGE KGQTRIENLE ELVTATRQFS YNEEDEDLMP LQAFLSHAAL
EAGEGQADTW QDAVQLMTLH SAKGLEFPQV FIVGMEEGMF PSQMSLDEGG RLEEERRLAY
VGVTRAMQKL TLTYAETRRL YGKEVYHRPS RFIGELPEEC VEEVRLRATV SRPVSHQRMG
TPMVENDSGY KLGQRVRHAK FGEGTIVNME GSGEHSRLQV AFQGQGIKWL VAAYARLESV