Gene EcDH1_2232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2232 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2391515 
End bp2395417 
Gene Length3903 bp 
Protein Length1300 aa 
Translation table11 
GC content53% 
IMG OID 
ProductATP-dependent helicase HrpA 
Protein accessionACX39880 
Protein GI260449458 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.95685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAAC AACAAAAATT GACCTTTACG GCCTTGCAGC AGCGGCTGGA TTCGCTGATG 
CTGCGTGACA GACTGCGTTT TTCTCGCCGT CTGCACGGCG TGAAGAAGGT TAAAAATCCT
GATGCACAAC AGGCCATTTT CCAGGAGATG GCGAAAGAGA TTGACCAGGC GGCAGGGAAA
GTCCTGCTGC GTGAAGCGGC ACGACCGGAA ATTACTTATC CTGACAATTT ACCGGTTAGT
CAGAAAAAAC AGGACATTCT CGAAGCGATT CGTGATCACC AGGTGGTGAT CGTCGCCGGG
GAAACGGGTT CTGGTAAAAC GACTCAGTTA CCGAAAATCT GTATGGAGCT GGGGCGCGGG
ATTAAAGGAC TGATCGGCCA TACCCAGCCG CGTCGTCTGG CGGCAAGAAC AGTGGCGAAC
CGTATTGCGG AAGAGCTGAA AACGGAGCCG GGCGGTTGCA TCGGTTACAA AGTGCGTTTC
AGCGATCACG TAAGTGATAA CACGATGGTC AAGCTGATGA CCGACGGTAT CCTGCTGGCG
GAGATCCAGC AAGACCGCCT GCTGATGCAG TACGACACTA TCATTATTGA CGAAGCGCAC
GAACGCAGCC TGAATATCGA TTTTTTGCTC GGCTATTTGA AAGAGTTGCT GCCGCGGCGT
CCTGACCTAA AAATCATTAT CACTTCCGCG ACTATCGACC CGGAACGCTT TTCGCGCCAC
TTTAATAATG CGCCGATTAT TGAAGTCTCC GGTCGGACCT ATCCGGTGGA AGTGCGCTAT
CGCCCGATTG TTGAAGAAGC CGATGACACC GAGCGCGATC AGTTGCAGGC GATTTTTGAC
GCCGTAGACG AACTGAGTCA GGAAAGCCAT GGCGACATTC TGATCTTTAT GAGCGGCGAG
CGGGAAATCC GCGATACCGC CGATGCGCTG AACAAGCTGA ACTTACGCCA TACCGAAATC
TTGCCGCTTT ATGCGCGGCT TTCGAACAGC GAACAAAATA GGGTATTCCA GTCGCACAGC
GGACGGCGCA TTGTGCTGGC GACCAACGTC GCGGAAACGT CGCTGACCGT ACCGGGGATT
AAATACGTTA TCGACCCCGG TACAGCGCGT ATCAGCCGCT ACAGCTATCG CACCAAAGTG
CAGCGTTTGC CGATTGAGCC GATTTCCCAG GCGTCTGCCA ATCAGCGTAA AGGCCGCTGT
GGTCGTGTGT CCGAAGGGAT CTGTATTCGT CTCTATTCCG AAGACGATTT CCTCTCGCGC
CCGGAGTTTA CCGATCCGGA GATTCTGCGT ACCAACCTGG CCTCGGTTAT TTTGCAGATG
ACCGCGCTGG GGCTGGGCGA TATCGCTGCG TTCCCGTTTG TCGAAGCACC GGATAAACGC
AATATCCAGG ATGGCGTGCG TCTGCTCGAA GAGCTGGGCG CGATCACCAC TGATGAACAG
GCCAGCGCCT ATAAACTGAC GCCGCTCGGT CGCCAGCTCT CGCAGTTGCC TGTCGACCCA
CGTCTGGCGC GTATGGTGCT GGAAGCGCAA AAACATGGCT GCGTGCGTGA GGCGATGATT
ATCACGTCCG CGCTCTCCAT TCAGGATCCG CGCGAACGTC CGATGGACAA ACAGCAGGCA
TCGGACGAAA AACATCGTCG CTTCCACGAC AAAGAGTCTG ACTTTCTCGC GTTTGTGAAT
CTGTGGAATT ATCTTGGCGA GCAGCAAAAG GCGCTTTCTT CCAACGCCTT CCGTCGCCTG
TGTCGTACCG ATTATCTCAA CTATCTGCGC GTGCGCGAAT GGCAGGATAT CTACACCCAG
TTGCGTCAGG TGGTGAAAGA ACTTGGCATT CCGGTTAACA GCGAACCGGC GGAGTATCGC
GAAATTCACA TTGCGTTGCT GACCGGTTTA CTTTCCCATA TCGGCATGAA AGATGCCGAT
AAACAAGAAT ATACCGGCGC ACGTAACGCG CGTTTCTCCA TCTTCCCCGG TTCTGGTTTA
TTCAAAAAAC CGCCTAAATG GGTAATGGTG GCGGAACTGG TAGAAACCAG CCGCCTGTGG
GGGCGCATTG CTGCGCGTAT CGACCCGGAA TGGGTGGAGC CAGTTGCTCA GCATTTGATT
AAACGCACCT ACAGCGAACC GCACTGGGAA CGGGCGCAGG GCGCGGTGAT GGCAACGGAA
AAAGTCACTG TTTATGGTTT GCCGATTGTT GCCGCGCGCA AGGTCAACTA CAGCCAGATC
GATCCGGCGT TATGTCGTGA ACTCTTTATT CGCCACGCGC TGGTGGAAGG TGACTGGCAG
ACGCGTCACG CATTCTTCCG TGAAAACCTG AAACTACGGG CGGAAGTAGA AGAGCTGGAA
CACAAATCAC GTCGCCGCGA TATTCTGGTT GATGACGAAA CGTTGTTTGA GTTCTACGAC
CAGCGCATCA GCCACGATGT AATCTCCGCT CGCCACTTCG ACAGCTGGTG GAAAAAAGTC
AGCCGCGAAA CGCCTGATTT GCTCAACTTT GAAAAAAGCA TGTTGATCAA AGAGGGCGCA
GAAAAAATCA GCAAGCTGGA TTACCCGAAC TTCTGGCATC AGGGCAATCT CAAGCTGCGT
TTGAGCTATC AGTTTGAGCC CGGCGCGGAT GCTGACGGTG TGACCGTACA TATTCCGCTG
CCGTTACTTA ACCAGGTTGA GGAAAGCGGG TTTGAATGGC AGATCCCCGG TCTGCGCCGC
GAACTGGTGA TTGCTCTGAT TAAATCGTTG CCGAAACCGG TACGCCGTAA TTTTGTACCC
GCGCCAAACT ATGCCGAAGC GTTTTTAGGC CGCGTCAAAC CGCTGGAGTT ACCGTTGCTC
GACAGCCTTG AGCGCGAGTT ACGGCGGATG ACCGGCGTTA CCGTTGACCG CGAAGACTGG
CACTGGGATC AGGTGCCCGA TCACCTGAAA ATTACCTTCC GCGTGGTGGA TGACAAAAAC
AAGAAGCTAA AAGAAGGGCG CTCGCTACAA GATCTGAAAG ATGCGCTGAA AGGCAAAGTG
CAGGAAACGC TATCTGCGGT GGCGGATGAC GGTATCGAGC AGAGCGGCTT ACATATCTGG
AGTTTTGGTC AGCTGCCGGA AAGCTACGAA CAGAAGCGTG GCAACTACAA AGTGAAGGCG
TGGCCGGCGC TGGTGGATGA GCGCGACAGT GTGGCGATCA AACTGTTTGA TAACCCGCTG
GAGCAAAAGC AGGCAATGTG GAACGGTCTT CGCCGTCTAC TGCTGCTGAA TATTCCATCG
CCAATCAAAT ATTTACATGA AAAGTTACCG AACAAAGCCA AGCTGGGACT GTACTTTAAC
CCGTATGGCA AAGTGCTGGA GCTGATCGAC GACTGTATCT CCTGCGGTGT GGATAAATTG
ATCGACGCCA ATGGTGGCCC GGTCTGGACG GAAGAAGGCT TTGCTGCGCT GCATGAAAAA
GTGCGTGCCG AACTGAACGA CACGGTGGTG GATATTGCGA AGCAGGTCGA GCAAATCCTT
ACGGCAGTGT TCAATATCAA CAAACGTCTG AAAGGGCGGG TGGATATGAC CATGGCGCTG
GGGCTTTCTG ACATTAAAGC GCAGATGGGC GGGTTGGTAT ATCGCGGTTT TGTCACTGGT
AACGGCTTCA AACGGCTGGG CGACACGCTG CGATATTTGC AGGCGATTGA AAAACGGCTG
GAAAAACTGG CGGTTGATCC ACATCGCGAC CGTGCGCAGA TGCTGAAAGT CGAAAACGTC
CAGCAGGCGT GGCAGCAATG GATCAACAAA CTGCCGCCCG CACGTCGTGA GGATGAAGAC
GTGAAAGAGA TCCGTTGGAT GATAGAAGAG TTGCGCGTTA GTTACTTCGC TCAACAACTT
GGTACGCCTT ATCCGATTTC AGATAAGCGT ATTTTGCAGG CGATGGAGCA GATTAGCGGT
TAA
 
Protein sequence
MTEQQKLTFT ALQQRLDSLM LRDRLRFSRR LHGVKKVKNP DAQQAIFQEM AKEIDQAAGK 
VLLREAARPE ITYPDNLPVS QKKQDILEAI RDHQVVIVAG ETGSGKTTQL PKICMELGRG
IKGLIGHTQP RRLAARTVAN RIAEELKTEP GGCIGYKVRF SDHVSDNTMV KLMTDGILLA
EIQQDRLLMQ YDTIIIDEAH ERSLNIDFLL GYLKELLPRR PDLKIIITSA TIDPERFSRH
FNNAPIIEVS GRTYPVEVRY RPIVEEADDT ERDQLQAIFD AVDELSQESH GDILIFMSGE
REIRDTADAL NKLNLRHTEI LPLYARLSNS EQNRVFQSHS GRRIVLATNV AETSLTVPGI
KYVIDPGTAR ISRYSYRTKV QRLPIEPISQ ASANQRKGRC GRVSEGICIR LYSEDDFLSR
PEFTDPEILR TNLASVILQM TALGLGDIAA FPFVEAPDKR NIQDGVRLLE ELGAITTDEQ
ASAYKLTPLG RQLSQLPVDP RLARMVLEAQ KHGCVREAMI ITSALSIQDP RERPMDKQQA
SDEKHRRFHD KESDFLAFVN LWNYLGEQQK ALSSNAFRRL CRTDYLNYLR VREWQDIYTQ
LRQVVKELGI PVNSEPAEYR EIHIALLTGL LSHIGMKDAD KQEYTGARNA RFSIFPGSGL
FKKPPKWVMV AELVETSRLW GRIAARIDPE WVEPVAQHLI KRTYSEPHWE RAQGAVMATE
KVTVYGLPIV AARKVNYSQI DPALCRELFI RHALVEGDWQ TRHAFFRENL KLRAEVEELE
HKSRRRDILV DDETLFEFYD QRISHDVISA RHFDSWWKKV SRETPDLLNF EKSMLIKEGA
EKISKLDYPN FWHQGNLKLR LSYQFEPGAD ADGVTVHIPL PLLNQVEESG FEWQIPGLRR
ELVIALIKSL PKPVRRNFVP APNYAEAFLG RVKPLELPLL DSLERELRRM TGVTVDREDW
HWDQVPDHLK ITFRVVDDKN KKLKEGRSLQ DLKDALKGKV QETLSAVADD GIEQSGLHIW
SFGQLPESYE QKRGNYKVKA WPALVDERDS VAIKLFDNPL EQKQAMWNGL RRLLLLNIPS
PIKYLHEKLP NKAKLGLYFN PYGKVLELID DCISCGVDKL IDANGGPVWT EEGFAALHEK
VRAELNDTVV DIAKQVEQIL TAVFNINKRL KGRVDMTMAL GLSDIKAQMG GLVYRGFVTG
NGFKRLGDTL RYLQAIEKRL EKLAVDPHRD RAQMLKVENV QQAWQQWINK LPPARREDED
VKEIRWMIEE LRVSYFAQQL GTPYPISDKR ILQAMEQISG