Gene EcDH1_0035 details

Gene Information

Locus tag: EcDH1_0035
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 33626
End bp: 35128
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: histidine kinase
Protein accession: ACX37733
Protein GI: 260447311
COG category:
COG ID:
TIGRFAM ID:
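
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(33626, 35128)
print(length)                           # 1503
print(expected_protein_length(length))  # 500
```

Both results agree with the Gene Length (1503 bp) and Protein Length (500 aa) fields of this record.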


Plasmid Coverage information

Num covering plasmid clones: 82
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGACGT TGTTCTCCCG CTTAATTACC GTTATTGCCT GCTTTTTTAT CTTCTCTGCC 
GCATGGTTTT GCCTGTGGAG TATCAGCCTG CATCTGGTTG AGCGCCCTGA TATGGCGGTG
CTGTTATTTC CGTTTGGTCT GCGTCTGGGG CTAATGCTGC AATGCCCGCG CGGATACTGG
CCCGTATTGC TGGGCGCGGA GTGGCTGCTG ATTTACTGGC TAACGCAGGC GGTCGGTTTA
ACCCATTTTC CGTTATTGAT GATCGGTAGT TTACTGACGT TACTGCCCGT AGCGCTGATC
TCGCGCTATC GCCATCAGCG TGACTGGCGC ACCTTGCTGT TACAGGGGGC GGCGTTAACG
GCGGCGGCGT TGTTGCAGTC GCTGCCCTGG CTTTGGCACG GCAAAGAGTC GTGGAATGCG
CTGTTGCTGA CTTTAACTGG CGGCCTGACG CTGGCCCCGA TATGTCTGGT GTTCTGGCAC
TATCTCGCCA ATAACACCTG GCTGCCGCTC GGTCCGTCAC TGGTTTCTCA GCCAATCAAC
TGGCGCGGGC GACATCTGGT CTGGTACTTG CTGCTGTTTG TTATCAGTCT CTGGCTCCAG
TTGGGATTGC CGGACGAACT GTCGCGCTTT ACGCCATTCT GTCTGGCGCT GCCGATTATC
GCGCTGGCCT GGCACTATGG TTGGCAAGGG GCGCTGATTG CGACGTTGAT GAACGCCATC
GCGCTGATCG CCAGTCAAAC CTGGCGCGAT CATCCGGTGG ATTTATTGCT CTCGCTGCTG
GTGCAAAGTC TGACAGGGTT GTTGCTTGGC GCTGGCATCC AGCGGTTGCG TGAACTTAAC
CAGTCGCTGC AAAAGGAACT GGCGCGCAAT CAGCATCTGG CTGAACGGTT GCTGGAAACC
GAAGAGAGCG TGCGCCGTGA TGTGGCGCGT GAGCTGCATG ATGATATCGG TCAGACCATC
ACTGCTATTC GTACTCAGGC GGGCATTGTT CAGCGGCTGG CGGCAGATAA CGCCAGCGTG
AAGCAGAGCG GGCAGCTCAT CGAACAACTA TCGCTGGGCG TTTACGACGC GGTGCGCCGT
TTGTTGGGTC GGTTACGTCC GCGCCAGTTG GATGATCTCA CCCTGGAGCA GGCCATCCGC
TCACTGATGC GGGAAATGGA GCTGGAAGGG CGCGGTATTG TCAGCCATCT CGAATGGCGA
ATCGATGAAT CAGCGTTAAG CGAAAACCAG CGCGTGACGC TGTTTCGTGT CTGCCAGGAA
GGGCTGAACA ACATTGTGAA ACATGCTGAT GCCAGCGCGG TCACCCTGCA AGGCTGGCAG
CAGGATGAAC GGTTGATGCT GGTTATTGAA GACGATGGCA GCGGTTTGCC GCCGGGTTCC
GGGCAACAAG GTTTTGGCCT CACCGGAATG CGCGAGCGCG TAACGGCGCT GGGTGGCACA
TTACACATTT CCTGTCTGCA CGGCACGCGT GTCAGCGTTT CTCTACCTCA ACGCTATGTC
TAA
 
Protein sequence
MKTLFSRLIT VIACFFIFSA AWFCLWSISL HLVERPDMAV LLFPFGLRLG LMLQCPRGYW 
PVLLGAEWLL IYWLTQAVGL THFPLLMIGS LLTLLPVALI SRYRHQRDWR TLLLQGAALT
AAALLQSLPW LWHGKESWNA LLLTLTGGLT LAPICLVFWH YLANNTWLPL GPSLVSQPIN
WRGRHLVWYL LLFVISLWLQ LGLPDELSRF TPFCLALPII ALAWHYGWQG ALIATLMNAI
ALIASQTWRD HPVDLLLSLL VQSLTGLLLG AGIQRLRELN QSLQKELARN QHLAERLLET
EESVRRDVAR ELHDDIGQTI TAIRTQAGIV QRLAADNASV KQSGQLIEQL SLGVYDAVRR
LLGRLRPRQL DDLTLEQAIR SLMREMELEG RGIVSHLEWR IDESALSENQ RVTLFRVCQE
GLNNIVKHAD ASAVTLQGWQ QDERLMLVIE DDGSGLPPGS GQQGFGLTGM RERVTALGGT
LHISCLHGTR VSVSLPQRYV
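
The sequence blocks above are printed in groups of ten characters separated by spaces. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The helper names are illustrative, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Codon table in the usual TCAG ordering; amino-acid assignments of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing unexpected characters are rendered as 'X';
    a trailing partial codon is ignored.
    """
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base groups of the gene sequence in this record.
excerpt = "ATGAAGACGT TGTTCTCCCG CTTAATTACC"
print(translate(excerpt))  # MKTLFSRLIT
```

The translated excerpt matches the first ten residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared against the GC content and Protein Length fields.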