Gene EcDH1_4073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4073 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4409495 
End bp4410868 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content55% 
IMG OID 
Producthistidine kinase 
Protein accessionACX41673 
Protein GI260451251 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGGCA GCTTAACCGC GCGCATCTTC GCCATCTTCT GGCTGACGCT GGCGCTGGTG 
TTGATGTTGG TTTTGATGTT ACCCAAGCTC GATTCACGCC AGATGACCGA GCTTCTGGAT
AGCGAACAGC GTCAGGGGCT GATGATTGAG CAGCATGTCG AAGCGGAGCT GGCGAACGAT
CCGCCCAACG ATTTAATGTG GTGGCGGCGT CTGTTCCGGG CGATTGATAA GTGGGCACCG
CCAGGACAGC GTTTGTTATT GGTGACCACC GAAGGCCGCG TGATCGGCGC TGAACGCAGC
GAAATGCAGA TCATTCGTAA CTTTATTGGT CAGGCCGATA ACGCCGATCA TCCGCAGAAG
AAAAAGTATG GCCGCGTGGA ACTGGTCGGT CCGTTCTCCG TGCGTGATGG CGAAGATAAT
TACCAACTTT ATCTGATTCG TCCGGCCAGC AGTTCTCAAT CCGATTTCAT TAACTTACTG
TTTGACCGCC CGCTATTACT GCTGATTGTC ACCATGTTGG TCAGTACGCC GCTGCTGTTG
TGGTTGGCCT GGAGTCTGGC AAAACCGGCG CGTAAGCTGA AAAACGCTGC CGATGAAGTT
GCCCAGGGAA ACTTACGCCA GCACCCGGAA CTGGAAGCGG GGCCACAGGA ATTCCTTGCC
GCAGGTGCCA GTTTTAACCA GATGGTCACC GCGCTGGAGC GCATGATGAC CTCTCAGCAG
CGTCTGCTTT CTGATATCTC TCACGAGCTG CGCACCCCGC TGACGCGTCT GCAACTGGGT
ACGGCGTTAC TGCGCCGTCG TAGCGGTGAA AGCAAGGAAC TGGAGCGTAT TGAAACCGAA
GCGCAACGTC TGGACAGCAT GATCAACGAT CTGTTGGTGA TGTCACGTAA TCAGCAAAAA
AACGCGCTGG TTAGCGAAAC CATCAAAGCC AACCAGTTGT GGAGTGAAGT GCTGGATAAC
GCGGCGTTCG AAGCCGAGCA AATGGGCAAG TCGTTGACAG TTAACTTCCC GCCTGGGCCG
TGGCCGCTGT ACGGCAATCC GAACGCCCTG GAAAGTGCGC TGGAAAACAT TGTTCGTAAT
GCTCTGCGTT ATTCCCATAC GAAGATTGAA GTGGGCTTTG CGGTAGATAA AGACGGTATC
ACCATTACGG TGGACGACGA TGGTCCTGGC GTTAGCCCGG AAGATCGCGA ACAGATTTTC
CGTCCGTTCT ATCGTACCGA TGAAGCACGC GATCGTGAAT CTGGCGGTAC AGGTTTGGGG
CTGGCGATTG TTGAAACCGC CATTCAGCAG CATCGTGGCT GGGTGAAGGC AGAAGACAGC
CCGCTGGGCG GTTTACGGCT GGTGATTTGG TTGCCGCTGT ATAAGCGGAG TTAA
 
Protein sequence
MIGSLTARIF AIFWLTLALV LMLVLMLPKL DSRQMTELLD SEQRQGLMIE QHVEAELAND 
PPNDLMWWRR LFRAIDKWAP PGQRLLLVTT EGRVIGAERS EMQIIRNFIG QADNADHPQK
KKYGRVELVG PFSVRDGEDN YQLYLIRPAS SSQSDFINLL FDRPLLLLIV TMLVSTPLLL
WLAWSLAKPA RKLKNAADEV AQGNLRQHPE LEAGPQEFLA AGASFNQMVT ALERMMTSQQ
RLLSDISHEL RTPLTRLQLG TALLRRRSGE SKELERIETE AQRLDSMIND LLVMSRNQQK
NALVSETIKA NQLWSEVLDN AAFEAEQMGK SLTVNFPPGP WPLYGNPNAL ESALENIVRN
ALRYSHTKIE VGFAVDKDGI TITVDDDGPG VSPEDREQIF RPFYRTDEAR DRESGGTGLG
LAIVETAIQQ HRGWVKAEDS PLGGLRLVIW LPLYKRS