Gene EcDH1_3006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3006 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3227226 
End bp3228884 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content50% 
IMG OID 
Productsignal transduction histidine kinase regulating citrate/malate metabolism 
Protein accessionACX40634 
Protein GI260450212 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGCAGC TTAACGAGAA TAAACAGTTT GCATTTTTCC AAAGACTGGC ATTTCCGCTG 
CGTATCTTTT TGCTGATTCT GGTGTTCTCA ATATTTGTCA TTGCAGCCCT GGCGCAATAT
TTTACGGCCA GTTTTGAGGA CTATTTAACG CTTCATGTAC GCGACATGGC AATGAATCAG
GCGAAAATTA TTGCCTCCAA TGACAGTGTC ATCTCTGCGG TGAAAACGCG TGACTACAAA
CGGCTGGCGA CCATCGCTAA CAAATTACAA AGAGATACCG ATTTTGATTA TGTGGTGATT
GGGGACCGGC ACTCGATCCG CCTTTACCAT CCTAATCCGG AGAAAATTGG TTATCCTATG
CAGTTCACCA AACAGGGCGC GCTGGAGAAA GGGGAGAGCT ACTTCATTAC CGGGAAAGGG
TCAATGGGGA TGGCGATGCG CGCCAAAACG CCAATCTTTG ATGACGATGG AAAAGTCATC
GGCGTGGTGT CGATTGGCTA CCTGGTGAGT AAAATCGATA GCTGGCGGGC TGAGTTTTTA
TTACCGATGG CAGGTGTGTT TGTCGTGCTG TTAGGGATTC TGATGTTGCT GTCGTGGTTC
CTGGCCGCGC ATATCCGTCG GCAGATGATG GGCATGGAGC CAAAGCAAAT CGCACGCGTG
GTCCGTCAGC AAGAGGCGCT GTTTAGTTCG GTTTATGAAG GGCTGATTGC GGTGGATCCG
CATGGTTACA TTACCGCCAT CAATCGTAAC GCAAGAAAGA TGCTGGGGCT GAGTTCCCCC
GGACGGCAAT GGTTGGGTAA ACCCATTGTT GAAGTGGTCA GGCCCGCCGA TTTCTTTACC
GAACAGATTG ATGAAAAACG TCAGGATGTG GTGGCGAACT TTAACGGTCT GAGCGTTATT
GCCAACCGGG AAGCTATTCG TTCAGGTGAT GATTTGCTGG GGGCCATTAT CAGCTTTCGT
AGTAAAGACG AAATTTCCAC CCTCAATGCG CAACTGACGC AAATAAAACA ATACGTTGAG
AGCCTTCGTA CATTGCGACA CGAGCATCTC AATTGGATGT CGACGCTCAA TGGTCTGTTG
CAGATGAAAG AGTATGATCG CGTGCTGGCG ATGGTGCAGG GGGAGTCTCA GGCCCAGCAA
CAGCTTATTG ACAGCCTGCG CGAGGCGTTT GCCGATCGCC AGGTGGCGGG GCTGCTTTTT
GGTAAAGTGC AGCGCGCCCG GGAACTGGGG CTAAAAATGA TCATTGTCCC CGGTAGCCAG
CTTTCGCAAC TGCCGCCAGG ACTGGATAGC ACCGAGTTTG CAGCCATTGT GGGCAATTTA
CTTGATAACG CCTTCGAAGC CAGCCTGCGT AGCGATGAAG GAAACAAGAT CGTTGAATTA
TTCCTCAGCG ATGAAGGCGA TGATGTGGTG ATTGAAGTCG CCGATCAGGG CTGCGGCGTT
CCAGAGTCTC TACGAGACAA AATATTTGAG CAGGGGGTCA GTACGCGTGC TGACGAGCCC
GGTGAACATG GCATTGGGTT GTACTTGATT GCCAGCTACG TAACGCGCTG CGGTGGTGTT
ATCACTCTCG AAGATAATGA TCCCTGCGGT ACCTTATTTT CAATCTATAT TCCGAAAGTG
AAACCTAATG ACAGCTCCAT TAACCCTATT GATCGTTGA
 
Protein sequence
MLQLNENKQF AFFQRLAFPL RIFLLILVFS IFVIAALAQY FTASFEDYLT LHVRDMAMNQ 
AKIIASNDSV ISAVKTRDYK RLATIANKLQ RDTDFDYVVI GDRHSIRLYH PNPEKIGYPM
QFTKQGALEK GESYFITGKG SMGMAMRAKT PIFDDDGKVI GVVSIGYLVS KIDSWRAEFL
LPMAGVFVVL LGILMLLSWF LAAHIRRQMM GMEPKQIARV VRQQEALFSS VYEGLIAVDP
HGYITAINRN ARKMLGLSSP GRQWLGKPIV EVVRPADFFT EQIDEKRQDV VANFNGLSVI
ANREAIRSGD DLLGAIISFR SKDEISTLNA QLTQIKQYVE SLRTLRHEHL NWMSTLNGLL
QMKEYDRVLA MVQGESQAQQ QLIDSLREAF ADRQVAGLLF GKVQRARELG LKMIIVPGSQ
LSQLPPGLDS TEFAAIVGNL LDNAFEASLR SDEGNKIVEL FLSDEGDDVV IEVADQGCGV
PESLRDKIFE QGVSTRADEP GEHGIGLYLI ASYVTRCGGV ITLEDNDPCG TLFSIYIPKV
KPNDSSINPI DR