Gene Dhaf_4780 details


Gene Information

Locus tag: Dhaf_4780
Symbol:
ID: 7261809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 5106103
End bp: 5108331
Gene Length: 2229 bp
Protein Length: 742 aa
Translation table: 11
GC content: 52%
IMG OID: 643564691
Product: histidine kinase
Protein accession: YP_002461211
Protein GI: 219670776
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
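
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(5106103, 5108331)
print(length_bp)                  # 2229
print(protein_length(length_bp))  # 742
```

Both values agree with the Gene Length and Protein Length fields listed in the record.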


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAGA AAAGATTTCC CTGGTGGATG AGGATTAAAG CCCGGGTTTT GGTTTTTGGA 
GTACTGATGT CGGCGGTGCC TTTGCTCATT TTGGGTTTGG CAAGTTTTAC GGCAGCCCAG
GCTTATCTGG AAGAAAGTAT TCAGAAACAA AATTCCGAGC GGGCTGCCCT TTTAGCCGGA
CAGATTCAGG ATTTTATCCA GAATAATGCC GATAGCTTAA TCCAGGTGAC TTCCACCAAC
GCTTTGGAAT TGGTGGGGGC GGACCCACTG GCCAGAGAAA CCGTACTGGG AACGATGCTG
CGCGGGATAC CCTATCTGGA GAGTTTGCAG GTGGCTGACC CTCAGGTTCA GGTTCTCGGC
AAGGTTTCCC GACGGGAAGT GGACTATCCG GTGCAAGCGG GGGAAACTCT TCCCTGCCTG
GATTTCTCTG CTTCGGAAAG CTATTCCCTC AGTGAAGTTT TTTTCTCTGT GGATGGCCGG
CCCCAGGTGT ATTTGACGGT GAATATCATC GATCCCCAGA CCCGCAGGAA TCTGGGCTAC
CTCCAAGCGA AGACGGACCT TAAGGCTCTT TTCAATAAAT TTACCAGTAT CCAAATTGGG
CAGGAGGGCA TCATCTATCT TACGGATGGG AAAGGAAAGC TCATTGGTCA TTCCGATTTC
AGCCGGGTGC TGAGTCAAGA GGATATGACC CGAAACCCCA GCGTCCGCAA TTTCTTGGCT
GGGAAGCCCC CCAGCCTTGC CGGGAATGAG TATCCCAATA CCGATGATAC TCCGGTCCTT
GGACTGTATG CGCCTGTAGG CAGTCCCCCT TGGGGAGTAT TTATTGAGCA GCCGGTGCAG
GAAGCTTATG AGCCCATTGC CCGCTTTGCC TTGCGGGTGA TGGGAATGAT GCTGGTTATT
ATTCTGGGGG TGACCTTGAT TAGCATCTAT TTTGGTCTGA AACTGACTCA ACCCATAGAG
AATTTGGAAG CGGGAGTTCG GAGGATTATT GCTACGGAGG ATCTACAGGC CGAAGTGACT
CAGGAAAGTG ATGATGAAAT TGGCCGGCTG GTGCAAGCCT TTAATAATCT TCTGCGCCGG
CTTGCCGATA AGACGGCTAA TCTGCAGGCT GAGCAGGAGC TGTTGGAAAC GGTAGTCCAT
GGGATTGGGG CAGGGATGGC TCTTTTGGAT CAGGAAAAAC GGCTTATCTG GTGGAATTCC
CTATTTGCCC GGTGGTTTGG CTCAGAGAAT AAAAACTTTA AGAAGCTGGC CTGTGAAGAA
CTGCTGCGCG GGGAAGGGCC GGAATCTTCC TTTGAAGAAA ACGGCAGGGT ACTGGCTTTG
GAAGTACAGG GAGACAAACG TTATCTGCGC CATTCTTATT ACCGGCTCAA CTCCGGAAAC
CCGGAGAATG CCGCTTATTT GCTGCTTCTG GAAGATGTGA CCCAGCAGGT AGAGATGGAG
GCCCGGGTCA TTCAGACAGA GAAAATGGCG ACCGTTGGGC TGTTAGCCTC CGGGGTGGCT
CATGAGATCA ACAATCCCCT GGCCATTCTT TCCGCCCACA ATGAGGATCT GCTGGATCGG
CTCCAGGAGG AAGGGGAACT GCCCGGCAAG GCTGAGATTG AAGGTATCCT GAGTATCATC
GCCAAACAGA TTGAGCGCTG CAAACAGGTG ACCGGCAGGC TTCTGGGCTA TGCCCGGCCG
GGCAGGCATG GTCCGGACAG GATGGATGCC AATAATGCCA TTGAGCAGAC GGCAGCTCTC
TTGGCCTATC GTCTTAAACA AAAGAAAATG GTCCTCATCA AAGAAAGTGA ACCCGGTCTC
TGGGTGGAGG GTGATGAAAA CGAATGGCAG CAGGTGGTGC TCAATATCCT AACCAATGCT
ATCGATGCCT CTGCAGAAGG CAGTCAGGTC ATCGTGCGGG CTCAACGGGT TAAGAGACCT
TTTGCCTCGG CGGAGGCCTT GCCTATAGAT AAAGGGGATG AAATACAAAT TGAGGTGGAA
GACCAAGGGC AGGGGATTTC AGCCCAGTAC CTTAAGAAGG TGTTTGATCC CTTCTTTACC
ACCAAACCTC CGGGTCAGGG CACGGGTCTG GGGCTTTTTG TCAGCTATGG TATCGTGCAA
AAAATGCAGG GTAAGCTGTT TATTGAGAGT ACCGAGGGGA AAGGGACAAC CGTTCGCATT
AATCTTCCCT TTCAGGGAAG GGGGGTAGGC CATGAATGTT CATCAACGTG TCTTGATCTT
AGACGATGA
 
Protein sequence
MSEKRFPWWM RIKARVLVFG VLMSAVPLLI LGLASFTAAQ AYLEESIQKQ NSERAALLAG 
QIQDFIQNNA DSLIQVTSTN ALELVGADPL ARETVLGTML RGIPYLESLQ VADPQVQVLG
KVSRREVDYP VQAGETLPCL DFSASESYSL SEVFFSVDGR PQVYLTVNII DPQTRRNLGY
LQAKTDLKAL FNKFTSIQIG QEGIIYLTDG KGKLIGHSDF SRVLSQEDMT RNPSVRNFLA
GKPPSLAGNE YPNTDDTPVL GLYAPVGSPP WGVFIEQPVQ EAYEPIARFA LRVMGMMLVI
ILGVTLISIY FGLKLTQPIE NLEAGVRRII ATEDLQAEVT QESDDEIGRL VQAFNNLLRR
LADKTANLQA EQELLETVVH GIGAGMALLD QEKRLIWWNS LFARWFGSEN KNFKKLACEE
LLRGEGPESS FEENGRVLAL EVQGDKRYLR HSYYRLNSGN PENAAYLLLL EDVTQQVEME
ARVIQTEKMA TVGLLASGVA HEINNPLAIL SAHNEDLLDR LQEEGELPGK AEIEGILSII
AKQIERCKQV TGRLLGYARP GRHGPDRMDA NNAIEQTAAL LAYRLKQKKM VLIKESEPGL
WVEGDENEWQ QVVLNILTNA IDASAEGSQV IVRAQRVKRP FASAEALPID KGDEIQIEVE
DQGQGISAQY LKKVFDPFFT TKPPGQGTGL GLFVSYGIVQ KMQGKLFIES TEGKGTTVRI
NLPFQGRGVG HECSSTCLDL RR
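
The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC fraction calculation of the kind that could be used to recompute the GC content field from the gene sequence. The function names are illustrative assumptions, and the example runs only on the first block of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 10-base block of the gene sequence, used here only as a small example.
first_block = clean_sequence("ATGAGTGAGA ")
print(len(first_block))         # 10
print(gc_content(first_block))  # 0.4
```

Passing the full gene sequence text through `clean_sequence` and then `gc_content` gives the whole-gene fraction, which can be compared with the percentage reported in the Gene Information section.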