Gene Dole_2004 details


Gene Information

Locus tag: Dole_2004
Symbol:
ID: 5694844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2426708
End bp: 2428612
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 57%
IMG OID: 641264602
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001529885
Protein GI: 158522015
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
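
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the record states neither, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2426708, 2428612)
print(length, protein_length_aa(length))  # prints: 1905 634
```

Under those assumptions the result agrees with the Gene Length (1905 bp) and Protein Length (634 aa) fields of this record.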


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000423635
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAACGACG ATTTCAACCA TGACCATCAG GATCTGCTTT CCCAGATCGT TTCGGGCAGT 
CCGATTCCCA CTTTTGTGAT CGACCGGAAT CACCGGGTGA CCCACTTCAA CAAGGCCTGC
GAAGTGCTTA CCGGGAGAAA GGCGGAAGAG ATCATTGGAA CCGACGGCCA GTGGCGGGCC
TTTTATGCGG AAAAGCGCCC GGTGATGGCG GATATTGTTC TCGATGCCGC CATCGAAGAT
ATCGCGGGGA TGCTTGAAAC CCATTACCAG GGGAAGTTCC GGGCCTCGTC GGTCAGGTCC
GGGGCCTTTG AGGCCAGGGA TTTTTTCCCG GCCCTGGGCG AGGACGGGAA ATGGTTGTTT
TTTACGGCAG CCCCCATCCG AAACGCGGAC GGAGATGTTA TCGGCGCCAT AGAGACCCTT
CAGGACATCA CCCAGGAGAA GCGGGTGTCC CACCTGAACC GGTCCATGCT CCGGGTCAGC
AAGGCCCTGC ACCGGTATTC GTATCTGGAT GACCTGTTGT CCTTTATCAG CCAGGAAATC
AAAAAACTGC TGGGGGCGGA AGGGGCACTG GTGCTTTTAC TGGACGCCGA AACCAACGAG
TTGTACACCT CGGGACTGGC CTATGATGAC CCGGACCGTG AAAAACGGAT GAAAGAAGTC
CGCTTTTCCC TGGATGAGGT GTTGGCCGGG CAGGTGATTC GAACCGGCGA GCCGGTGGTG
ATGCATGATG CCGAGGCTCT GCCCCAATAC GCGGAGCGGG ACAGAAAAAT CGGATACACC
ACCCGGAGTC TCCTGGAAGT GCCCCTCGTG GTCGAGGACC GCACCATCGG TGTGCTTGCC
GGTATCAACA AAAAAGAGGG GCGCTTCTCC GGGCAGGATA TGGATGCCCT TGCCGCGCTG
GCCGGTACCG TGGCCCTGGC CATTGAGAAC ACCCGGCATC AAGAGGGGCT CCGGGCCTTT
TATCGTGAGG TGCAATCCCT GAACACGGCC AAGGGCAAGA CGATCAACCA CTTGTCCCAT
GAATTGAAAA CCCCGGTGGC CATTCTGTCC CAGGCCCTGC CCCTGCTCGA AGACGAGCTT
GCGGTTGTGC CCGAGGATAA CTGGAAACCC TATGTGGAAA TTATCGAGCG CCAGCTTAAC
CGCATCGTCG CAATCGAAAG CGAGGTGTCG GACATCATTA CGGACAAAGA ATACAAAGTT
GCGGGGCTTT ACAACGGCAT GGTCCTTCAA TGTGCCGACC TGCTTTCCGT TCTGGCCTTG
AAACATCAGG GCAAAAAAGA CATGGTGGAC CAGATTACCA CCCATATTCG GGAACTGTTT
TCACCGAAAA CCATGGTGGC CGCCACCATT GATCCCGGAA CGTTTATCCA GGGCCGTCTC
CGGGCGCTTG AACCCGGCTT TGCCCACCGG CAGGTGGATG TGCGTGTTCT TTTAGAGCCC
ACCCGGCCCA TTCAGATTCC AGAGGTGATT TTTGAAAAAG TCATGGACGG CCTGATCCGA
AACGCAATCG AAAACACACC TGATGAAGGC CGTGTGGATG TGGCCGTCTT TGAAAAAGGA
GGGTTGGTTC AGGTCTCGGT GAAGGATTAT GGGGTGGGCA TTCGAAAAGA CCATCAGGCC
AGAATTTTCG AAGGCTTTTT CCCTACCCAG GACATTCTGG CTTACAGCAC CCGGAAGCCT
TTTGATTTCA ACGCCGGCGG CAAAGGCGCC GATCTGCTCC GTATGAAGAT TTTTTCCGAA
ACCTGCGGTT TCACCCTGGA CATGGCGTCT ACCCGGTGTG CTGTGCTGGA TGCGCCCCGC
CAGGAATGCC CGGGCCGGAT ATCTGCGTGC CCTTCCGCCA CCGTAACCCG CCCCTGCCAT
GAATCAGGCG GAACCGAATT CATCCTCTTT TTCCACGGAG CCTGA
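
A GC content figure like the one in the Gene Information section can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content is the percentage of G and C among the A, C, G and T bases; the record does not say how its value was computed or rounded, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace (the 10-base grouping used in this record) is ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First three 10-base groups of the gene sequence, pasted as they appear.
first_groups = "TTGAACGACG ATTTCAACCA TGACCATCAG"
print(round(gc_percent(first_groups), 1))
```

Passing the full gene sequence instead of `first_groups` gives a value to compare against the reported GC content.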
 
Protein sequence
MNDDFNHDHQ DLLSQIVSGS PIPTFVIDRN HRVTHFNKAC EVLTGRKAEE IIGTDGQWRA 
FYAEKRPVMA DIVLDAAIED IAGMLETHYQ GKFRASSVRS GAFEARDFFP ALGEDGKWLF
FTAAPIRNAD GDVIGAIETL QDITQEKRVS HLNRSMLRVS KALHRYSYLD DLLSFISQEI
KKLLGAEGAL VLLLDAETNE LYTSGLAYDD PDREKRMKEV RFSLDEVLAG QVIRTGEPVV
MHDAEALPQY AERDRKIGYT TRSLLEVPLV VEDRTIGVLA GINKKEGRFS GQDMDALAAL
AGTVALAIEN TRHQEGLRAF YREVQSLNTA KGKTINHLSH ELKTPVAILS QALPLLEDEL
AVVPEDNWKP YVEIIERQLN RIVAIESEVS DIITDKEYKV AGLYNGMVLQ CADLLSVLAL
KHQGKKDMVD QITTHIRELF SPKTMVAATI DPGTFIQGRL RALEPGFAHR QVDVRVLLEP
TRPIQIPEVI FEKVMDGLIR NAIENTPDEG RVDVAVFEKG GLVQVSVKDY GVGIRKDHQA
RIFEGFFPTQ DILAYSTRKP FDFNAGGKGA DLLRMKIFSE TCGFTLDMAS TRCAVLDAPR
QECPGRISAC PSATVTRPCH ESGGTEFILF FHGA
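
The protein sequence begins with M although the gene sequence begins with TTG, which normally encodes leucine. That is consistent with translation table 11, where TTG can act as an initiation codon. The sketch below illustrates this, assuming table 11 assigns the same amino acids as the standard code and accepts TTG, CTG, ATT, ATC, ATA, ATG and GTG as start codons; `translate_cds` is a hypothetical helper, not the tool that produced this record.

```python
from itertools import product

# Amino acids in NCBI order: each codon position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the first codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGAACGACG ATTTCAACCA TGACCATCAG"))  # prints: MNDDFNHDHQ
```

The output matches the first ten residues of the protein sequence listed above.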