Gene Dole_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2026 
Symbol 
ID5694866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2455026 
End bp2456732 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content56% 
IMG OID641264624 
Productmetal dependent phosphohydrolase 
Protein accessionYP_001529907 
Protein GI158522037 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00491297 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAAA AAAATTCAAC GGAAGACCTT GACGTCAGTC GACTGTCCGC AGACAAAAAG 
CTCGGCCACC TGCTGGAGAG TGTCGTCCGT GAAGTGAAAC TTTATGCCGA AGGGCAGATT
GAACACATTC AGAAACTGGC CCAGATCGGC CTGGCCCTGT CCGGCCAGAA AAATCTCAAC
ACCCTGCTCG AGATGATCGT GGACGAGGCC CGGAAACTTT CCAGCGCCGA TGCCGGCACC
CTGTACATCG TGGAGCAGAA AAGCCGGTCC CTCCGGTTTG CCATTCTTCA AAACGACTCC
ATGAACATTC GAAAGGGCGG CGCAGGCGGC GACCTTTCCG ATGAAATGCC CAACGTTCCC
CTGGCCGACG AACAGGGCAA CCCCAACCAT GCCAATGTTT CCTCCTATGT GGCCCTGACC
GGGGAAAGCG TCAACATCGA AGATGTGTAT GAAGCCGGGG CGTTCGATTT TTCCGGCACC
AAGCGGTATG ACGCCGCCAC CGGCTACCGC TGCAAATCCA TGCTGGTCAT GCCGTTAAAA
AACCACGAGG ACAAGATTAT CGGCGTGTTG CAGCTGTTAA ACGCCAAGGA CCCCCAGACC
GGGGAAATCA TGAAGTTCCA TGCGGACATC GTGGGGCTGG TCGCTTCCCT GGCCTCCCAG
GCGGCCATCG CGCTGACCAA CACCCAGTTG ATCGAAGATC TCAAAGCTCT TTTCTACGCA
TTTATCAAAA GCATTGCCAC GGCCATTGAT GCCAAATCCC CTTTTACCGG GGGGCACATC
AACCGGGTGG TAAGCCTGAC CATGGATGTT GCCGAAGCGA TTCACGGCAC CAACACCGGT
CCTTTTGGAG AGATGCGCTT CACCGATGAC GAAATGGAAG AACTGCGCAT TGCCGCGTGG
ATGCATGACG TGGGCAAAAT CACCACGCCG GAGCATATTG TCAGCAAGAC CAACAAACTC
GAAGGCGTCT TTGATCGGAT TCACCTGATC GAAACACGGT TTCTGCTGAT CCTTCAGCTG
ATGGAAAACC GCCACCTGCG TGTCAAGATC GACCTCCTCA AAACCGACAA CAGTCCGGCG
GCCCTTAAAA AAATGGAGGC CATGGACCGC GAACTCCAGG CCCGGAAAGC GGAGATACTG
GAAAGCCTGG AACTTTTAAA GGCCGTAAAC ACGAACAAAG GCATGGTGGA TGAACGTGCG
GTAAAGCAGG TCCGAGAGAT TGCGGCCCGC ACCTACCATA TCGGCGGCAA CGCCTACCCC
TGGTTGTCTG AGAACGAGGC TGCCTGCCTG AGCATTCTCA AGGGCAACCT GCTGGACGAG
GAACGGCGGC TGGTGGAGCA GCATGCGGAG ATGACCATCA ACATCACCAG GGAACTCCCC
TTTCCGGACC GTTTTTCCCA CGTTCCCGAA TATGCCGGGG CCCATCACGA AAAGCTGGAC
GGTTCCGGAT ATCCTCTGGG ACTTACCGGT GACCAGATTC CCCTGCAGGC CAGGATCATC
GCCATTGCGG ATGTCTTCGA GGCCCTGACC GCGCCGGACC GGCCCTACAA ACGGCCCATG
CACATTTCAC AGGCATTGAA AATTCTTCAG GAGATGGCGG CGGCCGGCCA CATCGACGGG
GATATCGTCA GGATGTTTAT CAAACAAAAA GTTTACCAGG CATACGCGGA CAAAGAACTT
ACACCGGAGC AGCTCTCCAC TGCATAA
 
Protein sequence
MTQKNSTEDL DVSRLSADKK LGHLLESVVR EVKLYAEGQI EHIQKLAQIG LALSGQKNLN 
TLLEMIVDEA RKLSSADAGT LYIVEQKSRS LRFAILQNDS MNIRKGGAGG DLSDEMPNVP
LADEQGNPNH ANVSSYVALT GESVNIEDVY EAGAFDFSGT KRYDAATGYR CKSMLVMPLK
NHEDKIIGVL QLLNAKDPQT GEIMKFHADI VGLVASLASQ AAIALTNTQL IEDLKALFYA
FIKSIATAID AKSPFTGGHI NRVVSLTMDV AEAIHGTNTG PFGEMRFTDD EMEELRIAAW
MHDVGKITTP EHIVSKTNKL EGVFDRIHLI ETRFLLILQL MENRHLRVKI DLLKTDNSPA
ALKKMEAMDR ELQARKAEIL ESLELLKAVN TNKGMVDERA VKQVREIAAR TYHIGGNAYP
WLSENEAACL SILKGNLLDE ERRLVEQHAE MTINITRELP FPDRFSHVPE YAGAHHEKLD
GSGYPLGLTG DQIPLQARII AIADVFEALT APDRPYKRPM HISQALKILQ EMAAAGHIDG
DIVRMFIKQK VYQAYADKEL TPEQLSTA