Gene Dole_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1124 
Symbol 
ID5693958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1333981 
End bp1335144 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content59% 
IMG OID641263718 
Productmetal-dependent phosphohydrolase 
Protein accessionYP_001529008 
Protein GI158521138 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000827426 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCCCTA CATGGAATAC CATTAAGTCC GTACTTGAGC GGCGTGAACA GCAGACCCTT 
TCCTCCCGGG CCGAGTCCAG CGCGGCCGGC ATTCGGCGAA GAAGTGAAGA CCAACTGGAC
CGGGACTACC GGCTGGCCTT TGCCGTGGAC GTGGACCGCA TTCTTCACTC CCTGGCCTAT
ACCCGGTACA TCGACAAGAC CCAGGTCTTT TACCTGATTG AAAACGATCA CATCACCCAC
CGGGTGCTGC ATGTGCAGCT GGTGTCCAAG GTGGCCCGCA CCATCGGCCG GGCCCTGGGC
TTAAACGAAG ACCTGATCGA GGCCATTGCC CTGGGCCACG ATATCGGACA CGCCCCTTTC
GGCCATGAAG GCGAGACCTA CCTGTCGAAG CTGTGCGAAC AGGCGGGCAT CGGCCCGTTT
CTGCACAACG TGCAGAGCGT CCATTTTCTT GAATCGGTGG AGCGAAAGGG CAGGGGGTGC
AACCTCTGCC TTCAGACACT GGACGGCATT CTGTGCCATG ACGGGGAGAT CCACACCACC
AGCCTGAAGG CCGACCGGAA GAAAAATTTT CAGACTTTTG AAGAGGAGAT CGCGGCCAAG
CGGCGTGATC CCTCCCTGCA ACTGACCCCC ATGACCATGG AGGGGTGCGT GGTGCGGTTT
GCCGACACCA TCAGCTACAT CGGCCGGGAC ATCGAAGACG CCATTCGGCT GGGCCTGGTC
CGGCGGGAGG ATCTCCCGGC GCAAAGCACG ACCGTGCTGG GCGACACCAA CGGCAAGATC
GTCTACAGCC TGGTGACCGA CGTCGTGACC CAGAGCATGG ACAAGGACCA TGTGGCCTTC
AGCGAGGCGG TGTCCGCGGC TCTGCGGGCC CTGAAGCGGT TCAACTATGA GCATATCTAC
ATGAACCAGA GGATCAAGTC CGCTTCCAAC CGTATTGAGT CCCTGTTTGC CCTGCTTTTT
GAACGATACC ATGGCGACCT GGAGGCGGAT AACCGGTCAT CGGTGATTTT CACCCATTTT
TTAAAAGACA TGTCCCCGGA CTACCTGGAG CGGCACACGC CGCCGGAGGT TGTTCGGGAT
TTCATATCCG GCATGACGGA CAACTATTTT CTGCGCCAGT GTCCGCCGGA CATGCAGCCG
GTGCTTGATG TGGCCGGAAC GTGA
 
Protein sequence
MSPTWNTIKS VLERREQQTL SSRAESSAAG IRRRSEDQLD RDYRLAFAVD VDRILHSLAY 
TRYIDKTQVF YLIENDHITH RVLHVQLVSK VARTIGRALG LNEDLIEAIA LGHDIGHAPF
GHEGETYLSK LCEQAGIGPF LHNVQSVHFL ESVERKGRGC NLCLQTLDGI LCHDGEIHTT
SLKADRKKNF QTFEEEIAAK RRDPSLQLTP MTMEGCVVRF ADTISYIGRD IEDAIRLGLV
RREDLPAQST TVLGDTNGKI VYSLVTDVVT QSMDKDHVAF SEAVSAALRA LKRFNYEHIY
MNQRIKSASN RIESLFALLF ERYHGDLEAD NRSSVIFTHF LKDMSPDYLE RHTPPEVVRD
FISGMTDNYF LRQCPPDMQP VLDVAGT