Gene Dole_0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0439 
Symbol 
ID5693259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp506560 
End bp507633 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content58% 
IMG OID641263021 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_001528326 
Protein GI158520456 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0162186 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATTC GTTCAGATTT TGAAGAGCGG GAAAAAACCT TTATATCTCC TTATGGATGC 
CTGAGTTCCA ACCTTCGGGG CCGGGACCAC GAGGAGCCGG AGGGCGCCAT CCGCACCGAG
TTCCAGCTGG ACCGGGAGCG GATTGTTTAT TCTAACGCTT TTCGGCGGTT GAAGCACAAA
ACCCAGGTGT TTCTGTCGCC CCTGGGAGAC CAGTACCGCA CCCGGCTGAC CCACACCCTG
GAGGTGGCCC AGATATCCCG CACCCTGGCC CGGGCCATGC GGCTCAACGA AGACCTGGCC
GAGGCCGTGG CCCTGGGCCA CGACCTGGGC CACACGCCCT TTGGCCACAG CGGGGAGACC
GTGCTGCGAA AGATATCTCC CGCCGGTTTC GCCCACAACG AACAGAGCCT GCGGGTGGTG
GAGAAGCTGG AAAACAACGG TAAAGGCTTG AACCTCACCT TTGAAGTGCG GGACGGTATT
CTCAAGCACT CCAAGGGATA CGGCAACATT CTGGACGACG ACCCCAGCGA GATGGCAATC
ACCGTGGAAG GCCGCATCGT GCGGGTGGCC GACATCATGG CCTACCTGAA CCACGACCTG
GACGACGCCC TGCGGTGCAA TGTGATCGAG CGATCTCACA TTCCGGAAAA ATGCGTGAAG
GTGCTGGGCA AAAACCACTC TGAACGGGCC ACCACCATGA TCCGGGACGT GGTCTACTCC
AGCAGCTCCG AAGACGGCCT GCTGCGGCTG CGCATCAGCG ACCCGGTGTT TGAGGCCATG
ACCGAACTGC GCCATTTTCT GTACGACCAT GTGTACCGAT CTCCCAAGGT GCATGCCGAG
TTTGAAAAGG CCAACCGCAT TCTCACCGAG CTGTACGAGT TTTTCTACAA GCATACCGAC
ATGCTGGAGG CTGAACTTAA AAAAATGGAG ATGGGCAACT GCATGGACAC GGACGACACC
GACCGGGTGG TGTGCGACTT TATCGCCAGC ATCACCGACG AATACGCCCT GGCCCTCTAC
TCCAAGCTTT TTTTCCCCAC ACCCATTATC TATCCGGGGC CGATGCATGT CTGA
 
Protein sequence
MSIRSDFEER EKTFISPYGC LSSNLRGRDH EEPEGAIRTE FQLDRERIVY SNAFRRLKHK 
TQVFLSPLGD QYRTRLTHTL EVAQISRTLA RAMRLNEDLA EAVALGHDLG HTPFGHSGET
VLRKISPAGF AHNEQSLRVV EKLENNGKGL NLTFEVRDGI LKHSKGYGNI LDDDPSEMAI
TVEGRIVRVA DIMAYLNHDL DDALRCNVIE RSHIPEKCVK VLGKNHSERA TTMIRDVVYS
SSSEDGLLRL RISDPVFEAM TELRHFLYDH VYRSPKVHAE FEKANRILTE LYEFFYKHTD
MLEAELKKME MGNCMDTDDT DRVVCDFIAS ITDEYALALY SKLFFPTPII YPGPMHV