Gene Dtox_3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3038 
Symbol 
ID8430032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3234812 
End bp3235819 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content45% 
IMG OID645035294 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_003192413 
Protein GI258516191 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.622298 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATTC GCCGGCAATC CGAACAATTA GAACGATTAT ACCTGTCTCC ATACGCTTGT 
TTTAGCGACA GCAGCCGGGG CAGGCTTTTC CCTGACAAAG AATGTGCGGT GCGAACCGTT
TTTCAGCGTG ACAGGGACAG AATTATTCAT TCCAAGAGTT TTAGGAGGCT GAAATATAAA
ACACAGGTAT TCATTATACC GGAAGGCGAT CATTTTCGTA CCAGATTGAC TCATACTTTG
GAGGTTGCTC AGATTGCCCG GACAATAGCC CGGGCTCTAC GACTAAATGA AGATCTTACT
GAGGCTATAG CGCTCGGACA TGATCTTGGA CATACTCCTT TTGGCCATGC CGGTGAAGCC
GCTTTAAACT CTATTATAGG GGAGACAGGT TTTAAGCATA ATTTACAAAG CCTGCGTGTT
GTTGATATTT TAGAGGGCGG ACGGGGCCTG AATCTAACGC ACGAAGTGAG AGACGGTATT
TTAAACCATA CCGGGCAGGT TCAGCCTCTG ACTCTGGAGG GACGGATAGT TAAGATTGCC
GATCGTATTG CTTATATAAA TCATGACATT GATGACGCAA TAAGAGGCGG TGTTTTAACA
ATGGAACAAT TGCCGGCTGC TTGCCTGGAT GTTCTGGGTT GGCAGCACAG GGAGCGGATT
AATACAATGG TCACAGATCT GATTAAAACA GGACTTTCGT GTCCCGGCAA AATCAGCATG
AGCGAGCCGA TTCAGCAGGC TACGGATGAA TTAAGGAGCT TTATGTTTCG TCATGTCTAT
ATTGGTTCGG AAGCCAAGCT GGAAGAAAAC AAAGCAGTTA ATCTTATCAG GGCGCTTTAT
AATTATTTCC TTAACAATCA GGCAGATTTG CCGGTTGAAC ATAGATTGAG GGTTGGAGAA
ACAGGCGTAA AACTGGTTAT AGCCGATTAT ATTGCAGGTA TGACCGATCG ATATGCTATA
GCCATGTTTA AAAAGTTATT TGTCCCGCAG GGCTTTCCGG TAGTCTAA
 
Protein sequence
MEIRRQSEQL ERLYLSPYAC FSDSSRGRLF PDKECAVRTV FQRDRDRIIH SKSFRRLKYK 
TQVFIIPEGD HFRTRLTHTL EVAQIARTIA RALRLNEDLT EAIALGHDLG HTPFGHAGEA
ALNSIIGETG FKHNLQSLRV VDILEGGRGL NLTHEVRDGI LNHTGQVQPL TLEGRIVKIA
DRIAYINHDI DDAIRGGVLT MEQLPAACLD VLGWQHRERI NTMVTDLIKT GLSCPGKISM
SEPIQQATDE LRSFMFRHVY IGSEAKLEEN KAVNLIRALY NYFLNNQADL PVEHRLRVGE
TGVKLVIADY IAGMTDRYAI AMFKKLFVPQ GFPVV