Gene Dtox_1965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1965 
Symbol 
ID8428947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2097401 
End bp2098771 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content51% 
IMG OID645034293 
Producthypothetical protein 
Protein accessionYP_003191424 
Protein GI258515202 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000122492 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000114295 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACGAT CTCCCAGGAG GTTAATCGTT GTATGTATGT TGTTTATCTT GTTAGTATTA 
ATGTCCTCAC TGGCGGCTGC AGTTGAAGCT TTTGTACAGC TTCCTTCTCA AGAGTTGATG
AAGGTGCGTG TCACTGGTAC CAAAAACTTC GGTAATGAAG TTGTGTTCGA CAAAGAGGTG
GAGACAAGAG TCAATTCAAC CGCGGGGGAT GCTCTGGGGC AGGCGGCAGA AATCGAAATG
TCCGGTGACT ATGTGGAGAC CGTCGCCGGG ATCAAAGGAA ACCAGCAGGT GTACTGGTTT
TATTATATTA ACGGTCTGAT GTCAAAGGCT TTTGCCTATG GCTATAAACT GCGTCCGGGG
GATGTGGAGA ATTGGGATTT TCACGATTGG ACTTTTTATA TGATGGGACC GTCAGCGATG
CTGGGAGCCT TCCCGGAGCC CTGCCTGCAT GGTTACGGGG GCAAGGTGGC GCCTACTATG
GTGGTTTTTG CCCCGGGGTT TGAGGAAGAG GCCGCCGGGC TCAGGGACCG GTTGATCGCT
CTTGGTGTGT CAGGTGTGAA AATGAAGAAT CAGAACGATT TGACTATTGA TGAGAAGAAA
AACGATAATC TGTTTATTAT CGCTACGGCG GACCAGCCGC TTATAGCAAG CATGAATGAG
CAGTTCAAGA TTCATGAGCC GGTTTATTTT TCCGACGGTA AAATCAAGAC GCGTGATTTC
AGCGGAAATG ACAGTCAAAC ATTTGGGGCG GGGTATGGAG TGCTGAATGT GATCCAAAAC
CCATGGAATC CTAAGGGCAG CTGGGCCTGT CAAGGTGCGG TTTGGGCGAT AACCGGTCTT
GACGAAACGG GTGTGCGTCG CGCCGCCAAG GTACTGACCG GTTTTCCGAA AGAATTGAGT
CATTCTTTTG CTCTTGTTAT CGGCAACGGG GAGGTAATTA AGACCCCGGT GGGGCCCGGT
GGGGCCAAAA CAGTGGCTGT CAACACCGAG TCCGGTCTCT CTCCAGTGTC CGGTCCCTCT
CCGGTATCCG GGGATTCCCA GACACCGGCT GTAAGCGCGG GGACTGTCGC CCCGGCCCAG
GAGCCTGCTA AGCAGGATAA TCAAGAAGAG AATAAAGCTG CGCAAAAAAC CGACGCAGCA
GATAAGACAG AGACTGCTGA TAGTTCGGAT ATCTCAAAAT CAAGTGATGA AAATCAATCC
TCAGAACTTC CTACTGCTTC AGTTCTTCCC ACACTGAAGG AAAATGTCGC GCGCCATTGG
TGGGTTCTGT TTCCGACGGT AGGAGTGGCT GCTGTTCCTG CCTGCTACTA TATAAAGAGG
CACCGCAAAC TAAAAGAGAC TGACAATGCC GAGGAGCAGG AGTTGATATG A
 
Protein sequence
MKRSPRRLIV VCMLFILLVL MSSLAAAVEA FVQLPSQELM KVRVTGTKNF GNEVVFDKEV 
ETRVNSTAGD ALGQAAEIEM SGDYVETVAG IKGNQQVYWF YYINGLMSKA FAYGYKLRPG
DVENWDFHDW TFYMMGPSAM LGAFPEPCLH GYGGKVAPTM VVFAPGFEEE AAGLRDRLIA
LGVSGVKMKN QNDLTIDEKK NDNLFIIATA DQPLIASMNE QFKIHEPVYF SDGKIKTRDF
SGNDSQTFGA GYGVLNVIQN PWNPKGSWAC QGAVWAITGL DETGVRRAAK VLTGFPKELS
HSFALVIGNG EVIKTPVGPG GAKTVAVNTE SGLSPVSGPS PVSGDSQTPA VSAGTVAPAQ
EPAKQDNQEE NKAAQKTDAA DKTETADSSD ISKSSDENQS SELPTASVLP TLKENVARHW
WVLFPTVGVA AVPACYYIKR HRKLKETDNA EEQELI