Gene Dtox_4072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4072 
Symbol 
ID8431086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4240187 
End bp4241209 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content40% 
IMG OID645036273 
Productputative transposase 
Protein accessionYP_003193371 
Protein GI258517149 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAAA GTGGCCAACC TAACCTAAAA CATCCACACC ATCCCCATGA CAAGGGTTAT 
AAGCAGCTTC TCAGCAATAA AAAGATATTT CTGGAATTAA TCAAAACCTT TGTGCAAGAG
GATTGGGTAA ACGAAATAGA AGAAGATGGG TTGCTCCTGG TTGATAAGTC ATTCGTGCTG
GAAGATTTCA GTGAAAAAGA GGCAGATGTG GTGTATCGGC TCCGGACGAA GGAAAAGGAT
GTCATATTCT ACGTACTGCT GGAGTTACAA TCTACTGTAG ATTTTCTGAT GCCCTTTCGT
TTACTGCAGT ACATGGTGCA GATATGGAGA GAAACGTACA ATAACACACA GAAAGAAGAA
AGGGATCGTA AAGACTTTAG GCTGCCGGCT ATAGTACCGG CGGTATTGTA CAATGGCAAG
AACAACTGGA CGGCAAAGAT GAATTTCAGG GAAATGCTGT CTGACTATCA AATATTCGGT
GGGCGGGTAC TGGACTTTAA TTACATACTG TTTGATGTCA ACAGATACAA AGATGAGCAA
CTGTACGAAA TGGCAAACCT GATAGCCAGC GTATTTGTTC TGGATCAGAC AATGAACCAT
AAGGAGTTAA TTAGACGCCT GAGGAAATTA ATCATGGTAT TGAGAAAACT AACTCAGGAT
GAATTTAGGC AAATTATGAT TTGGCTCAAA AATGTCATCA AGCCGAGAAT GCCGGTCCAT
TTGCATAAGG AAGTTGACCG CATTATGGAG GAAGCCAACC AGTTGGAGGT GGAATTTATG
ATTATGAACC TCGAGGTAAC ACTAGATGAG ATGCAGCAGC AGGCAAAAAA AGAGGGTATA
AAAGAGGGTA TTAAGGAAGG CATAAAAGAG GGTATTAAGG AAGGAATAAA AGAAGGCGAG
TTGAAAAAGG CTTTAGAAAC AGCAAGGGCG GCCTTGAAAG AAGGTATATC GGTTGATGTT
ATATGTAAAA TAACTGCACT GGATAAAGAA ATTGTACAGA AAATTAAAAA CGAGTTGCAT
TAA
 
Protein sequence
MTESGQPNLK HPHHPHDKGY KQLLSNKKIF LELIKTFVQE DWVNEIEEDG LLLVDKSFVL 
EDFSEKEADV VYRLRTKEKD VIFYVLLELQ STVDFLMPFR LLQYMVQIWR ETYNNTQKEE
RDRKDFRLPA IVPAVLYNGK NNWTAKMNFR EMLSDYQIFG GRVLDFNYIL FDVNRYKDEQ
LYEMANLIAS VFVLDQTMNH KELIRRLRKL IMVLRKLTQD EFRQIMIWLK NVIKPRMPVH
LHKEVDRIME EANQLEVEFM IMNLEVTLDE MQQQAKKEGI KEGIKEGIKE GIKEGIKEGE
LKKALETARA ALKEGISVDV ICKITALDKE IVQKIKNELH