Gene Dtox_4028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4028 
Symbol 
ID8431042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4205187 
End bp4206209 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content41% 
IMG OID645036237 
Productputative transposase 
Protein accessionYP_003193335 
Protein GI258517113 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAAA GTGGCCAACC TAACCTAAAA CATCCACACC ATCCCCATGA CAAGGGTTAT 
AAGCAGCTTC TCAGCAATAA AAAGATATTT CTGGAATTAA TCAAAACCTT TGTGCAAGAG
GATTGGGTAA ACGAAATAGA AGAAGATGGG TTGCTCCTGG TTGATAAGTC ATTCGTGCTG
GAAGATTTCA GTGAAAAAGA GGCAGATGTG GTGTATCGGC TCCGGACGAA GGAAAAGGAT
GTCATATTCT ACGTACTGCT GGAGTTACAA TCTACTGTAG ATTTTCTGAT GCCCTTTCGT
TTACTGCAGT ACATGGTGCA GATATGGAGA GAAACGTACA ATAACACACA GAAAGAAGAA
AGGGATCGTA AAGACTTTAG GCTGCCGGCT ATAGTACCGG CGGTATTGTA CAATGGCAAG
AACAACTGGA TGGCAAAGAT GAATTTCAGG GAAATGCTGT CTGACTATCA AATATTCGGT
GGGCGGGTAC TGGACTTTAA TTACATACTG TTTGATGTCA ACAGATACAA AGATGAGCAA
CTGTACGAAA TGGCAAACCT GATAGCCAGC GTATTTGTTC TGGACCAGAC AATGAACCAT
AAGGAGTTAA TTAGACGCCT GAGGAAATTA ATCATGGTAT TGAGAAAACT AACTCAGGAT
GAATTCAGGC AAATTATGAT TTGGCTCAAA AATGTTATCA AGCCGAGAAT GCCGGTCCAT
TTGCATAAGG AAGTTGACCG CATTATGGAG GAAGCCAACC AGTTGGAGGT GGAATTTATG
ATTATGAACC TCGAGGTAAC ACTAGATGAG ATGCAGCAGC AGGCGAAAAA AGAGGGTATA
AAAGAGGGTA TTAAGGAAGG CATAAAAGAG GGTATTAAGG AAGGCATAAA AGAAGGCGAG
TTGAAAAAGG CTTTAGAAAC AGCAAGGGCG GCCTTGAAAG AAGGTATATC GGTTGATGTT
ATATGTAAAA TAACTGCACT GGATAAAGAA ATTGTACAGA AAATTAAAAA CGAGTTGCAT
TAA
 
Protein sequence
MTESGQPNLK HPHHPHDKGY KQLLSNKKIF LELIKTFVQE DWVNEIEEDG LLLVDKSFVL 
EDFSEKEADV VYRLRTKEKD VIFYVLLELQ STVDFLMPFR LLQYMVQIWR ETYNNTQKEE
RDRKDFRLPA IVPAVLYNGK NNWMAKMNFR EMLSDYQIFG GRVLDFNYIL FDVNRYKDEQ
LYEMANLIAS VFVLDQTMNH KELIRRLRKL IMVLRKLTQD EFRQIMIWLK NVIKPRMPVH
LHKEVDRIME EANQLEVEFM IMNLEVTLDE MQQQAKKEGI KEGIKEGIKE GIKEGIKEGE
LKKALETARA ALKEGISVDV ICKITALDKE IVQKIKNELH