Gene Dtox_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3034 
Symbol 
ID8430024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3229053 
End bp3230174 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content45% 
IMG OID645035286 
Productprotein of unknown function DUF34 
Protein accessionYP_003192409 
Protein GI258516187 
COG category[S] Function unknown 
COG ID[COG3323] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR00486] dinuclear metal center protein, YbgI/SA1388 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.849304 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTAG CTAACAAAGA AATAGTTAAG CTGGTTGAAA ATTTGGCTCC TTTGAGGTTG 
GCCGAGGAAT GGGACAATTC GGGCTGGCAG CTGGGCGACC CGGGTGCCCC GACTGCTAAA
GTGATGCTGA CCCTGGATAT AACACCTCCT GTTGTGGAGG AAGCTGCCGC TGCCGGTGCC
GGATTGATTA TCAGTCACCA CCCAATGTTT TTAAAAGGGC TGAAAAACCT CTGCCTGGAC
CGGCCTGAGG GAAAACTAAT TGAGGCTTTA ATAAAAAAAG ATATAGCTGT ATATTCTGCT
CATACCAATC TGGACAGCGC GGCAGGCGGA GTGAACAGTG TTCTGGCTGA ACAGTTGGGC
TTAATTGAAG TCGATAATAT GCTTCCCGGT AAAGCGGAGC AGCTTTACAA ATTAGTAGTT
TTTGTACCGC TTGAGCAGGT TGAGCAAGTC AGAACCGCCA TTACAGAGGC GGGTGCAGGT
TGGATTGGCA ATTACCGGGA TTGTGCTTTT CAGTCAACAG GTATTGGTAC CTTTCGACCG
CTGGAAGGCA GCAAGCCTTT TATAGGACAA ACGGGCCTGC TGGAAAAAGT TGAAGAATTC
CGTTTGGAAA CTATCGTGCC GGAAAAAGAT AAAAAAAGCG TTATAGCAGC TATGCTTAAC
GCTCATCCTT ACGAGGAAGT TGCTTACGAT TTATATCCTC TGGCCAATAA TACAGCCGGA
CATGGTTTAG GTAGAATAGG TTGTTTGCCG CAGGAAGTAT CTCTAGGTGA TTTTGCTAAA
CTGGTAAAAA TGACCTTGCA GGTTGATGCT GTTCGTCTGG GGGGAAATGA ACACGGAAAA
CCTGTACGTA AAGTTGCTGT TTGTGGAGGG GCCGGGGCAT CTTTATGGAA GCAGGCTTTG
AGTAAGGGTG CTGATGTTTA TGTTACCGGA GATATTAAGT ATCATGAGGC TTTGGATATG
TCAACGGCGG GCCTAAGTTT CATAGATGCG GGCCATTTCC CCACTGAGAG AATTATTCTG
CCTGTTTTAT ATAAATATCT GATCAAAGTA TGCTCTAAGC ATAATTTTGC TGTGGATATA
TTGCTTTCTC AAAAGCAAAA TGATGTTTTT GTGTATGTTT AA
 
Protein sequence
MAVANKEIVK LVENLAPLRL AEEWDNSGWQ LGDPGAPTAK VMLTLDITPP VVEEAAAAGA 
GLIISHHPMF LKGLKNLCLD RPEGKLIEAL IKKDIAVYSA HTNLDSAAGG VNSVLAEQLG
LIEVDNMLPG KAEQLYKLVV FVPLEQVEQV RTAITEAGAG WIGNYRDCAF QSTGIGTFRP
LEGSKPFIGQ TGLLEKVEEF RLETIVPEKD KKSVIAAMLN AHPYEEVAYD LYPLANNTAG
HGLGRIGCLP QEVSLGDFAK LVKMTLQVDA VRLGGNEHGK PVRKVAVCGG AGASLWKQAL
SKGADVYVTG DIKYHEALDM STAGLSFIDA GHFPTERIIL PVLYKYLIKV CSKHNFAVDI
LLSQKQNDVF VYV