Gene Dtox_3800 details

Gene Information

Locus tag: Dtox_3800
Symbol:
ID: 8430810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 3976350
End bp: 3977525
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 42%
IMG OID: 645036026
Product: protein of unknown function UPF0027
Protein accession: YP_003193129
Protein GI: 258516907
COG category: [S] Function unknown
COG ID: [COG1690] Uncharacterized conserved protein
TIGRFAM ID:
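
The length fields above agree with the coordinates, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3976350
end_bp = 3977525

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1176
print(protein_length)  # 391
```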


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAGG TGTATACCGA TGTTCTAGAA GAAGGTGCTG CTTCTCAGAT TGAAACTCTT 
TGCAATCAGG AGTTTGTAAA AGAAAGCAAA ATCCGGATTA TGCCGGATGT ACATGCCGGA
GCCGGTTGTA CTATTGGGAC TACCATGACC ATTGCGGATA AAGTGGTACC TAATTTAGTT
GGGGTGGATA TCGGTTGCGG CATGGAGACA ATTTCGATAA CTAATAAGCG ACTGGAACTG
CAAAAGTTAA ATAAGTTGAT TTATGAAAAA ATACCGTCAG GATTTGAGGT TAGGAAGACA
CCGCATCGCT ATAACGAGCA AATTGATTTA ACACAGCTCC GTTGTCACAA GGAAGTAAAA
TTGGATAGAG CAGAAAAGAG TATTGGAACA TTGGGTGGCG GCAATCATTT TATTGAAGTA
AACAAGGATG AGGAGGGCAA CCTGTATGTT GTGGTTCATT CGGGCAGCCG TCACTTAGGC
CTTGAAGTGG CAAAATATTA TCAAGAAGCT GGTTATAAGC AGTTAAACCG TAATGACAAA
TCCGACATTG AGGCACTTAT TGCCCGCTAT AAATCGGAAG GACGTGATAA AGAAATCCAG
AACGCTTTAA AGGAATTCAA GAATCAAGTA TTGACAGATA TTCCGTTTCC TCTTGCTTAT
GTAACCGGCA GTCTATTCGA GGACTATATA CATGATATGA ATATCGCCCA GCAGTTTGCA
GAACTGAACC GCAAAGCGAT GATTACAGAG ATCGTAAAAG GCATGAAATT GGACGTAGTG
GAGCAGTTCA CTACAACTCA TAACTACATT GATACCGATA CGATGATCTT ACGGAAAGGA
GCGGTATCTG CAAAGAAAGG GGAGAGGCTT TTAATTCCGA TCAATATGAG GGATGGCAGT
CTTATTTGTA TTGGCAAAGG CAATGAGGAC TGGAACTGCT CTGCTCCCCA TGGCGCAGGA
AGGCTCATGA GCAGGACCAA GGCAAAACAA AGTTTTACTG TTTCGGAATT TAAAAAACAG
ATGAAGGATG TCTATACTAC ATCTGTCAAT AAAGAAACCT TGGATGAGTG TCCTATGGCC
TATAAGAATA TGGATGACAT CGTAAACAAT ATCGGGCCGA CGGCAGATAT TGTGAAAGTT
ATTAAACCGA TTTATAACTT TAAGGCGGGG GAATAG
 
Protein sequence
MAKVYTDVLE EGAASQIETL CNQEFVKESK IRIMPDVHAG AGCTIGTTMT IADKVVPNLV 
GVDIGCGMET ISITNKRLEL QKLNKLIYEK IPSGFEVRKT PHRYNEQIDL TQLRCHKEVK
LDRAEKSIGT LGGGNHFIEV NKDEEGNLYV VVHSGSRHLG LEVAKYYQEA GYKQLNRNDK
SDIEALIARY KSEGRDKEIQ NALKEFKNQV LTDIPFPLAY VTGSLFEDYI HDMNIAQQFA
ELNRKAMITE IVKGMKLDVV EQFTTTHNYI DTDTMILRKG AVSAKKGERL LIPINMRDGS
LICIGKGNED WNCSAPHGAG RLMSRTKAKQ SFTVSEFKKQ MKDVYTTSVN KETLDECPMA
YKNMDDIVNN IGPTADIVKV IKPIYNFKAG E
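
The gene and protein sequences above can be cross-checked by translating codons. The following is a minimal sketch in Python, assuming that translation table 11 assigns amino acids to codons as the standard genetic code does (the two differ in which codons may act as start codons). Only the first three and the last four 10-base blocks of the gene sequence are used here, and the `translate` helper is a hypothetical name introduced for illustration.

```python
from itertools import product

# Codon table built in TCAG order (third base varies fastest).
# Assumption: table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# First three blocks and last four blocks of the gene sequence above.
first_blocks = "ATGGCGAAGG TGTATACCGA TGTTCTAGAA"
last_blocks = "ATTAAACCGA TTTATAACTT TAAGGCGGGG GAATAG"

print(translate(first_blocks))  # MAKVYTDVLE
print(translate(last_blocks))   # IKPIYNFKAGE*
```

The two outputs match the first ten and the last eleven residues of the protein sequence, with the trailing `*` marking the TAG stop codon.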