Gene Dtox_3156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3156 
Symbol 
ID8430150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3356340 
End bp3358067 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content31% 
IMG OID645035404 
Producthypothetical protein 
Protein accessionYP_003192523 
Protein GI258516301 
COG category[S] Function unknown 
COG ID[COG5293] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00485551 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCTAA AGAGATTAGT GATATCAAGC CCAACAAAAC TAATTCGTAA TATTGAATTC 
AAGTTGGGTA TGAATTTGAT TGTTGATGAC ACACCGGTTG ATGATATTAA ATCAACCGGC
AACAATGTTG GTAAAACCAC TGTTCTAAAA CTTGTGGACT TCTGTTTAGG GGCAAAACCA
AACATTATTT ATACCGACAC TGAAAACAGG AAAGAAGTTT ATGACGTTGT AAAGAATTTT
GTAATTGATG AAAAAATTGA AATAACACTT ACTTTGACAG ATAGTTTAAG TACATCATGT
GGAAGAGAAG TAGAGATTAG AAGAAATTTC TTGTCAAGAA AAAATGCTGT CAGAGAAATT
AATGGTGGGC CTGTATTAGA CAAGGATTTT GAAGATGAAT TAGAAAGACA TATTATGCCT
GATAAAGAAG TGGAAAAACC CACTTTTAGA CAAGTTATCT CGCATAATAT TAGGTATAAA
GATGATAATA TAAACAATAC TTTAAAAACT CTTGACAAAT ATACGACAGA TGTCGAATAT
GAAACTTTAT ATTTATACTT ATTGGGTTGT TCGTTTGGAG ATGGTGCTTG GAAACAGGCT
CTTATAACAA TGATTAACCA AGAAAATGCT TTTAGAGAAA GATTAGAACA AAATCGAGAC
AAAACGACTT ATGAAATAGC CCTGTCAATG ATTGATGATG ATATTGCTAT GTTGAATGAG
AAAAAGGCTT TGTTCAATCT CAATGAAAAC TTTGAACAAG ATATGGAGCA GTTAAACTCA
ATCAAGTATA AAATAAATAA AAACAGTTCC TTGATCAGCA AGATAGAAAT CAGGAAAAAT
TTAATTGAGG AATCTGTTCA AGAATTGAAG CAAAGCCAGT CATCTATTGA CCTATTACAA
CTAAAAATTC TGTATAACGA AGTTAATATG AATATTTCAG GTATTCAAAA GACTTTTGAA
GATTTAGTAA CGTACCACAA CAAAATGCTG GTTGAAAAGT CACGGTTTAT TTCAAAAGAA
CTACCAGAAT TAACTGACAA TCTAAAGCAT GCCCAACAAG AATTGGCATT GTTGTTACAG
CAAGAAAAAG AACTTTCGAG CAAAATTTCC AAAAGCGATT CATTTGAGGA ATTAGAGGAA
ATAATTATAT CCTTAAATGA AAAGTATCGG ACAAAAGGGG AATATGAAAG TATTATCTCA
CAGGTAAATG AAGTCGAAAA TAATATTGCA AAACTAAATG AAAAAATAGA AAAAATTGAC
AAATACTTGT TTTCAAATGA CTTTGAAGAC CTCTTAAAAG AGCAAATTAA GAAGTTCAAT
AAATTTTTTT CAAAAATATC CCAAGAATTA TATGGCGAGA AATACGCGTT AACTTATAAA
AAAGATATTA ACAAAAAAGG ACAGCAGGTA TACAAGTTTA ACGCATTTAA TGCAAACATG
AGTTCCGGAA AGAAACAGGG TGAAATATTG TGTTTTGATT TGGCATACAC TATGTTTGCT
GACGAAGAAA ACATTCCCTG TCTCCATTTT TTACTTAATG ATAAGAAAGA ATTAATGCAC
GACAACCAAT TAATAAAGGT TGCAGAGTTC GTTCGAGATA ATAATACCCA GTTGGTATTA
TCGATTCTAA AGGATAAACT CCCTGAACAA GCGTTAAATA CGGCTCATAT TGCCGTAGAA
TTGTCCCAAA AAGATAAGTT GTTTAGAATC GAAACAATGG ATAATTAG
 
Protein sequence
MYLKRLVISS PTKLIRNIEF KLGMNLIVDD TPVDDIKSTG NNVGKTTVLK LVDFCLGAKP 
NIIYTDTENR KEVYDVVKNF VIDEKIEITL TLTDSLSTSC GREVEIRRNF LSRKNAVREI
NGGPVLDKDF EDELERHIMP DKEVEKPTFR QVISHNIRYK DDNINNTLKT LDKYTTDVEY
ETLYLYLLGC SFGDGAWKQA LITMINQENA FRERLEQNRD KTTYEIALSM IDDDIAMLNE
KKALFNLNEN FEQDMEQLNS IKYKINKNSS LISKIEIRKN LIEESVQELK QSQSSIDLLQ
LKILYNEVNM NISGIQKTFE DLVTYHNKML VEKSRFISKE LPELTDNLKH AQQELALLLQ
QEKELSSKIS KSDSFEELEE IIISLNEKYR TKGEYESIIS QVNEVENNIA KLNEKIEKID
KYLFSNDFED LLKEQIKKFN KFFSKISQEL YGEKYALTYK KDINKKGQQV YKFNAFNANM
SSGKKQGEIL CFDLAYTMFA DEENIPCLHF LLNDKKELMH DNQLIKVAEF VRDNNTQLVL
SILKDKLPEQ ALNTAHIAVE LSQKDKLFRI ETMDN