Gene Dtox_0641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0641 
Symbol 
ID8427579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp668319 
End bp669362 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content53% 
IMG OID645033008 
Productprotein of unknown function UPF0157 
Protein accessionYP_003190183 
Protein GI258513961 
COG category[S] Function unknown 
COG ID[COG2320] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000199394 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000137407 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAACATCG CATTACAACC CGCCGCCACT GTAGATGCCG AGGCTATGGC TGCCATTCAG 
AAAAAGGCCT TCAAGCGGCT ATACGACATC TACCACGACG AGGGTAACCC ATTCTTGCGC
GGCGTAGACG AAATCATGCA CTGGCTTGAA CGCCCGAACT GGAAGGTATT TAAAATTTTT
GCAGACGGCA TATTGGTTGG CAGTATCGCC GCCTGCGAGC GGAATGGGCG GCCCGGCGAG
TGCTATCTTG CGAGGTTGTA TGTCTTGCCG GAGATGCAGG GTAAAGGTAT TGCAAGCAGG
GCAGTTTTAC TCTGCGAGTC GAAGTTTCCG AACGCATCCC ATTGGTCTTT GGATTTTCCT
GCCGACCAAC CCGCCAATCG GCGCTGTTAT GAAAAAGCGG GATATTGGGA TACCGGCGAA
ACACGGGAGC AAAGCGAGGG AAAAATCACA CTTGCGCTCT ATGAAAAACG CATCCCCGCT
TTCCACGATA TCAAACAGAA TATGGACCAC CGCGCAGCGC TGTTTCCAAT TGTCCTGCGG
GAGTACAACC CCGCCTATCC CAGGTGGTTC GCAGAGGAAA AGGCGAATTT GACACGGCTG
GTTGGCGCTG AAAACATCGA ACGCATCAGT CATTACGGCA GCACCGCCGT TCCCGGCTTA
CTCGCCAAGC CTACGGTGGA TATTCTGCTT GAAATTAAGC CCGATACGGA TATAGATAGA
TTGAAAGCGG CGCTGCCAGA AGGTGAATAT GTATGTCTGC AGCCACCCGC ACTGACAATA
GACGAGCAGC CGCCTCACTT GATGATTCTC AAAGGCTACA CGCCCTGCGG TTTTGCCGAA
AAGGTGTTCC ATATTCATGT GCGATACGCC GGGAGATGGG ACGAGCTGAT TTTCCGTGAT
TACCTTATCG CTCACCCCGG AGCAGCGGCA GAATACGCGG CGCTCAAGCG TGAATTATTT
AAGCGATTTG AACTTGACCG GGATGGATAT ACCGAGGCCA AAAGCGCGTT TATTCGAGAA
GTTACTGAGA GGGCGAAGGA GTAG
 
Protein sequence
MNIALQPAAT VDAEAMAAIQ KKAFKRLYDI YHDEGNPFLR GVDEIMHWLE RPNWKVFKIF 
ADGILVGSIA ACERNGRPGE CYLARLYVLP EMQGKGIASR AVLLCESKFP NASHWSLDFP
ADQPANRRCY EKAGYWDTGE TREQSEGKIT LALYEKRIPA FHDIKQNMDH RAALFPIVLR
EYNPAYPRWF AEEKANLTRL VGAENIERIS HYGSTAVPGL LAKPTVDILL EIKPDTDIDR
LKAALPEGEY VCLQPPALTI DEQPPHLMIL KGYTPCGFAE KVFHIHVRYA GRWDELIFRD
YLIAHPGAAA EYAALKRELF KRFELDRDGY TEAKSAFIRE VTERAKE