Gene Dtox_1494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1494 
Symbol 
ID8428451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1547438 
End bp1549135 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content54% 
IMG OID645033825 
Producthypothetical protein 
Protein accessionYP_003190981 
Protein GI258514759 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000835765 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAC GTCTCAACAT TATTCTTGTG TTTGTTCTGC TGATCTCCAT GCTTTTGCCT 
GTTACAGCCC TTGCCACCGA TGGTGAGGGC AACATAGACA ACGGCGGCGG CGGAATGGGC
AGCGGGACCA ATGAAAATTA CTGGAATGGC GGTGATGAGG GCGTAAGAGT CACAGTGATT
CGTGCCAGCG ACAGATCACC TGTTACCGTA CCTGTTGATT TCTCGAACAG GATGCCATCA
ATAGCCGTAC ATTTTGGAAA GGTATCAAAA ATATCATATA CAGTGGGCAG AAGCTTGTCT
CCTGTTACGA CAACGTATGT TTGTGAAAAG CCGGATATTG CTATGCCGAG AATAATAAGT
ACAAGCAGCG GACAGGCCAG TATCGATCAG ATAAAACAAT ACTTCTGCTC AGAGTATATG
GTCATGACGG TTGCTCAGAT TACAGGCATG AACTACGACA TTTTAACCAA CGGCGAATAT
AAGCTCCTGC TGGAACCAAT CGCCTACATG ACTTTTCAGG GTGTGAAAAT GGCGATGACC
GCGACCGAAG CCGCACTCTA CGACCAACAG CTGAACGGCG GCCTGCGGAG CAAGATGGTC
TCACTCAGCC ATAAGAATCT GCCCTTGGCA ATGTTTCTTG AAACGCCTGA TCTCGGCTAC
CCGGCATGGG GCGGCTCAAC GACCACGGCC GCGTCGAACA CAGACATTCT CTCATCCCTC
GGCCTCGGTA TTGTGCGTTT CAACGAAGCG GAGCCGGAGC CTCCGGAGGT AACGGCGGCT
GACTACGAGT ACAGGATTGA TACGGAGGTT GTCACATCGG TAACTGTTCG CGGAGGCCAG
GCAGATCCCG ACCGTCCTGT CGCGGTCCGC TTCACAATTG GAGGCCAGAC ATATAATGTC
GGCAGCATAT ACTACCCGGC AGGCGACAGC CAACTGGTAT GGGTGCGCTG GAGGACACCG
TCCACCCCGC AGACCATGAC CATTCATGTG TCAGTATCGG GCGGAGGCTC CGCAAGTCAG
GGGACCATTA CAGCGAGGAT CGTGGATCTC TCCGGAAACG AACCGCCCAA TCCGGTGGCC
GATGACCGGA ATAATTCCTA CACGTTGGCC CCGATACCCA ACAAGGCACA GAAAACCTCT
GCCTCCTGGG GCGTATGGCG TCCATGGTGG CATGCCCATT GGGTATGGAT TTCTACAGGT
GAAGATAGCG GCTATTGGGA GGACGAGGGC TGGTGGGAGT TCGACTGGCT TTCATATAAC
GCCAGCCTTT CATCCTCCAT GAATGTTGTA CCGGACGCCA AAGCCCCGAC CGCATCCGGA
AACACTTTGA AAAGCGGATA CGGCATCAAT CAGTCTGTCA CCGCCAATGT CAGCACCAAC
CAATCCTCGG CAGTTACCGA TGCGCAGACA GCCGTCACAT ATTTCCCCGA ATGGAGGTAC
GAAACGTATT GGCGGCAGCT GGAGCGCACG CAGTCCGGAT ACAGCTCCAA ATTCGAGTTT
AAGTCCAACA AATACTCAAC CTACAAGCGC CGGACGCATT TCACCCCCAT ATGGTTCCCC
GATGGGAGCT ATACACCGTA TACCTGGCTC ATCGACTGCT GGACTCCGGT CGGTATGCTT
TCCATGAACC TCACCGACTC GGTGACCATC CGGGGCAGCC TTTGGGATGA CTGGCACATT
GCGCCAGTGA AACCGTAA
 
Protein sequence
MKRRLNIILV FVLLISMLLP VTALATDGEG NIDNGGGGMG SGTNENYWNG GDEGVRVTVI 
RASDRSPVTV PVDFSNRMPS IAVHFGKVSK ISYTVGRSLS PVTTTYVCEK PDIAMPRIIS
TSSGQASIDQ IKQYFCSEYM VMTVAQITGM NYDILTNGEY KLLLEPIAYM TFQGVKMAMT
ATEAALYDQQ LNGGLRSKMV SLSHKNLPLA MFLETPDLGY PAWGGSTTTA ASNTDILSSL
GLGIVRFNEA EPEPPEVTAA DYEYRIDTEV VTSVTVRGGQ ADPDRPVAVR FTIGGQTYNV
GSIYYPAGDS QLVWVRWRTP STPQTMTIHV SVSGGGSASQ GTITARIVDL SGNEPPNPVA
DDRNNSYTLA PIPNKAQKTS ASWGVWRPWW HAHWVWISTG EDSGYWEDEG WWEFDWLSYN
ASLSSSMNVV PDAKAPTASG NTLKSGYGIN QSVTANVSTN QSSAVTDAQT AVTYFPEWRY
ETYWRQLERT QSGYSSKFEF KSNKYSTYKR RTHFTPIWFP DGSYTPYTWL IDCWTPVGML
SMNLTDSVTI RGSLWDDWHI APVKP