Gene Dtox_1963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1963 
Symbol 
ID8428945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2091509 
End bp2094538 
Gene Length3030 bp 
Protein Length1009 aa 
Translation table11 
GC content50% 
IMG OID645034291 
ProductS-layer domain protein 
Protein accessionYP_003191422 
Protein GI258515200 
COG category[S] Function unknown 
COG ID[COG1520] FOG: WD40-like repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000481253 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0221131 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAAA AGAGGATTTT ACTTCTATTT TTTGCACTTT TGTTTGTTTG CCTGCTGATT 
CCGGTTGCGG GATGGGCCGC CGGAGGCACA TGGCCGCAGT TTCAAAATGA TCTCTACAAT
GATGGACTGA CCACTGATTC GGCTCCGCTG AGCAGCGCTT CTGTAGCCTG GCAGCAGCAG
GTTGGGAACA CCTCTATGGC GGGCATTAAC CATACCCCCC TGGTTGCCGG GGGCAGTGTT
TTTGCCATGG ATGCATTTGG AAAGTTATGG TCATTTGATA TGAAAACAGG GACTGAGAAA
TGGTCCACTC AATTGAGCTG CTCTATAAAG CAATTTCAAC TTTCCACGCC GGCCTATGAC
AGCGGCAAGC TGTATGCGGC CACCAACGAT GGGCATGTTT ATGCTCTGGA CGCCGGCAGC
GGGAGTATTT TGTGGGATAT ACCGCTGCAG TTGGCATCAA AGTACAGCCA ACTGAACACG
CCGGTTAAGT ATGTTGACGG GAAGATTTAC ATTGGAGCCT GGAATCCTGA TTCCGCTTCT
AACGAGTTTT ATTATTGCCT GGATGCTGTT ACCGGTGCTC CGGGAATTGG CGGTCAGTAC
CAGATGCAGA ATTCGGCTTG CGGAGGTGGA TATTACTGGA CCGGCGCTTG TATTGTGGGT
AAGTACATGA TTTTTGGTTC CGAGCGGTCG ACACTTACCT GTCTTGATAA AGATACCGGT
GCCTTGATTT CTTCCGTGAG TCTGAAGGTA TACAGCAGCG GAGCCAAGGA AATACGCTCC
TCCGTAAGCT ATGATTCCAC AACAGGCATG GTCTTTCTGA CTGATCAGGG CGGTAATTGC
TGGTCCTTTA AATTCGACTC GGACAGCGGT AATCTGATCT ATCTGTGGAA CAAGACTATT
GATAAAACCA GCACTTCGAC CCCGGCGGTA TATGACGGCA AGCTTTACGT GGGCAGCGGA
ACCTATGCTT CCAGGGGCGG TCTTTACTGT CTGGACGAGC AGACAGGAAG CGAGCTTTGG
AACTTCATGC CCACCGGTGA CGGGGAAGTT TCGGTACCGG GCGTACAGGC ATCACCTGCA
ATAGCCGTGC AAAATGGAAC CCCGTACATT TATTTTGCCT CAATTTGTGA GAATTCTCTG
GTTTATTGCC TGGATCAAAA CGGTAATCAA GTCTGGCAGT TTGCCGTTCC CAATTATACT
TACACTACTA CCAGCATCGC CGTGGCTGAT GGTTGGTTGT ATTTGGGCAA TGACTACGGG
TGGTTGTATG CTTTAAAGGG AACAGCAGTT CCGGTCACGG GAGTTTCTCT TAACAAAACC
ACCGACACCA TTACTGTCGG AGCAACAGAC ACGCTCATAG CCAATATCGC TCCGGTTAAT
GCCACAAATC AGAGCGTGAC CTGGGTTTCT GACAATACTG CAGCAGCTGC CGTGGATCAG
AGCGGAAAGG TTACGGCGGT GGCTCAGGGG ACAGCCAATG TTACTGCTAC CACGGCGGAC
GGTGGTTATA CAGCAACTTG TGTTGTAACA GTCACCGGTG GTGGTGGCGG TGGCGGCTCA
TCGACAACGA GCAAGGTGAA TCTTGTCATT AAAGATAAAA ACGGAAGTAC TCGCTTTAAT
AACAATATAA GTGTTCAGGC GGGAGATACC GTTATGGATG TCTTATTTGC CGCGGCAGAG
AAGGATTCGG CTATCGATCC CCAGGTGGAA TGGGAAAACG CCTACATTAT GGGGGGTTAT
GTATGGAGCA TCTACGGTGT GGAAAGTCCC TGGGGGAAAA TGTCTGAAGG ATGGGTATTT
CAGGTAAACG GTGTCATGTC AAATAAAGGT GCGGCCAAAT ATATCGTCAG CGATGGCGAC
AACATTTTAT GGGAATGGAG TGCAATGGAA CCGGTTACGG GAATTACTCT GGACAAGACC
AGCAGCACTG TCAATGTGGG CGGCACTGTC CAATTGAATG CCAATATTAA ACCAGCAAAT
GCCTCCAATA GGAGTATTAA CTGGACCTCC GACAACACTG CAGTAGCTAC CGTCGATAGT
GACGGCAAGG TTACCGGCGT GTCCGCCGGC TCCGCCAAAA TCACTGCCGC CACTGCTGAC
GGCGGTTATA AGGCGACCTG TGTGGTTACT GTTCAGGCCG CAGCCGGAGG ATCGGCCTCA
ACGACAGGGT CAGGAATAAG CTTGAACAAG ACCACTGACA CGATTAAAAT TGGAGCCACA
GATCAATTGA CAGTGACGAT CTCTGCAACC GGTGTTTCAG ATCAGGATAT CAAATGGGCA
TCCGACAATA CCACAGTGGC TACCGTCGAT AGCAAGGGCA TGGTTACCGG AGTTTCCGTC
GGTACCGCTA AAATCACCGC CGCAACGGCA GACGGCAGAT ATACAGCGAC TTGTGTGGTT
ACTGTTCAGT CTGTTGAGCC GGCGCAGCAG GTTCAACAGA CCGTTCAGCC GCAATCTCAA
ACTCAATCCC AGTCTGCAGT TGCATTTGAA GACCTGCAGG CAGGCTATTG GGCCAGGGAA
GCTATTGAAT ATATGGTTGC CGGAGGTTAT CTGAAGGGAT ATGAGGATGG TACTTTCAGG
CCTGATCAGC CCATAACCAG GGCGGAATTT ACTGCTTTAA CGGTGAAAGT AATGGGTTTG
CAGGAAGCAG ATGGCAGAGA CATATTTAAG GATGTGCATT CCGGTGACTG GTATTACGAT
ATTGTGAACA TCGCCTTTAC ACATGATTTG GTTTCCGGCT ACGGGGATGG CATGTTTGGC
CCTAACGAAC CGGTTACCCG GGAACAGGTA GTGTCAATGA TCAGCCGTGT TTTAGCGCAA
AAAGAGGGCC AGCAGAAGGA GACAGCAGTA AAAGATGAAA TATTGCAGCA ATTCAATGAT
GCCGGGGAGA TTTCCGATTG GGCCCGGCCT GCTGTGGCCA TAGTGATCAA CAAGGGTATA
GTCAATGGAT ATGAAGACGG TACCTTCAGG CCGAATTCGC CCGCTACCAG GGCCGAATGT
GTAGTAATGC TCAGAAAGTT GCTGCCCTAG
 
Protein sequence
MSKKRILLLF FALLFVCLLI PVAGWAAGGT WPQFQNDLYN DGLTTDSAPL SSASVAWQQQ 
VGNTSMAGIN HTPLVAGGSV FAMDAFGKLW SFDMKTGTEK WSTQLSCSIK QFQLSTPAYD
SGKLYAATND GHVYALDAGS GSILWDIPLQ LASKYSQLNT PVKYVDGKIY IGAWNPDSAS
NEFYYCLDAV TGAPGIGGQY QMQNSACGGG YYWTGACIVG KYMIFGSERS TLTCLDKDTG
ALISSVSLKV YSSGAKEIRS SVSYDSTTGM VFLTDQGGNC WSFKFDSDSG NLIYLWNKTI
DKTSTSTPAV YDGKLYVGSG TYASRGGLYC LDEQTGSELW NFMPTGDGEV SVPGVQASPA
IAVQNGTPYI YFASICENSL VYCLDQNGNQ VWQFAVPNYT YTTTSIAVAD GWLYLGNDYG
WLYALKGTAV PVTGVSLNKT TDTITVGATD TLIANIAPVN ATNQSVTWVS DNTAAAAVDQ
SGKVTAVAQG TANVTATTAD GGYTATCVVT VTGGGGGGGS STTSKVNLVI KDKNGSTRFN
NNISVQAGDT VMDVLFAAAE KDSAIDPQVE WENAYIMGGY VWSIYGVESP WGKMSEGWVF
QVNGVMSNKG AAKYIVSDGD NILWEWSAME PVTGITLDKT SSTVNVGGTV QLNANIKPAN
ASNRSINWTS DNTAVATVDS DGKVTGVSAG SAKITAATAD GGYKATCVVT VQAAAGGSAS
TTGSGISLNK TTDTIKIGAT DQLTVTISAT GVSDQDIKWA SDNTTVATVD SKGMVTGVSV
GTAKITAATA DGRYTATCVV TVQSVEPAQQ VQQTVQPQSQ TQSQSAVAFE DLQAGYWARE
AIEYMVAGGY LKGYEDGTFR PDQPITRAEF TALTVKVMGL QEADGRDIFK DVHSGDWYYD
IVNIAFTHDL VSGYGDGMFG PNEPVTREQV VSMISRVLAQ KEGQQKETAV KDEILQQFND
AGEISDWARP AVAIVINKGI VNGYEDGTFR PNSPATRAEC VVMLRKLLP