Gene Dtox_0148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0148 
Symbol 
ID8427071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp164800 
End bp167097 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content47% 
IMG OID645032539 
ProductYD repeat protein 
Protein accessionYP_003189729 
Protein GI258513507 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAATGA CTGCGGCGGG AACCACCTAT CAGGACGTTC CCACCTACGC AGGGGAGCCG 
CTAACCTTAT CCGGCATGAT TAATACCAGC GGTGTCAGCG GCAGTGGTGC CTGCTATCGG
ATCGACTACT ATGACGCCTC GAACAACCTG ATCTCCGGGA CTTCCGTACA AACCGCCTGT
ATCAGCGGAA CTCAGGGCTG GACAAGAATG GCAAGTATGG CCAACGCTCC TGCAAATGCC
AATTACGCCC GGCTCCAATG CATACTAAAC GGCAGCGGTA CAGCCTATTT CGACGACGTG
AAATTGACAC CTATAAACAG CATGCAATAT ACTTACGATA AGACCAGCGA CTATACCAAT
CCCGGCAGCT ATACCGGAGG AAACTACATG ACCCTCTCGG AAGATGCCCT GGGTATTCAA
AATGCTTACG CATACGATGC AAACGTCGGC AACATGATAG GACATGCCGA TCCTCTAAAT
CATATTACCT GGTTTAACTA TGACACCCTG AACCGTCTCA TCAGAGTAAC AGATCCTTTA
AATCGCAAAG CCTATTACCA GTACGATCCC GTAAGCAACC TGATTTATAC ACGAGATCCC
AGAAGCGCGT CATCATCGGA CAACACCTAC AGCACATTCT ACGGGCCAAA CAACCTGAAC
CGGCTTTCTG CCCTGACCGA TTCACAGAAC CGGAGCGCCA CCTATACCTA TGACAGATCC
GGCAACCTGA CGGGAATTGC CCTGCCCAAC GGCCAAAGCG AAAGCCTGGA ATATGATAAC
GCCAACCGCC TGAGCAAAAT TACACTAAGT GATGGCAAAT ACTATAACTA TTACTATGAC
GGAGCCGGAG AACTGATCAG CGTTACAGAT CAAAACGGAG CCGGTTGCTC CTGGAACTAC
GACGGAGCAC ACAGAGTAAC CGGTACAACA GATCCCTTAG GCTATCAGCT TAACTACTCC
CTGGATAAAA GCGGCAATCT TACACTCCAA TCCGGCATCA ACTACAGTTG CCGCTACAAC
TATGACAATG GCAATAAAAT GTACAAAGTA TCCCTGCCCG GTGCAATAAT CTACTATGGC
CGAGATGACC AAGGGCGTGT TTTTAACGTT GAGTACAACC CGTCCTACAT TGTTAATCAC
CAACCGCATT ACGCCACCAG CCAAAGAATA ATCAACTATC TGGTCAACGG TTGGTGCAGC
AGTATTCAGG ATCAGTACTT TCCCTATCGA TCCGGTTACT CCTACGGTTA CTATGCTGAC
GGTACTATCT CCGGCTACAG CTCGTGGAAC GGCACACACA GTTTCAGCTA TGATGTTGAC
GGTAGGCTTG CCTCCTGGAC ACACGGAGGA ATTCAACAGA ATTACACATA TGATGCCGCC
GGCAACCTCA CGACCAAAGG AAACAGGACA TTTGCCTACA ACAACATCAA CGAGATCACG
AGTCCGGGCT TCACCTACGA TCAAAACGGC AACATGACCG GCGACGGCAG CTTCAATTAT
ACCTACAACG CCTTGAATCA GCTCGTCCGG GTCAATAAGG TATCCGACGG AAGCCTTGTG
GCCACCTATA CCTACAACCA CGACGGTACC AGGAGAAATA AAGTCACCGC TCAAGGAACA
ACCAACTACA ACTGGGATGC CTCCGGGAAC TTAATCAGGG AAATCGGCCC CAATGGTACC
TATTGTTACT ACTATCCCTT GGGTAAACTA ATCGCCTTCA AGAATAACCA GCAGTTGTAT
ATAGTGCACG ATAACCTGCG GGGTGATGTC ATCAGCTTAT CAATGACGGA TGACTACGGA
AACACAGATC AGGAAAACAT GTATGACTAC GACCCATGGG GCACTCCTAT CTGCGAGGAT
GAATCGGTAA AGTCACCCTT CCGCTACGCC GGTTATTACT ATGATACTGA GACGGGATTG
TATTATTTAA AGAGCAGGTA TTACAGCCCG GCGTTGGGGA GGTTTTTGAC GAGGGACGAT
CATAGTTATA TAAAGGATAA AGACCCACAA ACGATGAACC TTTATAGTTA TGCTGGTAAC
AATCCTGTAA GTAACGTAGA TCCGACAGGG GAGATTCCTG TTTACGCAAC CTGGAAAGCA
TTTGAAGACA AACTTGGGGA ATTACTTAAT ACTAGCAAAA ATTGGTCAAA AGGATATGGA
AACAGAATTG TTGATTACAT CACAAAGACA GGAGAAGCTT GGGAAGCCAA ATCGGGTGAG
TATATATCGA ATAGCCCACA GTTAAGAGAT TTTATGAGAC AATTTGGAGA TAAGTTTAGA
TTATATAGAA ACGATTAA
 
Protein sequence
MKMTAAGTTY QDVPTYAGEP LTLSGMINTS GVSGSGACYR IDYYDASNNL ISGTSVQTAC 
ISGTQGWTRM ASMANAPANA NYARLQCILN GSGTAYFDDV KLTPINSMQY TYDKTSDYTN
PGSYTGGNYM TLSEDALGIQ NAYAYDANVG NMIGHADPLN HITWFNYDTL NRLIRVTDPL
NRKAYYQYDP VSNLIYTRDP RSASSSDNTY STFYGPNNLN RLSALTDSQN RSATYTYDRS
GNLTGIALPN GQSESLEYDN ANRLSKITLS DGKYYNYYYD GAGELISVTD QNGAGCSWNY
DGAHRVTGTT DPLGYQLNYS LDKSGNLTLQ SGINYSCRYN YDNGNKMYKV SLPGAIIYYG
RDDQGRVFNV EYNPSYIVNH QPHYATSQRI INYLVNGWCS SIQDQYFPYR SGYSYGYYAD
GTISGYSSWN GTHSFSYDVD GRLASWTHGG IQQNYTYDAA GNLTTKGNRT FAYNNINEIT
SPGFTYDQNG NMTGDGSFNY TYNALNQLVR VNKVSDGSLV ATYTYNHDGT RRNKVTAQGT
TNYNWDASGN LIREIGPNGT YCYYYPLGKL IAFKNNQQLY IVHDNLRGDV ISLSMTDDYG
NTDQENMYDY DPWGTPICED ESVKSPFRYA GYYYDTETGL YYLKSRYYSP ALGRFLTRDD
HSYIKDKDPQ TMNLYSYAGN NPVSNVDPTG EIPVYATWKA FEDKLGELLN TSKNWSKGYG
NRIVDYITKT GEAWEAKSGE YISNSPQLRD FMRQFGDKFR LYRND