Gene Dtox_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0422 
Symbol 
ID8427357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp433750 
End bp438939 
Gene Length5190 bp 
Protein Length1729 aa 
Translation table11 
GC content53% 
IMG OID645032804 
Productcell wall/surface repeat protein 
Protein accessionYP_003189982 
Protein GI258513760 
COG category 
COG ID 
TIGRFAM ID[TIGR02543] Listeria/Bacterioides repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00920532 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000607429 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCGAA ATATTTTAAG AAAGGCACTC TCGCTGCTGT TGGCACTAGT GTTGCTGGTA 
CCTTTGGCAT CCCCGGCATT TGCCGCCGAG GATATAGAGG AAAGCACCTT GCAGTCGGAT
GCCCCCATGA TCCTGGGCGC AGGGAATGGA CTTGCCGTGT GGGCAGGCGA GAAGCTGGCT
GGCGGCGTGG TAAGCAGTAT CGCCGGCTAC CCCATGAACC AGGCTATGGG CAAGATTTTC
GGGACACAAA CGGATCAGGT CTTGCAAGCC CTTGAGGCAA TCAAGGTTCA AATTCAGGGT
CTTAAAAATG ATATCGCAGA GATGTCCAAA AAGCTTGACA GACAGGAGCT TCGAAATTTC
TTGAACGGCT ACGGGGATTC CATCAACAAC TACATCAGCG TTTACAATGA GCTTAGCAGT
GCGCAAAAAA TAACCGATCC CGCCATGTTA AAAATCCTTT TTGAAAAAAT ATTTAAAGGG
GAGGATCCGA ACTACATGGT CGGCGGCAAT TCGCTGAAAA ACGCTACCAT AAACCTCGGA
AATCAGCTAA GGATCGTAAA AGCAGTGGAG GGCGGTTCTT GCAATATCTT CGGTGCCATT
GACCTCTATG ACCGATACGT GAATATCTGG GAGCATCAAG GTTATGTAAT GCGGGAGGAC
TTCCGCAACA AGCAGCTGTC GATATACACA TTGTTCTCAG CCATGAGCCA GCTTGCCTGT
CAGTCGATTA TCGATAACAA TCAAGGAGAC ACTCCGTCGG CCAGGCTCGC AAAGTACCAA
GCTGAGAGTC GAATGAAGGA CTTGAAGGAC GATGCCGAGC TGATGGATAA GATGGTCAAA
CGCACCTCCG TCGTCAATCC TGACAATACC ATCAGCGTTT ACCGGCACCC CAACCTGCGC
ATCGCCCGTG ACATCAAGCA GGGGATTGAC CTGTGTGCGT TCTCCCCCGA CATCAAAGTC
GCATATGTGA CCAAGAATGC CTGGGGGGAT GAAGCCGCTT GGGCGGACTT TTCGCAAAGT
GCAGAGCGGT ATACCTCCGT GGATTACCTC ATGAGCTTGA CGAATGACAA AACTGCCCGG
CGCACTGCAT GGAGCGGGGG CTGGGAGGTG TACACCAACC AGCCGACCCC TGCCGAGTAT
CAAATACTTC GGGAAAATTA CGGCGGGCAG CAATCTTTGT ATAATATCTT TTTTGACGCG
GACAAAGGCA ACTTTAATAA TGTTGGGAAT AGACCTGCGG AATTAGCCTT TTTGTGCAAC
CACTATGTCT CTCGGAATTG CCGCGACGGC TGGATCAACT GGGAATCCAA CAACCATGTA
GGGAACAATG GAGAGGTTTA TGGATGGTGG AAGTTTTCTG CGGCCCGGTT CATTTCCGAC
ACGTATGCCG GTAGCGCCCG CGATAACTGG TTCGATCCCT CAAACGCCTT CATCGTCAAT
AAATACTTGG GTGAGGTGAA GCAGGGCTCG CTTCAGGGGC CTAATTCCCC GCCGGACATC
GAATACAAGG CGACAATTTC GGGCATGGAT ACGGAATATG AGGTAGGATA TGACAGCGGC
ATCACTCTCG AAATTGACAA AACCGGTGAC GCGTATCAGT GGGTTGTCAA CAAGAACGAT
GACAAGGGCT TTGTGGAGCT TGAGGGAGAG ACGGGCAAAA CCTACTCCAT TAGCGACGGA
CTTACCTCCG ACATGAACGG CTGGCAGTAC AGTTGCACAG TTATAGATAA CCCGGCGGAG
CCTGATGGAG AGCCAACCTA CACCCACGCG CTCCCCGTGA CGCTCAACCT CACAGGCGAT
GGCATATCCG ACCCGGTGAC CGAGCATGAG GTGGGTGATG CAGATGCGCT CAAGACCGCG
CTGGACAAGG TGGATTCCGG TGAATGGAAT CAGCACACCT TGAAGCTGGC GGATGACATT
ACCTATCCAA ATCCCATTGC ACCGGATGGT ATGTGCAGCG TGACGCTCGA CTTAAACGGG
TACACGCTGA CAGTGCAGCC CGGAGCGAAT GCCGAATCGA ATGTGAATGC CATGTCCAAC
AATCCGCAGA TCGCCGCGAT TTGCTTAAAT CAAGACCAAC TGACAATTGA GGATAGCTCA
ATTGACGGTT TGGGCTTACT GAACATTGTC GCCGGGCCGG GAATTGAATA CGGCATCTAC
GCGGCAAATG ACAGCGGATT CGATGGTTAT GACGCAGTGA CAGAAGAAGC TGTCACTGTG
ACCTCCACGG TTGGCGGCAC CGCGATCTAC GCTGCCGATG ACAGCTTTGT CGGCGTAAAA
GGCAGCGTCC GGGCGGAGGG CGAGGATGCC TATGGCATAG AGTGCTTAAA CAGCGGCAGC
GCTGTCATCG TGGACAAAGA CGTGACGGTT TCGGGCAAAT CCGCCTGCGG CGCATATGTG
GCGTCCTTCG ACGGCGGTAC CTCAACCCAA GTCGTACGTA TATTTGGCGA CCTCACAATA
TCCGGCGAAA ATAGCCGCGC TGCGCTTTTG GACGCGGAGG CTGAACTTAC CGTAAGAGGC
AGCGTAACGG TGACGGGCGG CAGAGAAGGC ATCAGCGTAA GCAATGGTTA TGTTGAGACT
TGGGGCAACG TCACAGCGCC GGATTATGCT ATCAACGCCC GGAGTAATAA AGCTAGCGTG
GTGGTGCTCG GCAATATCTC TGTGACGAGG GAGAATGCCG TGGCGGTGTC CTTTGTTGGC
GCCGAAATCC GTGTGGGCGG GAATGTCAGC TCGTCAAATA GCGGCGGAGT CGGCATACTG
GCGGCAACTT GGATGGATCC CGACGACGGC GACGTCAAGT GTGGCGCATC GGTTACGGCA
GAAGGCAAAA TTGTTGCCGT CACGCCCCTG CTGATTGAGA GCACGCCCGT GACCGACGAG
AGTGAAAAAG CCAGCACGGA CCTTGATTAT AATGTATTTA CCATACCGGG AAAAGGCACA
AGTGCTGTCA AGGCAATGCC CGGCGCATTT GAGGTAAAGG CCACCTCTCT TGTCACCTTT
GACAAAAACG GCGGCGACAC CGAGGCAAGC CCCAATACAA AAGCAGTCAT CACAGGCGGC
AAGGCGGGTA TCCTGCCAAC CGCACCTGGG AGGAGCGGAT ATACCTTCAA CGGCTGGAAC
ACGCAGGTCA ACGGAAGCGG AACTGCATTT ACGGCGAATA CCGATGTCAC CGGGCCGATT
ACTGTGTATG CACAATGGGA GTACAGCGTA GTAAATGCAG CCATCAGCCC GGCAAGTGCG
AGTTATGACC TGAACAGCCC CGGCGATGTA AGTACCGCCA TCATATGGAA CAGCGCTTCT
ACGATTACGG ATGTGGTATA TGGGACAACG TCTTTGACAA CACCTGCTGC ATATATCGTT
ACCGAAAGTG CGCTAACCAT CAAGAGCAGC TATCTTGAGG AACAGGGTTT TTCCGAAGGA
AATACGGCAG AATTTATGAT AGACTTTGAC AAAGGCGATT CTGCCAAGCT TACTGTTAAC
ATCGTAAACA ATTATATACT TTCCGATGAT GCCGGTTTGA GCGATTTGAA AGTAGGAGGT
AGCACAGTAA GCGGTTTTGA TCCGAATGTT TTCGCATATA GTGTGGAACT GCCATATGAC
ACCCTGCCCG GCAGCCAAGC TGCGACAGTA AGCGCAGACG TTTATAATAC AAAGGCTGTC
GCAAGTATAA CACAAGCGAT ATCCCTGCCC GGCAGTGCAG CCGTGAAGGT AACCGCTGAG
GATAAAACAA CTACTAAGAC TTATACCGTC AACTTCACGC TGGAACAAGC ACCTACTACC
TATACCATCA CCGTCCAGAC TGACGGCAAC GGTACGGCAA ACGCTAATAT TAGCTCTGCA
TCACAGGGAA CACAAATTAC TCTGACTGCC AATGCGAATA ACGGCTATCG GTTCAAGGAA
TGGCAGGTCA TCGACGGCGG TGTTACAATT ACCGGCAACA AATTCAATAT GCCCGCCTCA
AATGTGACGG TAAAGGTGAT TTTTGAATAT AAATCAGGCG GTGGCTCCGG CGGTGGCGGC
AGCAGTATTC CGACCACACC GACCATCCCT GCGGCAGGTG GCACGGTTTC TGTGAACTAC
ACCGCCTCAA ATGGCACGGC CTCCCTCTCG CTGCCCATGG TCAAGGTCAA CGAAATCATC
GAAAAGAGCA AGAGCAACGA AGCGGTACTC GACCTCTCCA AGGTAAGCGG CATCACGGCG
GCGTCCATTC CCAAGGATGT GCTGGCAACC TTTGCAAAAG CGGGTCTTGA CACGACTGTC
AAGCTGCCTG TCGGCACCAT CACCCTTGAC GAGGATGCAA CAGTCTCGGT TGCGCAGCAG
GCTTCCGGCA GCAATCTGGC CATTGAATTG AAACAGGCGG CAACCAGCTC TCTCACCGAC
GAACAAAAGA AATCGGTCAA GAGCGGCGAT ATTGTGCTGG ACATCAACAT CACCTCCGGC
ACGAAGAAGA TTAGTACCTT TGATGGGACG CTCGCCGTTT CTATGCCCTA TACCGGCCCG
CAGCCTGTAG CCGTATGGTA TTTGAACGAC GGGGGCGAGT TGGAGAAACT TGACTGCACT
TTCAAAAACG GCGTGGTCAG CTTTAACCTT GACCATCTCT CTCTCTATGT GGTGGGGCAA
GACACAGCGA AGCCTACGTG GGTCAATCCC TTTACCGATG CGAAGGAAGC CGATTGGTTC
TATGCGGCGG TCAGCTTCTG TGCAGAGAAA GGTATCACCA ACGGCACATC GGCAACGACA
TTCAGCCCCA ACGCAACACT GACACGCGAA CAGTTTATCA CTATGCTGCT GAGAGCCTAC
GGCATCGAGC CAATTGTAAA CCCCTCCGAC AACTTTAGCG ATGCTGGCAG CACCTACTAC
ACCGGCTATC TCGCAGCGGC CAAGGATAGG GGTATTTCCA ATGGCGTAGG CGATAACAAA
TTCGCTCCGG GAAAAGCAAT TACCCGGCAG GAGATGTTTA CGCTATTGTA CAACGTCATC
AAGTTTCTGA ACAAGCTGCC CACCACAGAT AATGGCAAGA CCCTTGCCGA CTTCACCGAT
AGCGGCGATG TGGCAACTTG GGCGCGTGAT GCCATGATTA TGTTAATTAA ATCTGGCACA
GTTTCCGGCA GCGGCGGTAA GCTCGATCCG ATAGGCGGTT CGACCCGCGC GCAGATGGCG
CAGGTGCTGT ATAACCTGTT GGGGAAGTAA
 
Protein sequence
MKRNILRKAL SLLLALVLLV PLASPAFAAE DIEESTLQSD APMILGAGNG LAVWAGEKLA 
GGVVSSIAGY PMNQAMGKIF GTQTDQVLQA LEAIKVQIQG LKNDIAEMSK KLDRQELRNF
LNGYGDSINN YISVYNELSS AQKITDPAML KILFEKIFKG EDPNYMVGGN SLKNATINLG
NQLRIVKAVE GGSCNIFGAI DLYDRYVNIW EHQGYVMRED FRNKQLSIYT LFSAMSQLAC
QSIIDNNQGD TPSARLAKYQ AESRMKDLKD DAELMDKMVK RTSVVNPDNT ISVYRHPNLR
IARDIKQGID LCAFSPDIKV AYVTKNAWGD EAAWADFSQS AERYTSVDYL MSLTNDKTAR
RTAWSGGWEV YTNQPTPAEY QILRENYGGQ QSLYNIFFDA DKGNFNNVGN RPAELAFLCN
HYVSRNCRDG WINWESNNHV GNNGEVYGWW KFSAARFISD TYAGSARDNW FDPSNAFIVN
KYLGEVKQGS LQGPNSPPDI EYKATISGMD TEYEVGYDSG ITLEIDKTGD AYQWVVNKND
DKGFVELEGE TGKTYSISDG LTSDMNGWQY SCTVIDNPAE PDGEPTYTHA LPVTLNLTGD
GISDPVTEHE VGDADALKTA LDKVDSGEWN QHTLKLADDI TYPNPIAPDG MCSVTLDLNG
YTLTVQPGAN AESNVNAMSN NPQIAAICLN QDQLTIEDSS IDGLGLLNIV AGPGIEYGIY
AANDSGFDGY DAVTEEAVTV TSTVGGTAIY AADDSFVGVK GSVRAEGEDA YGIECLNSGS
AVIVDKDVTV SGKSACGAYV ASFDGGTSTQ VVRIFGDLTI SGENSRAALL DAEAELTVRG
SVTVTGGREG ISVSNGYVET WGNVTAPDYA INARSNKASV VVLGNISVTR ENAVAVSFVG
AEIRVGGNVS SSNSGGVGIL AATWMDPDDG DVKCGASVTA EGKIVAVTPL LIESTPVTDE
SEKASTDLDY NVFTIPGKGT SAVKAMPGAF EVKATSLVTF DKNGGDTEAS PNTKAVITGG
KAGILPTAPG RSGYTFNGWN TQVNGSGTAF TANTDVTGPI TVYAQWEYSV VNAAISPASA
SYDLNSPGDV STAIIWNSAS TITDVVYGTT SLTTPAAYIV TESALTIKSS YLEEQGFSEG
NTAEFMIDFD KGDSAKLTVN IVNNYILSDD AGLSDLKVGG STVSGFDPNV FAYSVELPYD
TLPGSQAATV SADVYNTKAV ASITQAISLP GSAAVKVTAE DKTTTKTYTV NFTLEQAPTT
YTITVQTDGN GTANANISSA SQGTQITLTA NANNGYRFKE WQVIDGGVTI TGNKFNMPAS
NVTVKVIFEY KSGGGSGGGG SSIPTTPTIP AAGGTVSVNY TASNGTASLS LPMVKVNEII
EKSKSNEAVL DLSKVSGITA ASIPKDVLAT FAKAGLDTTV KLPVGTITLD EDATVSVAQQ
ASGSNLAIEL KQAATSSLTD EQKKSVKSGD IVLDINITSG TKKISTFDGT LAVSMPYTGP
QPVAVWYLND GGELEKLDCT FKNGVVSFNL DHLSLYVVGQ DTAKPTWVNP FTDAKEADWF
YAAVSFCAEK GITNGTSATT FSPNATLTRE QFITMLLRAY GIEPIVNPSD NFSDAGSTYY
TGYLAAAKDR GISNGVGDNK FAPGKAITRQ EMFTLLYNVI KFLNKLPTTD NGKTLADFTD
SGDVATWARD AMIMLIKSGT VSGSGGKLDP IGGSTRAQMA QVLYNLLGK