Gene Dtox_2870 details


Gene Information

Locus tag: Dtox_2870
Symbol:
ID: 8429859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 3048594
End bp: 3050741
Gene Length: 2148 bp
Protein Length: 715 aa
Translation table: 11
GC content: 48%
IMG OID: 645035134
Product: Hedgehog/intein hint domain protein
Protein accession: YP_003192258
Protein GI: 258516036
COG category:
COG ID:
TIGRFAM ID:
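
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3048594
end_bp = 3050741

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2148 715
```

Both results agree with the Gene Length (2148 bp) and Protein Length (715 aa) fields.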


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTGGG AAAATTGTGT AGATAACATT AGGGCCACTC TTGATCCAAA AGCTAAGGAA 
ATTAATGCAA TCAACTGGTC TTCAATAAGT CAGGACAGTC TGGACAACGA TACTTTCGCA
AAGGGAGAAC CCTTTTCTCT CATTTGCCCG TCCAGGCACT GCCAGGATTT TGCCGACTTT
TCTCCGCAGG CCCACAGGAT TTGTCAATGT CTGGATATGG AAGGCAAAAG AATACCAAAC
TGTGAACCTA AAGACTATTT GTACCCCTAT CCTTCCCAGG TTTGCTTAAA TATTAACAAT
CCCGCCGATT TCAGCGGCAA AAGGAGCAAG GCTTGCACCA GGCAAGGAGA CAGTTATTAT
CAACTTACCT GTCACTGCTG CTGCTCCTGC TTTGCCTACG GAACCAAGAT AGGCACACCC
AACGGGCTTA AGAAAATTGA GCAATTCGCA GTAGGCGATT TAGTTCTGGC AGCCAGTCTC
GAAAGCAATG CCGGAGGCAT AAAGTTAAAC TGGTCGCCGC TGAAAGTATC TTTCAGTTCC
GGCACGGGAC CCGACAGCCA CCAGCCGGCC ATGATATACA TCAGACACGG CGAAACAGGT
TCCATTATAG TTACTCCGGA CCACTTGTTT CTGATGCCCG AAGGAAAGCT GAAAAGGGCA
GATCGCCTCG TTCCCGGCAA AGATCAACTT GTCTCCTACG AAGGCCAGGC TGTGCCTGTT
CATGAAGTAC ACTTAGGCGA ATATGAAGGC GGAGTCCACC ATATTGCCAC TGATAACAGC
TTTACGGGAA GCCTGAACGG CCACCTGCTT CTGTCGGAAG GAGTGGTTTC CGGCGACTTT
AATCTGCAGA TACATGCCTC GGAATTAAAA GAAAACTACT TTATCGATGA CCATGACAGC
TATCCCAAAA TCGGTACCAA TGAATATAAA GAGCAAAATA CCGAGCTGCT GGAGGGCAGG
TATTTGAGTT TTCAAGCCTC CGGCATAAAT GAAGTTAACC CTGTGCCCCA GCCGTCGAAG
TTTTACGTGC ACGGGCAAAA TATCTCCTAT ATTCCACAAA CGGCAGCCAA ATACTTGAGC
AGCTTACAAG AGATAGATGT CAACGACAAC GCCGAAAGGC GCAGTTTTTC CGAAATGAGC
CTCGGAAACG CCGCGGTTAA TTACGCTCTT AAACTGTTCA GAGGATTTTA CCCGGACATC
ATCTTTCATC ATGACATAGC ACGGCTTGAG CCCAATGCAT TCGCCTTCAA CCAGTATGGA
AAGGATATAG TCGTCCTGTC CGGAGGGTTG ACCAGGATTA AAAACCTGCA GTTGGAAGGG
ATGACGATGG TATTGAGCCA CATGGTTACA CGCCTGCAAA AAATCAATCC AGTTGACTAT
AACGGTTTTA CTTCCGCAGC CATGGCAGAT TATTACAGCC CAGGTGTACT GCAGACAGTG
CTGTTTGACA AACTGTACGC CGATGTCCTG AAAAAGGGCA AAAAACAGCT TGAAGACGGT
ATCTATGCAT ATATAAACAA ACAGCATGAG GCCTTTGAGG AAGACCCGTA TGCCCCAACC
TGGGAAACCC GCCTAGACGC CATAGACGCG GGTTATGCCA TGGACTTTCC GCCGGAAGGG
ATAGGCGGAC CGGTATTCCA GGGATTGCAA GTGCAGGCTG CAAAAGCTTT TCCGCCGGCA
CTTGCCCCGA ACTCTTTTAT AACTGAGGAC ATTGACGCGG CAGCTTCCCG GCAGGTCTTT
GACCGGTTGA AGGAGAACAA AGTCTTGGAC GACCAGGGAG TACTGAACAC TAAATTCAGC
ATTGACACAG ATCTTTCCTT TTTATTTAAA GATAAGCCTG ATAATCTTAA CAGGTATCTT
ACTGAGGAGA TTCGTTTCAT CTTAAGACAT GTACCTGCCA GAATCCGCCT GACCTTTAAT
CTGCCGCTGT CTGCCGAAAA GGCATCTGCT GTCGGCAGCT ATGAGTTAGA ACCCGAAAGC
AATATTATGC ATGCCGGAGT AGATGAGAAG GATCCCGCCG TCCTGTGGCT CACCGCTCGT
TTGCAGAAGG AGACGGAATA TACTTTGACC GTTTCGAAAT ATTTAAAATC TAAGGATGGC
TCTACTGTTG ATCCCCAAAA CAACAGCATC CAGCTTAAAC TGGTCTAG
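
The gene sequence is printed in space-separated blocks of ten bases, so it needs the whitespace stripped before any analysis. A minimal sketch of that cleanup, with a GC-content calculation that can be compared against the GC content field of the record, follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as printed (six blocks of ten bases).
first_row = "ATGGCTTGGG AAAATTGTGT AGATAACATT AGGGCCACTC TTGATCCAAA AGCTAAGGAA "

seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a whole-gene value to set beside the reported GC content.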
 
Protein sequence
MAWENCVDNI RATLDPKAKE INAINWSSIS QDSLDNDTFA KGEPFSLICP SRHCQDFADF 
SPQAHRICQC LDMEGKRIPN CEPKDYLYPY PSQVCLNINN PADFSGKRSK ACTRQGDSYY
QLTCHCCCSC FAYGTKIGTP NGLKKIEQFA VGDLVLAASL ESNAGGIKLN WSPLKVSFSS
GTGPDSHQPA MIYIRHGETG SIIVTPDHLF LMPEGKLKRA DRLVPGKDQL VSYEGQAVPV
HEVHLGEYEG GVHHIATDNS FTGSLNGHLL LSEGVVSGDF NLQIHASELK ENYFIDDHDS
YPKIGTNEYK EQNTELLEGR YLSFQASGIN EVNPVPQPSK FYVHGQNISY IPQTAAKYLS
SLQEIDVNDN AERRSFSEMS LGNAAVNYAL KLFRGFYPDI IFHHDIARLE PNAFAFNQYG
KDIVVLSGGL TRIKNLQLEG MTMVLSHMVT RLQKINPVDY NGFTSAAMAD YYSPGVLQTV
LFDKLYADVL KKGKKQLEDG IYAYINKQHE AFEEDPYAPT WETRLDAIDA GYAMDFPPEG
IGGPVFQGLQ VQAAKAFPPA LAPNSFITED IDAAASRQVF DRLKENKVLD DQGVLNTKFS
IDTDLSFLFK DKPDNLNRYL TEEIRFILRH VPARIRLTFN LPLSAEKASA VGSYELEPES
NIMHAGVDEK DPAVLWLTAR LQKETEYTLT VSKYLKSKDG STVDPQNNSI QLKLV
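
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model). The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGCTTGGG AAAATTGTGT AGATAACATT"))  # MAWENCVDNI
print(translate("CTGGTCTAG"))                         # LV
```

The two outputs match the first ten and the last two residues of the protein sequence above.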