Gene Dtox_3956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3956 
Symbol 
ID8430971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4137514 
End bp4140405 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content51% 
IMG OID645036174 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003193272 
Protein GI258517050 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000293672 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00020375 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGGACA GAATCGTGGT TAAAGGAGCA AGGGTGCATA ATCTGAAAAA TATTGATGTG 
GAAATACCCA GGGATAAGCT GGTAGTAATT ACGGGTTTGT CCGGTTCCGG CAAGTCCTCT
CTGGCTTTTG ACACTATTTA TGCCGAGGGT CAGCGCCGCT ACGTGGAGTC GCTCTCGGCT
TATGCGCGCC AGTTTTTAGG ACAGATGAAC AAGCCGGATG TGGATTATAT AGAAGGGCTG
TCTCCGGCCA TTTCTATCGA TCAGAAAACA ACTTCCCATA ACCCTCGCTC CACTGTGGGG
ACAGTTACCG AAATCTATGA TTACCTGCGC CTGCTTTTTG CCAGGGTGGG GAGACCTCAC
TGCCATAAGT GCGGCAAGCC TATTACCAGG CAGACGGTGC AGCAGATAGT TGACCGGTTG
ATGCTGCTGC CGGAAAGCAC TAGGCTGCAG ATACTGGCCC CGGTGATCAG GGGTAAAAAA
GGTGAGCATG TAAAAGTACT GGAAGACATA AGGCGCGGCG GTTTTGTCCG GGTGAGAGTA
GACGGCGAAA CCAGGGAACT GGGTGAAGAA ATCAAGCTGG AGAAAAATAA GAAGCATACC
ATTGAAGTGG TAGTGGACAG GGTGATTATC AGGGCCGGTT CGGAGAAACG ACTGGCTGAT
TCACTGGAAA CGGCTCTGCA GCAGAGCGGC GGCATTGTGC TGGCCAGTAT CACGGACGGG
GAAGAGTTGA TTTTCAGTGA AAACTTTGCC TGTGTGGACT GCGGCATCAG CGTGCAGGAG
ATAGCTCCCA GATCCTTTTC CTTTAACAAC CCTTACGGCG CTTGCCCGGA ATGTACCGGT
CTCGGCACTA AGCTGGAAAT TGATCCCAAC TTAATTATCC CGGACATGAA TCTTTCTATA
GCTGAAGGGG CCATCGAAGG TTGGCATAAA GGAAATATCT CCGCTTCCTA TTTCAGCGGT
CTGGCCGAAC ACTATGGCTT TAGCCTGGAT ACACCTGTAA AAGAACTGAA GCCTGATCAC
CTGCAGGTAC TGCTCTATGG CACCGGTGAG CAAAAAGTGC GCATTATTTA TACTGATGTG
TACGGGCGGC GGCATGATTA CAAGATGCCT TTTGAAGGTA TTATTAATAA CATTGCCAGG
CGCTACAGGG AGACAGCCTC CGAGCATATG AGAAATGAAT TTGAACAGTA TATGAGTTCG
GTGATTTGTC CGGTCTGCGG CGGGGCCAGG CTGAAGCCTG AGGTGCTGGC GGTAAAAATA
GGCGGCTTGT CCATACATGA AGTAACCTGT TTAACGGTTA CCGACACATT GCATTTCTTT
GAAAAGTTGG ATTTGACTGA GCGTGAGCGG GTGATAGCCA GGCAGATATT AAAGGAAATT
AATGAGCGGT TAGGTTTTTT GATTAATGTG GGTTTAAACT ACCTGACTCT GAACCGGACA
GCCGGTACTC TTTCCGGGGG CGAGGCGCAG AGGATCCGCC TGGCTACTCA AATTGGAGCA
GGCTTGATGG GGGTTTTGTA TATACTGGAC GAGCCCAGCA TCGGTTTACA CCAGCGGGAT
AACGAGAGAT TGTTAAATAC CCTGCGCCGC TTGAGGGATA TAGGCAATAC TTTAATTGTG
GTGGAGCATG ATGAGGATAC GGTGCGCACG GCTGATTATA TTATTGATAT CGGGCCGGGA
GCCGGTGTGC ACGGCGGGCA GTTGGTGGCT GCCGGGACTT TGCGGGAAAT TCTGGACAAT
GAAAATTCTC TAACAGGCCA GTATTTAAGC GGCAGAAAGT ATATTCCGGT ACCGGACAGC
CGCCGGGAGC CTAACGGCAA GTATGTGGAA GTTAAAGGGG CGGAAGAAAA TAATCTTAAA
AATATTGATG TGCGCTTTCC CCTGGGGGTA TTCACCTGTG TTTCCGGTGT TTCCGGTTCC
GGTAAAAGTA CTCTGGTTAA CGAAATTTTA TATAAAACCT TAAGCCAGGA ACTGCACGGG
GCCAGGAGCA AGCCGGGTTG CTGCCGGGAA GTGGGGGGCC TGGAATATCT GGACAAGGTG
ATAGATGTGA ACCAGTCTCC TATCGGGCGT ACTCCCCGAT CCAACCCGGC CACTTATACC
GGGGTGTTTA CCTATATCCG GGAATTATTT GCCCAGACGC CGGAAGCCCG TATGAGGGGC
TATAAGCCCG GGCGCTTCAG CTTTAACGTT AAGGGCGGGC GCTGTGAGGC CTGTCAGGGA
GACGGCATTA TAAAAATAGA AATGCATTTT TTGCCGGACG TTTATGTTCC GTGCGAGGTT
TGCAAAGGAC GCCGCTACAG CAGAGAAACC CTGGAAGTAA CCTATAAAGG CAAGAGCATT
GCCGATGTGC TGGATATGAC GGTTGAGCAG GCTGTGGAAT TCTTCCGCCA CATACCGAAG
ATTCACCGTA AAATGGAGAC TATGCAGGAT GTCGGTTTGG GTTATATTCG TCTGGGTCAG
CCGGCGCCGG AACTTTCCGG CGGTGAAGCG CAGCGGGTAA AGCTGGCTGC CGAGTTGTCC
CGCCGCTCCA ACGGCAAAAC CTTTTACATT TTGGACGAGC CGACTACCGG TTTGCACACT
GATGATATAG CCAGGTTGTT AAAGGTACTG CACCGCCTGG TGGAAGCGGG GGATACTGTG
GTGGTCATTG AGCATAATCT GGATGTGATT AAAACAGCGG ATTATATAAT TGACTTAGGA
CCGGAAGGCG GGGACAAGGG CGGCAGCGTG GTAATTGCCG GGACGCCGGA GGAAGTGGCG
GCTGAGACAC AGTCTCACAC GGGCAGGTTT TTAAAGAAGG TTTTGCCGGC CGGAGTGGAA
GCTGCAGCCG GCGGCAGGGA AATGGGCGAT GCGGAGGAGA ATGCGGCTGC CGGTGAGGCG
CAGGCTATAT AG
 
Protein sequence
MLDRIVVKGA RVHNLKNIDV EIPRDKLVVI TGLSGSGKSS LAFDTIYAEG QRRYVESLSA 
YARQFLGQMN KPDVDYIEGL SPAISIDQKT TSHNPRSTVG TVTEIYDYLR LLFARVGRPH
CHKCGKPITR QTVQQIVDRL MLLPESTRLQ ILAPVIRGKK GEHVKVLEDI RRGGFVRVRV
DGETRELGEE IKLEKNKKHT IEVVVDRVII RAGSEKRLAD SLETALQQSG GIVLASITDG
EELIFSENFA CVDCGISVQE IAPRSFSFNN PYGACPECTG LGTKLEIDPN LIIPDMNLSI
AEGAIEGWHK GNISASYFSG LAEHYGFSLD TPVKELKPDH LQVLLYGTGE QKVRIIYTDV
YGRRHDYKMP FEGIINNIAR RYRETASEHM RNEFEQYMSS VICPVCGGAR LKPEVLAVKI
GGLSIHEVTC LTVTDTLHFF EKLDLTERER VIARQILKEI NERLGFLINV GLNYLTLNRT
AGTLSGGEAQ RIRLATQIGA GLMGVLYILD EPSIGLHQRD NERLLNTLRR LRDIGNTLIV
VEHDEDTVRT ADYIIDIGPG AGVHGGQLVA AGTLREILDN ENSLTGQYLS GRKYIPVPDS
RREPNGKYVE VKGAEENNLK NIDVRFPLGV FTCVSGVSGS GKSTLVNEIL YKTLSQELHG
ARSKPGCCRE VGGLEYLDKV IDVNQSPIGR TPRSNPATYT GVFTYIRELF AQTPEARMRG
YKPGRFSFNV KGGRCEACQG DGIIKIEMHF LPDVYVPCEV CKGRRYSRET LEVTYKGKSI
ADVLDMTVEQ AVEFFRHIPK IHRKMETMQD VGLGYIRLGQ PAPELSGGEA QRVKLAAELS
RRSNGKTFYI LDEPTTGLHT DDIARLLKVL HRLVEAGDTV VVIEHNLDVI KTADYIIDLG
PEGGDKGGSV VIAGTPEEVA AETQSHTGRF LKKVLPAGVE AAAGGREMGD AEENAAAGEA
QAI