Gene Dtox_3944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3944 
Symbol 
ID8430959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4119730 
End bp4121721 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content49% 
IMG OID645036162 
Productglycogen debranching enzyme 
Protein accessionYP_003193260 
Protein GI258517038 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3408] Glycogen debranching enzyme 
TIGRFAM ID[TIGR01561] glycogen debranching enzyme, archaeal type, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00146651 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.690389 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTTCG GAAAAGGCAG TTGGAAAAGT TTTGAGCAGG GTGCGGAGAA GGAATGGCTG 
GTTGCCAACG GTCTGGGCGG CTACGCGGCC GGTACTGTCA TAGGGGCTAA CGCCAGCAAA
TACCATGGCC TGCTGGTGGC CGCCTTAAAT CCTCCGGTTG AGCGGGTTGT TTTGCTGGCC
AAGTTGGATG AACAAGTGGA GGCGGGGGGC ATTACTTACA ACCTTGCTTC AAACCAGACC
AACGGCGAAG TAACCCACTT TGGCTATGTT CACCTGCAGC GGGTGATCAT TGATCCATTG
CCCTGTTTTA TTTACAGTTT TTCTGATATA ACTATGGAAA AACATGTATT TATGCTGCGT
GATCAAAATA CTACGGTGAT TTTATACCGT TTTTACAATG GGGCCCTGCC CGCTACATTA
CGCTTTACAC CTCTGGTAAA CTGCCGTGAT TTTCATCACA ATACTTACTA TGGGCAGTTG
AATTTCAACA GCCAGCCGCT AAAGCGCGGT GTGGCGGTCA GTGCTGTTTC CGGAGTACCT
CCGCTGAAAA TTATCAGCAG CGCGGGAAGA TTCAATAAAC GGGATGACTG GTATAAAGGC
ATGTATTATG CCAGAGAGCG GGAGCGTGGG CTCAATCCTT ACGAAGATCA CTTTATGCCC
GGATATTTTG AGGTAAAAAT TGATTCCGGG GAAGTCAAAA CCATAACTGT GGTGGCCACC
ATAGAAGAAA AATTTCCTGC GGATCAAATT GCCGCCAACG GTGAGGGGTA TCTTTTGCAG
GAAAAGGTTC GATTAGAACA ATTACAGGCT CAGGCCGGAT ACAGTGAACC TTTGGCCAGG
CAGCTGGTCT GGGCGGCAGA CAGTTTTATT GTACACCGCC GCTCGACCGG CACTAAGAGT
GTTATTGCCG GTTATCCCTG GTTCAATGAT TGGGGCAGGG ATACTATGAT TGCCTTACCG
GGGCTGACTT TGATCACTCG CCGGTTCGAG GACGCCAGAG AAATATTGAG TACCTTTGCG
CGCTATTGTA AGGATGGCCT TTTGCCCAAC ATGTTTGCCG ACGGTGACAG GGAACCTCTT
TACAATACAG TAGACGCCTC TCTCTGGTAT TTTCAGGCTG TTTATAAGTT CCTTGAATAT
ACAGGCGACT TTGATTTTAT CAGGTCGGAA ATTTTTCCCG TATTGAGGGA TATCATTTAC
TGCCATGTGC GAGGAACACA TTTCAACATT AAGGCTGATG AGGATGGTTT GCTGCAGGCA
GGCTCCCCTT CCCTGCAGCT CACCTGGATG GATGCCAAGG TAAATGACTG GGTAGTTACT
CCCAGACACG GCAAACCGGT GGAAATTAAT GCTCTCTGGT ATAACGCGCT ATGCGTATAT
GAAAAGTTAT GCCGGCACTA CGGTGAAATA TTTCCCTACG GCGACCTTCC CGGTAAAGTA
GAAAACAGTT TTATCAAGCA GTTCTGGTAT TCTGATGCCG GTATACTCTA TGATGTAATC
GGGCCTGACG GTAAAAAGGA CGCCAAATTG CGGCCTAATC AAATTATAGC GGTAAGCCTG
CCTCATTCGA TGCTGTCTAA AAATAAAAGT ATGATTATAT TAAGAAGGGT CTGGCAGGAG
TTATACGCTA CTTACGGTTT GCGCAGTCTT TCCTTGAGAG ACCCGGAATA TAAGGGCGTT
TATACAGGCG ATCAATTGTG CAGGGACGGC GCCTATCACC AGGGGACGGT TTGGGGCTGG
TTAATCGGAC CGTTTATCAG CGCCTACAGG AGGATCAATG CCTATTCGCC TGCCAGCCGC
GAGCAGGCGG AGAGGTTTAT AGCTCCTTTT ATTGATCACC TGCGGGATCA CGGGGTTGGC
TTCATCTCGG AAATTTTTGA CGGCAACGAG CCGGTCAAAC CCAGAGGTAC TTTCGCCCAG
GCCTGGAGTG TGGCCGAAGT CCTCAGGGCT TATGTCGAGG ATGTGTTGGA GATTAAACCG
GGTCGTGGCT AG
 
Protein sequence
MYFGKGSWKS FEQGAEKEWL VANGLGGYAA GTVIGANASK YHGLLVAALN PPVERVVLLA 
KLDEQVEAGG ITYNLASNQT NGEVTHFGYV HLQRVIIDPL PCFIYSFSDI TMEKHVFMLR
DQNTTVILYR FYNGALPATL RFTPLVNCRD FHHNTYYGQL NFNSQPLKRG VAVSAVSGVP
PLKIISSAGR FNKRDDWYKG MYYARERERG LNPYEDHFMP GYFEVKIDSG EVKTITVVAT
IEEKFPADQI AANGEGYLLQ EKVRLEQLQA QAGYSEPLAR QLVWAADSFI VHRRSTGTKS
VIAGYPWFND WGRDTMIALP GLTLITRRFE DAREILSTFA RYCKDGLLPN MFADGDREPL
YNTVDASLWY FQAVYKFLEY TGDFDFIRSE IFPVLRDIIY CHVRGTHFNI KADEDGLLQA
GSPSLQLTWM DAKVNDWVVT PRHGKPVEIN ALWYNALCVY EKLCRHYGEI FPYGDLPGKV
ENSFIKQFWY SDAGILYDVI GPDGKKDAKL RPNQIIAVSL PHSMLSKNKS MIILRRVWQE
LYATYGLRSL SLRDPEYKGV YTGDQLCRDG AYHQGTVWGW LIGPFISAYR RINAYSPASR
EQAERFIAPF IDHLRDHGVG FISEIFDGNE PVKPRGTFAQ AWSVAEVLRA YVEDVLEIKP
GRG