Gene Dtox_0992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0992 
Symbol 
ID8427931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1016870 
End bp1018150 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content42% 
IMG OID645033330 
Productglycoside hydrolase family 18 
Protein accessionYP_003190504 
Protein GI258514282 
COG category[R] General function prediction only 
COG ID[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00884991 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000357477 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGGAAAC TGGGAATCAA GTCAGTTACC TGTCTGGTTG TTTTTTTTCT TTTTGCCGTA 
TGGTTTCCCA CAGTCGCACT GGCGGGAACA AGTGTTACAC CGGGTAATTT AAGACTCGGT
GACAGAGGGC CTGATGTTAC TTTGCTGCAG ACAAAACTTA AAGTTGCGGG CTTTTATCAA
GGAGAAAAGG TTTCCGGTTA TTTTGGCTTG AATACACTTT TCGCAGTCTC AAAATTTGAG
AAAGCTAACC GGCTGCGTGT TGACGGTATA GTTGATGCTG AGGAATGGAT TGCACTGCAA
AAACTTACCG CTATTCCGGC AGACAAGCTA AAGAAGATGG TATTAGGTTA TTATACAGTG
GATTATACAG GAGATAAGTT ATCCTATAAT TCCCTGGATA AGTACAGCAG TTATATAGAT
ACTGTAGCTA CTTTCAGTTT CAAAGTTAAC CGCGATGGCA GTTTAACCGG TGAAGTGCCG
CAGGATGCTT TAAAACTGGC CAAGGAGAGA AGCGTGGAAA CATTGCTGTT GGTTCATAAT
ATTGGCCAGC CAATTGACAG TGATGCTGCT CACTACGCTC TGTCAGTTGC CGAAAACCGC
AGCAGGCTGG AAGCAAACAT CATGTCAAAA GTGAAGGCCA ATGGTTATAA CGGTGTTAAT
ATTGACATTG AAGCTTTACC GCCGGGAGAC AGGCAGTATT ATAATATTTT CCTAAAGGAA
TTAGGCGACC AATTGCACAA GGAAAATTTG CTGCTTACCG TTTCTATCCC GGCTAAAACT
TTTGACTCTA CCAATGATAG CTGGTCCGGT GCTTATAGTT ATAAGGATAT TGGCCAACTG
GTTGATCAGG CTATGATTAT GACCTATGAT GAGCACTGGT TTGGCGGTTC TCCCGGTCCG
ATTGCCTCGG TGCCCTGGAT TAACAAGGTT ATGGACTATG CAGTCGAGGT GATGCCCAGA
GAAAAAATTT TTCTTGGCGT GGCTGCTTAT GGTTATGATT GGTCCAGTCA GGGAACCAGA
GCAGTGCGCT GGAATCAAGT CAATGATTTA GTTAAGAATT CCGGTAATGT TATATGGGAC
AATACAAACA GTGTACCCTG TGTCATTTAT TATAAAAATG GTGTCAGGCA TGAGTTGTGG
TTTGAAAATA ACTACAGTTT GCGTTTTAAA TTGGAAACGG TTAAGAGTTA TAACGTTTCA
GGTATAGCCA TCTGGCGTCT TGGTTTTGAG GATGACTCCT TTTGGAAAAT GGTTAATGAT
GAATTTAGAC AGGCTGACTA A
 
Protein sequence
MGKLGIKSVT CLVVFFLFAV WFPTVALAGT SVTPGNLRLG DRGPDVTLLQ TKLKVAGFYQ 
GEKVSGYFGL NTLFAVSKFE KANRLRVDGI VDAEEWIALQ KLTAIPADKL KKMVLGYYTV
DYTGDKLSYN SLDKYSSYID TVATFSFKVN RDGSLTGEVP QDALKLAKER SVETLLLVHN
IGQPIDSDAA HYALSVAENR SRLEANIMSK VKANGYNGVN IDIEALPPGD RQYYNIFLKE
LGDQLHKENL LLTVSIPAKT FDSTNDSWSG AYSYKDIGQL VDQAMIMTYD EHWFGGSPGP
IASVPWINKV MDYAVEVMPR EKIFLGVAAY GYDWSSQGTR AVRWNQVNDL VKNSGNVIWD
NTNSVPCVIY YKNGVRHELW FENNYSLRFK LETVKSYNVS GIAIWRLGFE DDSFWKMVND
EFRQAD