Gene Dtox_3544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3544 
Symbol 
ID8430539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3738192 
End bp3739532 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content38% 
IMG OID645035764 
Productprotein of unknown function DUF21 
Protein accessionYP_003192882 
Protein GI258516660 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGATG ATACGTCGTT TAGTATTATT GTTGCAGTAC AAATTATATT CGCGTTGTTT 
TTGGTTTTTT TAAATGGTAT CTTTGTGGCT GCGGAATTCT CCTTTGTTAA AGTAAGACCT
ACACGTCTAG CGCAACTGGC AGATGAAGGA AACCGGAAAG CAAATATTGC TCAAACTATC
ATAAGCAATA TAGACGCATA TCTTTCAGTG TGTCAACTTG GTATAACCCT GGCAAGCCTT
GGCTTGGGTT GGCTCGGGGA ACCAGTGGTA GCTAAAATTA TAGAACCTGT TTTGGGTTAC
TTAGGGGTAT TTTCGTCAGA TGTGCTGCAT TACATTTCTT TTGTAATTGC ATTTAGTTTA
GTTACGTTAA TGCATGTAGT GTTTGGTGAA CTGGTACCAA AATCTTTGGC AATCCAGAGA
GCAGAAAAAA TAGCCCTGTA TCTGGCAACA CCTATGCGTA TATTTTATTA TCTTTTTTAT
CCGGGGATAA TCGTTTTTAA CGGTACAGCC AATTCAATAT TGCATATCAT TGGTATTCAA
CGTACCAGTG AACATGAGGC AAGTCATAGT GAGAAAGAAC TGCAAATGCT TGTTTCCGAA
AGTTACAAAT CCGGACATTT AGATAAAGAT GAGTGGAGAT TACTTCAAAA TGTTTTTGAA
TTTGAGAAAA GAATTGCCAG AGAAATATTG GTTCCACGTC CGGAAGTAGT TTTTTTAGAT
AGAAGAAAAA CCCTACAGCA AAATATTGAG ATAGCACGGC AATCGGAACA TACTCGTTTT
CCTCTTTGTG ACGGAGATAA TGACAATGTG GTTGGTCTTA TACACATCAA AGACCTTTTT
AAGCTAAAAG ATGAAACTAG CATTAATGAT GTTAAACGCA ATATTATGAT GGTACCAGAG
GGAATTCCAT TAGACAGATT ACTCAAGCAA TTCCAACAAT GCCGCCAGCA GATGGCTTTG
GTAGTAGATG AATACGGCGG TACAAGTGGT ATAGTTACTA TGGAAGATGT TTTGGAAAAA
TTGGTAGGTG AGATTCATGA TGAGTTTGAC AATGAGATTC CAAAAATAAT CCCAGAAAAA
GAAGGGACTT TTCTTGTAGA GGGTCGATTG CTTCTGGAAG AAGCTAAAGA AATGTTTCAC
CTACCTGTAA CTGAGGATAC AGAATATGAT ACTATTGGCG GTTATGTTTT TGGTGAACTC
GGCAAACGCC CCAAGGTTGG AGATATTGTG GAACTACCTA ATCACCGGCT AGAGGTAACC
AGAATTCAAG GACTCCGTAT CCAACAAATT CGTTTAAATA TCCTTGATAA TAAGTTAAAT
AGAGATATTC ATGCAGTATA A
 
Protein sequence
MGDDTSFSII VAVQIIFALF LVFLNGIFVA AEFSFVKVRP TRLAQLADEG NRKANIAQTI 
ISNIDAYLSV CQLGITLASL GLGWLGEPVV AKIIEPVLGY LGVFSSDVLH YISFVIAFSL
VTLMHVVFGE LVPKSLAIQR AEKIALYLAT PMRIFYYLFY PGIIVFNGTA NSILHIIGIQ
RTSEHEASHS EKELQMLVSE SYKSGHLDKD EWRLLQNVFE FEKRIAREIL VPRPEVVFLD
RRKTLQQNIE IARQSEHTRF PLCDGDNDNV VGLIHIKDLF KLKDETSIND VKRNIMMVPE
GIPLDRLLKQ FQQCRQQMAL VVDEYGGTSG IVTMEDVLEK LVGEIHDEFD NEIPKIIPEK
EGTFLVEGRL LLEEAKEMFH LPVTEDTEYD TIGGYVFGEL GKRPKVGDIV ELPNHRLEVT
RIQGLRIQQI RLNILDNKLN RDIHAV