Gene Dtox_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1997 
Symbol 
ID8428979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2159658 
End bp2160929 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content37% 
IMG OID645034324 
Productprotein of unknown function DUF445 
Protein accessionYP_003191455 
Protein GI258515233 
COG category[S] Function unknown 
COG ID[COG2733] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.135519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATATCT TGAGCAAAAA CAATTATCAT AAGGCGAGTA TCACCTTAGC AGTCGTAACT 
GTTGGTTTTT TAATCAGCTT CCCTTTCCGC TCAAGCTTTG CCGGAGGATT ATTGGAAAGC
CTTTTCGGTG CTGCTATGAT TGGTGGGTTA GCGGACTGGT TTGCTGTTTC GGCGCTGTTT
CGCAAGCCTT TAGGTATACC CTGGCGAACG GAAATAATCC CTAAAAACCG TGATAAAATT
TTCAATACTT TAATTGATAT GGCTGAGAAC GAACTGCTGA CCAGGGATAA CATTAAAAAG
AGATTGGCCG GCTACAATAT TTCAGATCTG TTAATAAGAT ATATTACAGA ATATGGGGGA
AATAAAAATA TTAAAGTCAT CCTAAACAGA ATTATAAGGG ATATTTGGGT TAATATTAAT
CCAGAGGATA CCGGTAAATA TTTAGAAGAT TTATTGAAAG AAAATGCCTT AAAAATAAAA
TTGTCCCCCT TGCTGCTGGA GGTTGTGGAA TGGTCCCTGA AAAACGGCTA TGATGAAAAA
GTTATTGATT ACTTTCTTGA CGAGTTAATC AGAATTTGCG GTCACCGCCA AATGAAAATC
ATGATTTTAA ACCTTTATTG CAATTTCAGA AAACTATACG AGCAAGGTTT GTTGAGAAGA
AAAGTAGCTA ACGAAATAAT ATTAAGTTTT ATACTGCGTC TGCCTCCGGA GATTATTGCT
GACTACCTTC AAGCGGAACT GCTTAAATTT CTCAAAGAGA TAAAAGAACC TGCTTCATTT
TGGCGCGATA AAATTAAAGT CAGAATTCAG GCTATACTTT TAAATCTTAA GCAGGATACA
AGAATAATTA ATAAACTGGA AGCCTGGAAA ATAAACCAGA TTGAGCAAAA CTTGCATATA
CAAAATGCTG TGGTTGCTTT TATACGGGCT TTGCGGGAGG AAGCGGCGGC AGCGCAGGAT
AAAACCTGGC AATTATACAG TTGGACAGAC GGATACATAG ATAGATTAAT TAAAAACTTT
AAAAAAGATT ATGTAAAGCA GGACAGGCTC AATGAGGTTG TTAAGTCAGC TCTGAATAAC
TGGATTGATC GCCAGCACAA ACAAATAGGA ATAATTATCA AGGAAAGCTT GAACCGCTTT
TCAGGATATT TACTGGTAGA ATTTATAGAA AACAGGGTGG GTAATGATTT GCAGATGATA
CGCATTAACG GGTCTGTAGT TGGTGGTTTA ACGGGCGTAT TAATATTTTT ATTAACCTTT
TGGATGACAT AG
 
Protein sequence
MNILSKNNYH KASITLAVVT VGFLISFPFR SSFAGGLLES LFGAAMIGGL ADWFAVSALF 
RKPLGIPWRT EIIPKNRDKI FNTLIDMAEN ELLTRDNIKK RLAGYNISDL LIRYITEYGG
NKNIKVILNR IIRDIWVNIN PEDTGKYLED LLKENALKIK LSPLLLEVVE WSLKNGYDEK
VIDYFLDELI RICGHRQMKI MILNLYCNFR KLYEQGLLRR KVANEIILSF ILRLPPEIIA
DYLQAELLKF LKEIKEPASF WRDKIKVRIQ AILLNLKQDT RIINKLEAWK INQIEQNLHI
QNAVVAFIRA LREEAAAAQD KTWQLYSWTD GYIDRLIKNF KKDYVKQDRL NEVVKSALNN
WIDRQHKQIG IIIKESLNRF SGYLLVEFIE NRVGNDLQMI RINGSVVGGL TGVLIFLLTF
WMT