Gene Dtox_3584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3584 
Symbol 
ID8430590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3787484 
End bp3788755 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content48% 
IMG OID645035812 
Productprotein of unknown function DUF39 
Protein accessionYP_003192919 
Protein GI258516697 
COG category[S] Function unknown 
COG ID[COG1900] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00707173 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAATG TGGCCTCTTC TGTATGGAAG AGGCTTTACT TTATGTTTCG GATAGAGTAC 
ATCAGGGAGG GTAAAAACTT GAGCATCAAA AGAACTTACA CTGAGATCAA CAGCAAAATA
AAGTCAGGCC AGGCAGTTGT CGTAACAGCC GAAGAACTGA TAGGCATGGT TAAGGACAAG
GGTATTGCCG CAACCGCCGA ACAGGTTGAT GTCGTTACTA CCGGAACTTT CGGGCCCATG
TGCTCTTCCG GCGCTTTTAT AAATCTCGGA CACAGCAAGC CCCGCATGAA GATACAAAAA
GCCTGGCTAA ATAAAGTGCC TGCCTATGGC GGTATAGCCG CGGCCGATGT TTTTTTGGGA
GCTACAGAGT TACCTGAGGA TGACCCGTTA AACAACAGTT ACCCGGGTGA ATTCCGCTAC
GGCGGGGGCC ATGTAATTGA AGACATGGTG GCAGGCAAAA CCATCAAATA TAAGGCTGTG
GCTTACGGCA CGGACTGCTA CCCGCTAAAA GAAATTGAAA CGGAAATAAC CTTGCAGGAG
ATTAATGAAG CTATACTAAT GAACCCCCGC AATGCTTATC AAAACTACAA CTGTGCGGTA
AACCTTTCCG ACCGGACTGT TTATACTTAT ATGGGCATGC TTAAACCGGG ACTGGGCAAC
GCTCACTACT CCACGGCCGG ACAGCTCAGC CCCCTTTTAA AAGATCCCGA TTTCCGAACT
ATCGGCATAG GTACCAGGAT TTTTCTGGGC GGCGGGATAG GTTATGTTAT ATGGAACGGA
ACTCAGCACT TTCCGGAACA CATTACTGAT AAAGACGGCA AATACCTGGG TTCTGCCGGT
GGCACCCTGT CAGTATTGGG CGACTTAAAA CAAATGAGCC CTAAATGGCT GAAAGGCTCC
AGTTTCCTTG GCTACGGTGC CAGCTTAACC GTAGGCATAG GCATACCGAT ACCGGTACTC
GACGAAGATA TCCTGCGCTT TGCCGCCGCT TCAGATGAGG AACTGTACGC ACCGGTTGTT
GATTATGGGG ATGCCTATCC CAACCGCAAA CCAGGCAATC TTGGCTATGT CAGCTATGCG
GAGCTAAAAT CCGGTAAAAT ATCCTTAAAC GGCAAAGAAA TACCGACAGC GTCTATGTCC
AGCTATAAAA AAGCCAGAGA AATTGCTTCT ATCCTAAAAG GATGGATCCT AAAGGGAGAC
TTTCTCCTTT CTCAACCTGC ACAACTTCTG CCAAATACAA CTTCTGGCAT TGTGGGTAAG
GCCTTAAAAT AA
 
Protein sequence
MENVASSVWK RLYFMFRIEY IREGKNLSIK RTYTEINSKI KSGQAVVVTA EELIGMVKDK 
GIAATAEQVD VVTTGTFGPM CSSGAFINLG HSKPRMKIQK AWLNKVPAYG GIAAADVFLG
ATELPEDDPL NNSYPGEFRY GGGHVIEDMV AGKTIKYKAV AYGTDCYPLK EIETEITLQE
INEAILMNPR NAYQNYNCAV NLSDRTVYTY MGMLKPGLGN AHYSTAGQLS PLLKDPDFRT
IGIGTRIFLG GGIGYVIWNG TQHFPEHITD KDGKYLGSAG GTLSVLGDLK QMSPKWLKGS
SFLGYGASLT VGIGIPIPVL DEDILRFAAA SDEELYAPVV DYGDAYPNRK PGNLGYVSYA
ELKSGKISLN GKEIPTASMS SYKKAREIAS ILKGWILKGD FLLSQPAQLL PNTTSGIVGK
ALK