Gene Dtox_2651 details

Gene Information

Locus tag: Dtox_2651
Symbol:
ID: 8429637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 2798494
End bp: 2799834
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 40%
IMG OID: 645034929
Product: hypothetical protein
Protein accession: YP_003192056
Protein GI: 258515834
COG category:
COG ID:
TIGRFAM ID:
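
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in Protein Length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2798494
end_bp = 2799834
protein_length_aa = 446

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
coding_codons = gene_length_bp // 3 - 1

print(gene_length_bp)  # 1341
print(coding_codons)   # 446
```

Under those assumptions the computed values agree with the Gene Length (1341 bp) and Protein Length (446 aa) fields.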


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.310909
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATG TGTTGAATTA TAAAAAAAAT GCATTTTGGG TAAGCGTTAT GGCAATTGTA 
ATTGTCGCTG TGGTCAGCCT TTCTCTTATC GTTAGTAAGC AAACGGAAGG GATGCCTGCG
AATCCCTCGA ATCAAGGTAA TGTGTCATCA TTCGCTCCTT TAAGGACATC GGAAGCGGCG
TATACCACGG AATATGATAG TGTAAAAATT TCATATCTCT CCGAAAACAA AGGATTTAAT
TCGTCAAATA CCTTTGAAGC AACTGATTCC CAAGCAGTGG CATACATTGA CTCAACGATA
AGAACAAGCA TGCCCTCCAA GCAGAAAGAT GATCTGGAAA ACAATCGTAC AGATCAGTAT
ACGATTAAAT TGTCAAATGA GATCGGCGGA TATAGCTGTG GGCTTTACTA TGACACACTC
TACGATAAAG CCTACATCAT AAAGGACGGT GGTCTTTTCG ATATAGGAAC AGATTTTGCA
CGCTATATTG ATTCACTTTT TGAATACACA AATATCACCT TCAGTGTAGA TAAATCTGAT
GAGGCGCTGT TTAAGGAATA TGGATGGACG CTTGATTATC AGATCAGCGA GATGAAAAAC
AAGCTAAACA ACATCAGCAC TTTGTCTGCT TTTAATCCAA ATACATATTA TTTCGCGTAT
AACAACGAGC TTTCAAAGGA TATCGGTCTT GACATGAGCA CATACTCTAA CGCCGCCGAT
CTTGACGTTA AAATTTATAG AATTCATGAA AGCATGCCGC AAGAGTTTTA TCCAATACAA
AATTGCAGAG GTATTGTCGT AAAAAGCGGC GGTAAAATCA TCGGCGCATT CATCAGTGCC
GGACGGCACA ATACGTTTAA TGCCTGCAGC TTGAAAGGGG GCAGTTTTGA CGAAGTGACC
GGCAAGACGT TTAACGATTG GCTTGCGGAC AAAGTTAAAG CGGACACTAA CGAAGAAAGG
CTGTCCCGGC TGGAGCCGGA GCAGATTATT GAAGAATATT TCACAGCTCT GAATAAAAAA
GACGCTGAGA CTGCGGAATG CTGTATTTCG AAAAAAACTC TGCTGGGTGA GTTGACAACA
AATATGCGAA ACGAGAAGTT ACATAACGAT GCGATCGACT TGCTTTTAAC TGATCAAAAT
TTTAATAACC TTAAGTCCGC AAAGTTGTTA AAAATTGAAT TGCTTGATGA GCCTAATAAG
AACACGAAAA AATTCAGGGT AACGGTGGAC CTTCGATACA ACAAGGAGGT GAGCGTCAGC
AACGGAAAGC AGTACTGGGA TTGCAGTATG GTTTATGAAT CTTTTCAGAC GGGATGGAAA
ATAGAAGGAT TTGGGCATTA G
 
Protein sequence
MKNVLNYKKN AFWVSVMAIV IVAVVSLSLI VSKQTEGMPA NPSNQGNVSS FAPLRTSEAA 
YTTEYDSVKI SYLSENKGFN SSNTFEATDS QAVAYIDSTI RTSMPSKQKD DLENNRTDQY
TIKLSNEIGG YSCGLYYDTL YDKAYIIKDG GLFDIGTDFA RYIDSLFEYT NITFSVDKSD
EALFKEYGWT LDYQISEMKN KLNNISTLSA FNPNTYYFAY NNELSKDIGL DMSTYSNAAD
LDVKIYRIHE SMPQEFYPIQ NCRGIVVKSG GKIIGAFISA GRHNTFNACS LKGGSFDEVT
GKTFNDWLAD KVKADTNEER LSRLEPEQII EEYFTALNKK DAETAECCIS KKTLLGELTT
NMRNEKLHND AIDLLLTDQN FNNLKSAKLL KIELLDEPNK NTKKFRVTVD LRYNKEVSVS
NGKQYWDCSM VYESFQTGWK IEGFGH
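
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the Python below strips that whitespace, translates a nucleotide string codon by codon, and computes a GC fraction. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in which codons may act as start codons), and the function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers written for this example, not part of any database tooling.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: table 11 assigns amino acids as the standard code does.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGAAAAATG TGTTGAATTA TAAAAAAAAT"))  # MKNVLNYKKN
```

The printed peptide matches the first block of the protein sequence. Applying `translate` and `gc_fraction` to the full gene sequence would be the natural next step for comparing against the Protein sequence and GC content fields.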