Gene Dtox_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1987 
Symbol 
ID8428969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2144021 
End bp2145175 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content46% 
IMG OID645034314 
Producthypothetical protein 
Protein accessionYP_003191445 
Protein GI258515223 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0946014 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC TAAGAGGATT AAAAATAATA AACATAATAT TAGTCCTGCT CATGATGCTC 
AGCATAGTTA ATCCCGCTCT GGCGACAGGA TTAACTGTCA ATCCCGATCA GCCGGCGTAT
AGTGGAGGAG AAAAAGCCGC AGTTGCAGCA TCTGTTTATG ACGACGCTAC GAATGCAGGT
GGCGGCACAG ACGGGAATTC CGACGCTAAA AGCGAGGAAA ACCATGAAAT TGCGGAGGAT
AATGACTCCG CCGGTGTCAC CGAGCCCCAA AACAGCTCTG ACCAAACTGT GAGCAGCTTG
GTTTATCCTG CCAATGACAC CGAAAAGCAG GCTGCAAGCC TGGCTGTTGA ACTCAATGAT
GTTAATCCGG ACCTGGGAAC AGTTAAAGTT CGGGTAGTGG ATAACGTACA AAGACTGGCA
GGAGATTTGG CAAACATTTC TTCCGATTAC CGGGAACCCT TTGGTGAAAT ATTGCCGTTG
ACAGAAGTCG AGATTACCGA GGGGCTTACT ATGCGAGGGG CCTTGGAAAA GGCTCTGGCT
ACAAAAAGTA TAACGGTTTA TGGCGCGGTA GATTATGTAA GTGGGATTGG TCCTGTTACA
TCTGCCGACG GAAGCAGAAA AGTTGCCAAG CTTTCCGAGT TTGACAGTGG CAGCCAGAGT
GGCTGGATGG TCACACTAAA TGATTGGTTT ATCAATGCGG GGGCCAATAC CTTTACAGTT
AAGGACGGCG ATGTGGTAGA ATTCTGCTAC ACATGCAATC TGGGGGCTGA CCTTGGCTCC
GGCTTTAACA ACCCGGATAC ATCCTTAAAA GCCCTTTCCG TTAATAAAGG AGTCCTTAAC
CCGGTATTTG CGCCCGGAAC CAAAGAATAT ATATTAACTC TGCCTGCAGC AACACAAATA
ATGGTTACTC CTACCGCTGC AAACAAAAAC AATAAGGTAA CAATTCAGTC GGGAGACGTA
ACCTATCGCA GTACAGATGA AATTGCCGTT GCAGATGAAC AAGTAATCAC TGTTAAATGC
GGTCAGAACA CCTATAAAAT AACCGTAGCT GTCTCAAATA ATGACCAGAG CAGTGCCGAC
GCGGTCAATG ATTTAATTGA GGCACGTTTA CCCAGTATGA TATTGCCACA AGCAACGGCA
TTCAAATCGA ATTAA
 
Protein sequence
MKKLRGLKII NIILVLLMML SIVNPALATG LTVNPDQPAY SGGEKAAVAA SVYDDATNAG 
GGTDGNSDAK SEENHEIAED NDSAGVTEPQ NSSDQTVSSL VYPANDTEKQ AASLAVELND
VNPDLGTVKV RVVDNVQRLA GDLANISSDY REPFGEILPL TEVEITEGLT MRGALEKALA
TKSITVYGAV DYVSGIGPVT SADGSRKVAK LSEFDSGSQS GWMVTLNDWF INAGANTFTV
KDGDVVEFCY TCNLGADLGS GFNNPDTSLK ALSVNKGVLN PVFAPGTKEY ILTLPAATQI
MVTPTAANKN NKVTIQSGDV TYRSTDEIAV ADEQVITVKC GQNTYKITVA VSNNDQSSAD
AVNDLIEARL PSMILPQATA FKSN