Gene Dtox_0998 details


Gene Information

Locus tag: Dtox_0998
Symbol:
ID: 8427937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 1020501
End bp: 1021571
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 41%
IMG OID: 645033334
Product: hypothetical protein
Protein accession: YP_003190508
Protein GI: 258514286
COG category: [R] General function prediction only
COG ID: [COG3943] Virulence protein
TIGRFAM ID:
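
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 1020501
END_BP = 1021571


def gene_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1071 356
```

Under these assumptions the computed values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.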


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000328283
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAGTGATA AGAAGAAGAA AAAAAGTGTA CAGCTTCGCA ACAGCACAGC CGAGTTTTTA 
ATATTTTCAT ATCAGGTCGG CGGTGATGGT GTTGAAGTCC GTGTTCAGAA CGGAACAATA
TGGCTGAGCC AGAAGCAGAT GGGATTGTTA TTTGACACTT CATCTGGTAA CATTGGCCTT
CATCTAAAAA ATATTTTTAA AGAAGAAGAA TTGAAACAAG ATTCAGTTAC CGAGGAATTC
TCGGTAACTG CCGAAGACGG CAAAAATTAT CGTGTCAAGC ATTATAACCT TGATGCCATC
ATTGCTGTAG GTTATCGTGT AAATTCAAAA CGAGCGACAG CCTTTCGACA ATGGGCAACA
GGAGTTTTGC GTGATTATGC TCTGCATGGC TACTTGCTCG ATAGAAAGCG GATGGAAAAC
GGCGCTTTTC TTGATGAGGA TTATTTCGAA CGTCTGCTTG AAGAGATTCG GGAAATACGA
CTCTCAGAAC GACGCTTTTA TCAAAAAATC ACCGACATCT ATTCAACTGC GATGGATTAT
GATAAAGATT CGCCCATAAC AAAAGAGTTC TTTGCAAAGG TTCAAAATAA AATGCATTTT
GCCGTTCACG GGAGTACTGC TGCCGAATTG ATTGTTGAAC GTGCCGACGC TAAAAAAGAT
TATATGGGAT TAACCAGCTG GGCGAATAGT CCTGACGGGA AGATTCTCAA AAGCGATGTT
ACTATCGCCA AGAATTATCT GACGGCTGAA GAACTTGCTG ATTTAGGCGC TATTGTGAAT
GCTTACTTGG ACTTGGCTGA AAGGCGCGCC AAACGCAGAA TCCCAATGAC TATGGAAGAC
TGGGCTAATA GACTCGATAT CTTCCTGCAG GCTGATGACA GGGAGCTTTT AACAAACGCA
GGAAAAATAT CGGCACAAAT TGCAAAGGAT CATGCAGAAA GCGAATATGA AAAGTATAGA
GTCATCCAGG ACAAGCTGTT TGAGAACGAT TTTGACAAGC AAATGAAAAT TCTTGAACAG
GAGATTGCAA AATCAGAAAA AGATAAAGAT TCTAATGGCG ACAAAAACTG A
 
Protein sequence
MSDKKKKKSV QLRNSTAEFL IFSYQVGGDG VEVRVQNGTI WLSQKQMGLL FDTSSGNIGL 
HLKNIFKEEE LKQDSVTEEF SVTAEDGKNY RVKHYNLDAI IAVGYRVNSK RATAFRQWAT
GVLRDYALHG YLLDRKRMEN GAFLDEDYFE RLLEEIREIR LSERRFYQKI TDIYSTAMDY
DKDSPITKEF FAKVQNKMHF AVHGSTAAEL IVERADAKKD YMGLTSWANS PDGKILKSDV
TIAKNYLTAE ELADLGAIVN AYLDLAERRA KRRIPMTMED WANRLDIFLQ ADDRELLTNA
GKISAQIAKD HAESEYEKYR VIQDKLFEND FDKQMKILEQ EIAKSEKDKD SNGDKN
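
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first line of the gene sequence. This is a sketch only: it builds the codon table by hand instead of using a bioinformatics library, and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two tables differ in which start codons they permit). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGTGATA AGAAGAAGAA AAAAAGTGTA CAGCTTCGCA ACAGCACAGC CGAGTTTTTA"
print(translate(first_line))  # MSDKKKKKSVQLRNSTAEFL
```

The result matches the first 20 residues of the protein sequence listed above.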