Gene Dtox_3579 details

Gene Information

Locus tag: Dtox_3579
Symbol:
ID: 8430585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 3780379
End bp: 3781554
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 45%
IMG OID: 645035807
Product: sulfate adenylyltransferase
Protein accession: YP_003192914
Protein GI: 258516692
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2046] ATP sulfurylase (sulfate adenylyltransferase)
TIGRFAM ID: [TIGR00339] ATP sulphurylase
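
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 3780379
end_bp = 3781554
gene_length_bp = 1176
protein_length_aa = 391

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be
# the stop codon, which does not contribute a residue.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, remainder, residues)  # 1176 0 391
```

Under these assumptions the span matches the listed gene length and the codon count matches the listed protein length.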


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000159277
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATTAG TTGCACCGCA TGGTGGAAAA TTAACTCCGG TAATTCTCCC GAAAGAACAA 
CGTGAAGATG CTTTGGCAAA AGCTAAAACT CTGCCGGTAA TTAGAATGTC ATCCCGTGAA
ACATCTGATG TATTAATGAT TGGTATGGGT GCATTCAGTC CATTAATGGG TTTTATGACC
AAAGAAGATT ATGAAAGCGT AGTAAACACC AAGCACTTAG CCAACGGCTT AGCCTGGCCC
GTGCCCATCA CTGTTTCAGT TACCAAAGAA CAGGCTGCTG AACTTAAAGA AGGTATGGAA
GTAGCTCTGG TTGACGACGA AACAGATAAG TATGTGGCTA TTCTTACTGT TAAAGATAAA
TATGAGTATG ACAAGACCAA AGAATGTAAA GAAGTATTCT TTACCGATGA TCCCGAGCAT
GATGGTGTTA AGAAAGTTAT GGGCCAGCCG GAAATTAACG TTGGCGGCGA TATCATCACC
TTCAGTGAAA TGGGCTATGC TACCCAGTAT GCCGGTTATT ATGCTCACCC GCACGAAACC
CGTGCATTAT TTGAATCCAA GGGCTGGAAC ACTGTTTGTG CTTTCCAAAC CAGAAACCCC
TTGCACCGCT CTCACGAGTT CCTCTGCAAG ATCGGTATGG AAGTTTGCGA CGGTTTGTTC
CTGCACCCGA TTGTTGGTAA ATTAAAGCCT GGCGATATTC CGGCTGAAGT TCGTTTTAAG
TGCTACCAGG CTCATATGGA CAACTATTTC AATAATAAGA ACGTTGCTCT TAAAGTATAT
CCGATGGAAA TGCGTTATGC CGGACCCAGC GAAGCTATCC TGCATGCTAT CTTCCGTCAG
AACTTCGGTT GCAGCAACAT CTTAATCGGT CGTGACCACG CCGGTGTAGG CAGTTACTAT
TCTGCATATC AGGCTCAGGA AATTTTTGAC CAGTTTAAGC CCGGTGAGAT CCTTTGCCAG
CCGATTAAAG TTACAGCCGC TGCTTATTGC AAGAAGTGTA TGGGTATGGA AACTGAAAAG
ACCTGCCCGC ATACCGGTGA AGATCGCGTA GCTATCAGCG GTACCAAGGT TCGTCAGATG
TTTGGCGCCG GCCAACTGCC GCCGCTGGAA TTTGGACGTA AAGAAGTTCT CGAAATTCTC
ACCGAGTACT ATCAGGCTTT AGATAAAAAC AAGTAA
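
The GC content reported for this gene could be recomputed from the sequence above along the following lines. This is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and it assumes the sequence uses only unambiguous A, C, G, T letters, with the spaces and line breaks of the 10-base blocks ignored. Only the first row of the sequence is used here as sample input; the full sequence would be passed the same way.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T letters in seq."""
    # Drop the spaces and newlines used to lay the sequence out in blocks.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide letters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied from the record.
first_row = "ATGGCATTAG TTGCACCGCA TGGTGGAAAA TTAACTCCGG TAATTCTCCC GAAAGAACAA"
print(round(gc_percent(first_row), 1))
```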
 
Protein sequence
MALVAPHGGK LTPVILPKEQ REDALAKAKT LPVIRMSSRE TSDVLMIGMG AFSPLMGFMT 
KEDYESVVNT KHLANGLAWP VPITVSVTKE QAAELKEGME VALVDDETDK YVAILTVKDK
YEYDKTKECK EVFFTDDPEH DGVKKVMGQP EINVGGDIIT FSEMGYATQY AGYYAHPHET
RALFESKGWN TVCAFQTRNP LHRSHEFLCK IGMEVCDGLF LHPIVGKLKP GDIPAEVRFK
CYQAHMDNYF NNKNVALKVY PMEMRYAGPS EAILHAIFRQ NFGCSNILIG RDHAGVGSYY
SAYQAQEIFD QFKPGEILCQ PIKVTAAAYC KKCMGMETEK TCPHTGEDRV AISGTKVRQM
FGAGQLPPLE FGRKEVLEIL TEYYQALDKN K
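
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so the standard codon table is used here. The `translate` helper and the constant names are hypothetical, and the sketch assumes a forward-strand, in-frame sequence with no ambiguous bases.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Ignore the spaces and line breaks used to lay out 10-base blocks.
    seq = "".join(ch for ch in dna.upper() if ch in "ACGT")
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGGCATTAG TTGCACCGCA TGGTGGAAAA"))  # MALVAPHGGK
```

The output matches the first block of the protein sequence listed above; passing the full gene sequence would be the natural way to check the remaining residues.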