Gene Dtox_0906 details

Gene Information

Locus tag: Dtox_0906
Symbol:
ID: 8427845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 917376
End bp: 918857
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 48%
IMG OID: 645033249
Product: anthranilate synthase component I
Protein accession: YP_003190423
Protein GI: 258514201
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
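
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this explicitly, but the numbers are consistent with it) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 917376
END_BP = 918857
GENE_LENGTH_BP = 1482
PROTEIN_LENGTH_AA = 493


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```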


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000348371
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000636534
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGTATATTA TGATAAAACC TGGTTACGAG GAATATCTCA GACTGAGTGC TGATTATAAC 
CTGATACCTG TATATACAGA TTGTGAAGCG GATACTGAAA CGCCCAACAC GGTTTATTTA
AAGACAGTCG GTGATGGCCA TGGCTGCCTG CTGGAAAGCG TAAACGGTGG GGAGAATGTG
GGGAGGCACT CCTTTATCTG CCTGAAGCCC TTTTTGACTT ACAGGGGATC TAATACGGAA
GGTGAACTGA CCTATCCCGG CGGATTGAAA AAAGCTGTTG TTGGTTCACC CCATAAGGTG
TTGCAGGGCC TGATGGACAG TTATAGAATT CCTTCCTTTC CTGAACTGAT CGAATTTTCC
GGCGGGGCGG TTGGCTATAT AGGCTATGAT GTTGTGCGTT CCGTCGAGGA GTTGCCTGAG
CTGTTGCCGG AAGACGATTC ATTGCCTTTG TGTATGATGT TTTTTCCTTC GGTGATTTTA
TGCTACGACC ATGTTTGCCG CAGTATGAAA ATTGTAGCCA ATGTTCCGGT AGGTGATGAC
CCGGCACAGT CATATGAGCA GTCTCTGGAG CTAATTAAGG CTGTGAAGCA GGATTTGCAG
AAACCGCTGG TTTTACCGGG GGATAATTTC GAGCAGGAAA AGCGGTCACC CGCCGCCGGT
CTTGAGGAGA TAGTATCTGA GCCGGGCAAA GAATTATTTA TGGAGATGGT AGAACAGGCT
CTGGAATATA TCAGAGCCGG GGATATCATT CAAGTTGTTT TATCCCGCCG CTATTCAACG
CCGCAGAGGG AGGAGCCTTT CAGTATCTTT AGAAAGCTGC GTCGTTTAAA CCCTTCGCCT
TATATGTACT TTATGGATTT CGGTGATCCT GTGGTGGTAG GGTCTTCGCC TGAGATGTTG
GTTAAGGTGC ATAACGGCCA GGTCCTCACT CATCCCATTG CCGGAACCAG ACCGAGAGGC
AAGAACGGTG CTCAGGACAG TGAACTAGCT AAAGATTTAT TGGCTGACGA GAAGGAGCGG
GCGGAACACC TGATGCTGGT GGATCTGGGA CGCAATGACA TAGGCAGAGT CAGTTTGCCC
GGTACGGTTG AGGTGGCCCG TTTTATGGAA ATAGAAAAGT TTTCTCACGT AATGCATATA
GTATCTACCG TTCAGGGGAG GCTTTTGCCG GAAAAAACAC CTTTGGATGC CTTAATGGCC
TGTTTTCCTG CCGGCACAGT CAGCGGGGCG CCTAAAATCA GAGCCATGAG TATTATAGAG
GAGTTGGAAC CGATGCGGCG CGGTATCTAT GCCGGCGCTG TCGGCTATAT CGGCTTTAAT
AACACTATGG ATACGGCTAT TGCCATCAGA ACAATTGTTG TAAATAAAGG CAAATGCTAT
GTGCAGGCAG GGGCTGGTAT TGTTGCCGAT TCAGAACCGG AAAAAGAGTA TGTGGAAACG
CAAAACAAAG CCGGAGCCCT TTTGCGGGTG TTGGGTTATT AG
 
Protein sequence
MYIMIKPGYE EYLRLSADYN LIPVYTDCEA DTETPNTVYL KTVGDGHGCL LESVNGGENV 
GRHSFICLKP FLTYRGSNTE GELTYPGGLK KAVVGSPHKV LQGLMDSYRI PSFPELIEFS
GGAVGYIGYD VVRSVEELPE LLPEDDSLPL CMMFFPSVIL CYDHVCRSMK IVANVPVGDD
PAQSYEQSLE LIKAVKQDLQ KPLVLPGDNF EQEKRSPAAG LEEIVSEPGK ELFMEMVEQA
LEYIRAGDII QVVLSRRYST PQREEPFSIF RKLRRLNPSP YMYFMDFGDP VVVGSSPEML
VKVHNGQVLT HPIAGTRPRG KNGAQDSELA KDLLADEKER AEHLMLVDLG RNDIGRVSLP
GTVEVARFME IEKFSHVMHI VSTVQGRLLP EKTPLDALMA CFPAGTVSGA PKIRAMSIIE
ELEPMRRGIY AGAVGYIGFN NTMDTAIAIR TIVVNKGKCY VQAGAGIVAD SEPEKEYVET
QNKAGALLRV LGY
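
The protein sequence can be re-derived from the gene sequence with translation table 11, which this record lists for the gene. The sketch below is a minimal pure-Python version, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string in TCAG order, reads an initial codon from the table 11 start set (such as the TTG that opens this gene) as methionine, and stops at the first stop codon. The helper names `clean`, `translate_cds`, and `gc_percent` are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Amino acids are listed in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate_cds("TTGTATATTA TGATAAAACC TGGTTACGAG"))  # MYIMIKPGYE
```

Passing the full gene sequence block to `translate_cds` should reproduce the protein sequence listed above, and `gc_percent` on the same block can be compared with the GC content field.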