Gene Information |
Locus tag | Dtox_0906 |
Symbol | |
ID | 8427845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 917376 |
End bp | 918857 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 645033249 |
Product | anthranilate synthase component I |
Protein accession | YP_003190423 |
Protein GI | 258514201 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
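The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 917376, 918857
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1482 bp) and Protein Length (493 aa).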
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000348371 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000000636534 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGTATATTA TGATAAAACC TGGTTACGAG GAATATCTCA GACTGAGTGC TGATTATAAC CTGATACCTG TATATACAGA TTGTGAAGCG GATACTGAAA CGCCCAACAC GGTTTATTTA AAGACAGTCG GTGATGGCCA TGGCTGCCTG CTGGAAAGCG TAAACGGTGG GGAGAATGTG GGGAGGCACT CCTTTATCTG CCTGAAGCCC TTTTTGACTT ACAGGGGATC TAATACGGAA GGTGAACTGA CCTATCCCGG CGGATTGAAA AAAGCTGTTG TTGGTTCACC CCATAAGGTG TTGCAGGGCC TGATGGACAG TTATAGAATT CCTTCCTTTC CTGAACTGAT CGAATTTTCC GGCGGGGCGG TTGGCTATAT AGGCTATGAT GTTGTGCGTT CCGTCGAGGA GTTGCCTGAG CTGTTGCCGG AAGACGATTC ATTGCCTTTG TGTATGATGT TTTTTCCTTC GGTGATTTTA TGCTACGACC ATGTTTGCCG CAGTATGAAA ATTGTAGCCA ATGTTCCGGT AGGTGATGAC CCGGCACAGT CATATGAGCA GTCTCTGGAG CTAATTAAGG CTGTGAAGCA GGATTTGCAG AAACCGCTGG TTTTACCGGG GGATAATTTC GAGCAGGAAA AGCGGTCACC CGCCGCCGGT CTTGAGGAGA TAGTATCTGA GCCGGGCAAA GAATTATTTA TGGAGATGGT AGAACAGGCT CTGGAATATA TCAGAGCCGG GGATATCATT CAAGTTGTTT TATCCCGCCG CTATTCAACG CCGCAGAGGG AGGAGCCTTT CAGTATCTTT AGAAAGCTGC GTCGTTTAAA CCCTTCGCCT TATATGTACT TTATGGATTT CGGTGATCCT GTGGTGGTAG GGTCTTCGCC TGAGATGTTG GTTAAGGTGC ATAACGGCCA GGTCCTCACT CATCCCATTG CCGGAACCAG ACCGAGAGGC AAGAACGGTG CTCAGGACAG TGAACTAGCT AAAGATTTAT TGGCTGACGA GAAGGAGCGG GCGGAACACC TGATGCTGGT GGATCTGGGA CGCAATGACA TAGGCAGAGT CAGTTTGCCC GGTACGGTTG AGGTGGCCCG TTTTATGGAA ATAGAAAAGT TTTCTCACGT AATGCATATA GTATCTACCG TTCAGGGGAG GCTTTTGCCG GAAAAAACAC CTTTGGATGC CTTAATGGCC TGTTTTCCTG CCGGCACAGT CAGCGGGGCG CCTAAAATCA GAGCCATGAG TATTATAGAG GAGTTGGAAC CGATGCGGCG CGGTATCTAT GCCGGCGCTG TCGGCTATAT CGGCTTTAAT AACACTATGG ATACGGCTAT TGCCATCAGA ACAATTGTTG TAAATAAAGG CAAATGCTAT GTGCAGGCAG GGGCTGGTAT TGTTGCCGAT TCAGAACCGG AAAAAGAGTA TGTGGAAACG CAAAACAAAG CCGGAGCCCT TTTGCGGGTG TTGGGTTATT AG
|
Protein sequence | MYIMIKPGYE EYLRLSADYN LIPVYTDCEA DTETPNTVYL KTVGDGHGCL LESVNGGENV GRHSFICLKP FLTYRGSNTE GELTYPGGLK KAVVGSPHKV LQGLMDSYRI PSFPELIEFS GGAVGYIGYD VVRSVEELPE LLPEDDSLPL CMMFFPSVIL CYDHVCRSMK IVANVPVGDD PAQSYEQSLE LIKAVKQDLQ KPLVLPGDNF EQEKRSPAAG LEEIVSEPGK ELFMEMVEQA LEYIRAGDII QVVLSRRYST PQREEPFSIF RKLRRLNPSP YMYFMDFGDP VVVGSSPEML VKVHNGQVLT HPIAGTRPRG KNGAQDSELA KDLLADEKER AEHLMLVDLG RNDIGRVSLP GTVEVARFME IEKFSHVMHI VSTVQGRLLP EKTPLDALMA CFPAGTVSGA PKIRAMSIIE ELEPMRRGIY AGAVGYIGFN NTMDTAIAIR TIVVNKGKCY VQAGAGIVAD SEPEKEYVET QNKAGALLRV LGY
|
| |
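The gene sequence is displayed in space-separated blocks of ten bases, begins with TTG rather than ATG, and ends with TAG, while the protein begins with M. Under translation table 11, TTG is one of the permitted initiation codons, and an initiating codon is read as methionine. The sketch below joins the blocks, checks the first codon against the table 11 start set, and computes GC percentage. It is an illustration only: the helper names are hypothetical, and the demo uses just the first three blocks of the sequence, so its GC value is not the 48% reported for the full gene.

```python
# Initiation codons listed by NCBI for translation table 11.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}
# Stop codons in translation table 11.
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def join_blocks(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def has_valid_start(seq: str) -> bool:
    """True if the first codon is a table 11 initiation codon."""
    return seq[:3] in TABLE11_STARTS


def has_valid_stop(seq: str) -> bool:
    """True if the last codon is a table 11 stop codon."""
    return seq[-3:] in TABLE11_STOPS


# First three display blocks of the gene sequence above (demo input only).
first_blocks = "TTGTATATTA TGATAAAACC TGGTTACGAG"
seq = join_blocks(first_blocks)
print(len(seq), has_valid_start(seq), gc_percent(seq))
```

Passing the complete gene sequence to the same helpers would let the listed length, start codon, stop codon, and GC content be checked against the record.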