Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_0029 |
Symbol | |
ID | 8426951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 31706 |
End bp | 32866 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 645032425 |
Product | protein of unknown function DUF201 |
Protein accession | YP_003189616 |
Protein GI | 258513394 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGGTAG CAATAATGGG TGGAAAGCTG CAGGGCGTGG AAGCGGCCTA TCTTTCCCGG CAAGCCGGAT GGAAAGCAGT GCTGGTTGAC AGGAATCGGG ACGCTCCGGC TGCCGGGATG TGCGATAACT TTATCCAAGC TGATCTTATG GAGAAAGATA AATTAGTTGA TTTATTTAAG AAAGTTAAAC TGGTTATTCC GGCTGTGGAA AACAGTGAGG TACTGGAAAA TATCAAAGAA TGTGCCCTTG CGGCAGGTGT GAAAATTATT TATGATTCTG CTGCCTATGC TCTTTCTTCC TCCAAAATAG AGTCGGATAA ACTCTTTGCA AGGCTTGATC TGCCTGTTCC GAAGCCCTGG TCCGGCTGTG GCTTTCCTGT TGTCATAAAG CCCTCCGGAG CAAGCGGCAG TGAAAATGTT TATAAGATTA ACAGTCTCAG AGATTTTAAC GAGCTTACCT TGAAGCTGGG TAATCTGGAT AGTTGGGTCA AACAGGAATA TCTGCGGGGT CCCTCCTACT CGATTGAAGT TATAGGCTGC AAGGGGGATT ATCAGACTTT CCAGATAACA GAATTAGAAA TGGACGCCGC TCATGATTGT AAAAGAGTAT TGGCTCCTGC GCAATTATCC ACGGAAAAGC AGGATGAATT TAAATCTCAG GCTGTTAAAA TAGCCCGGGC TCTCAACTTG AATGGAATTA TGGATGTCGA AGTCATTTTG CATGATGAAA AGTTTAAAGT ACTGGAAATA GACGCCCGCT TGCCAAGCCA GACTCCAACC GTAGTTTATA AAGCCACGGG TATTAATATG CTGGAAGTTT TATGGCCCGG TGGCTTAAAA AGAAGGGAGA AGCAAGAAAT AGGAATGACC AGGGGCGTTG TTTATGAACA TATTAAAGTA AGAAGCAACC GGCTGATTGA AGTTGGCGGA GAGCATATAA TGGCGGGAGC CGGTCGCCTG CATATCTTAA AAAAGTTTTT TGGTGCAGAC GAAGCTATTT CAAATTATGA CAGGGATAAA ACAGAATGGG TTGCTACTCT CATTATTACA GGTAAGAACA GGCAGGAGGC TTGGTTAAAA AGAACGGCAG TTATAAAGAA TATCATGCAA GAGTTTGCTG TTGAGGATTG CATGGATACC GGCATTTCAA ATAAAGGATG A
|
Protein sequence | MLVAIMGGKL QGVEAAYLSR QAGWKAVLVD RNRDAPAAGM CDNFIQADLM EKDKLVDLFK KVKLVIPAVE NSEVLENIKE CALAAGVKII YDSAAYALSS SKIESDKLFA RLDLPVPKPW SGCGFPVVIK PSGASGSENV YKINSLRDFN ELTLKLGNLD SWVKQEYLRG PSYSIEVIGC KGDYQTFQIT ELEMDAAHDC KRVLAPAQLS TEKQDEFKSQ AVKIARALNL NGIMDVEVIL HDEKFKVLEI DARLPSQTPT VVYKATGINM LEVLWPGGLK RREKQEIGMT RGVVYEHIKV RSNRLIEVGG EHIMAGAGRL HILKKFFGAD EAISNYDRDK TEWVATLIIT GKNRQEAWLK RTAVIKNIMQ EFAVEDCMDT GISNKG
|
| |