Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_2978 |
Symbol | |
ID | 8429968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 3164764 |
End bp | 3165894 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 645035232 |
Product | hypothetical protein |
Protein accession | YP_003192355 |
Protein GI | 258516133 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1769] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01888] CRISPR-associated protein, Cmr3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.25181 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGAA GGTTTTGGGT TTTCTCAGCT TTGGATAGTT TATTCTTTGG CGACGGAACC CCTTACAATG CGGGTGAGGG TGGTCAGTCA AGGGTAGGCG GCTGTTTTCC GCCGTTTATG AGTACCTTGC AGGGAGCAGT TCGTACCGCT CTGGCCGAGG GGCAAGGATG GACGCCAAAG AACCCTGAAA AATGGCCACA AGAGCTGGGT ACTCCCGAGC ACCTGGGAAA TTTAAAATTA ATGGGCCCTT ATCTGTTAGA AGACGGGCTA ATACTTTTTC CGGTACCGCT GTTATTAATG CAAAAAAAAG ATAAAGATAT GAAATTTAGC CGTTTGGTGC CGGGGGAAAA GGTTGATTGT GATCTGGGGT ATGTATGTCT GCCGCGGCTT AAAGAGAAGC TTCCCGGAGC TAAATTAATG GAAAAACATT GGCTTACAGC GGAGGGCTTA AAAGCTATTC TTGCAGGCGG ACTGCCGGAT AAAGACGATG TTTTTGAACA GGACAAACTA TGGCAGGAGG AAAGCCGGGT AGGTATAGAG CGGCAGCAAA GCACGCGTAC TGCAGCGGTT GGTAAAATTT ATTCTTCCAT GCATGTACGG CTAAGAGATT TTGAGCAGAC GGTCAGCCTG GGAGTTTATG TGGATGGAAT ACCGGAGGAC TGGCACGATA AGGTTGCCAG GGTCGTACGG ATGGGTGGTG AGGGTAGAAT GGCCAGCCTG GATATCAAAG AAACCGGGTT CGAACTACCC CCGCATCCGG AATTAAAGCC GAGGGACGGC AAGGTGCAGT TTACTGTTAC TCTAATTACG CCGGGTTGGT TTGATGATTT AGATAGAGTT ATAATAAGCG GTCCTGTGAA AAGCATTCCA GGAGAATTAG TTACTGCCTG TATCGGGAAA CTAAAGCATG TAGGTGGTTG GGACATTAAA AATTGCTGTC CCCGACCCTT AAAACCGGTG CTTCCGGCAG GTACTACCTG GTTTTTTGAA GCAGCGGCAG CTGAGTTGGT ACGGGTATAC TCACTGCACG GGCAATGTAT CGGTAACAAT AATGAAAAAC AAAAGGATAA TGATAAAACT GCCTATGGCT TTGGACAAAT CGTAATCGGG AGATGGGGGG ATGAACAATG A
|
Protein sequence | MTGRFWVFSA LDSLFFGDGT PYNAGEGGQS RVGGCFPPFM STLQGAVRTA LAEGQGWTPK NPEKWPQELG TPEHLGNLKL MGPYLLEDGL ILFPVPLLLM QKKDKDMKFS RLVPGEKVDC DLGYVCLPRL KEKLPGAKLM EKHWLTAEGL KAILAGGLPD KDDVFEQDKL WQEESRVGIE RQQSTRTAAV GKIYSSMHVR LRDFEQTVSL GVYVDGIPED WHDKVARVVR MGGEGRMASL DIKETGFELP PHPELKPRDG KVQFTVTLIT PGWFDDLDRV IISGPVKSIP GELVTACIGK LKHVGGWDIK NCCPRPLKPV LPAGTTWFFE AAAAELVRVY SLHGQCIGNN NEKQKDNDKT AYGFGQIVIG RWGDEQ
|
| |