Gene Dtox_2978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2978 
Symbol 
ID8429968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3164764 
End bp3165894 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content48% 
IMG OID645035232 
Producthypothetical protein 
Protein accessionYP_003192355 
Protein GI258516133 
COG category[L] Replication, recombination and repair 
COG ID[COG1769] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) 
TIGRFAM ID[TIGR01888] CRISPR-associated protein, Cmr3 family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.25181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGAA GGTTTTGGGT TTTCTCAGCT TTGGATAGTT TATTCTTTGG CGACGGAACC 
CCTTACAATG CGGGTGAGGG TGGTCAGTCA AGGGTAGGCG GCTGTTTTCC GCCGTTTATG
AGTACCTTGC AGGGAGCAGT TCGTACCGCT CTGGCCGAGG GGCAAGGATG GACGCCAAAG
AACCCTGAAA AATGGCCACA AGAGCTGGGT ACTCCCGAGC ACCTGGGAAA TTTAAAATTA
ATGGGCCCTT ATCTGTTAGA AGACGGGCTA ATACTTTTTC CGGTACCGCT GTTATTAATG
CAAAAAAAAG ATAAAGATAT GAAATTTAGC CGTTTGGTGC CGGGGGAAAA GGTTGATTGT
GATCTGGGGT ATGTATGTCT GCCGCGGCTT AAAGAGAAGC TTCCCGGAGC TAAATTAATG
GAAAAACATT GGCTTACAGC GGAGGGCTTA AAAGCTATTC TTGCAGGCGG ACTGCCGGAT
AAAGACGATG TTTTTGAACA GGACAAACTA TGGCAGGAGG AAAGCCGGGT AGGTATAGAG
CGGCAGCAAA GCACGCGTAC TGCAGCGGTT GGTAAAATTT ATTCTTCCAT GCATGTACGG
CTAAGAGATT TTGAGCAGAC GGTCAGCCTG GGAGTTTATG TGGATGGAAT ACCGGAGGAC
TGGCACGATA AGGTTGCCAG GGTCGTACGG ATGGGTGGTG AGGGTAGAAT GGCCAGCCTG
GATATCAAAG AAACCGGGTT CGAACTACCC CCGCATCCGG AATTAAAGCC GAGGGACGGC
AAGGTGCAGT TTACTGTTAC TCTAATTACG CCGGGTTGGT TTGATGATTT AGATAGAGTT
ATAATAAGCG GTCCTGTGAA AAGCATTCCA GGAGAATTAG TTACTGCCTG TATCGGGAAA
CTAAAGCATG TAGGTGGTTG GGACATTAAA AATTGCTGTC CCCGACCCTT AAAACCGGTG
CTTCCGGCAG GTACTACCTG GTTTTTTGAA GCAGCGGCAG CTGAGTTGGT ACGGGTATAC
TCACTGCACG GGCAATGTAT CGGTAACAAT AATGAAAAAC AAAAGGATAA TGATAAAACT
GCCTATGGCT TTGGACAAAT CGTAATCGGG AGATGGGGGG ATGAACAATG A
 
Protein sequence
MTGRFWVFSA LDSLFFGDGT PYNAGEGGQS RVGGCFPPFM STLQGAVRTA LAEGQGWTPK 
NPEKWPQELG TPEHLGNLKL MGPYLLEDGL ILFPVPLLLM QKKDKDMKFS RLVPGEKVDC
DLGYVCLPRL KEKLPGAKLM EKHWLTAEGL KAILAGGLPD KDDVFEQDKL WQEESRVGIE
RQQSTRTAAV GKIYSSMHVR LRDFEQTVSL GVYVDGIPED WHDKVARVVR MGGEGRMASL
DIKETGFELP PHPELKPRDG KVQFTVTLIT PGWFDDLDRV IISGPVKSIP GELVTACIGK
LKHVGGWDIK NCCPRPLKPV LPAGTTWFFE AAAAELVRVY SLHGQCIGNN NEKQKDNDKT
AYGFGQIVIG RWGDEQ