Gene Dtox_2621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2621 
Symbol 
ID8429607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2774037 
End bp2775239 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content41% 
IMG OID645034909 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_003192036 
Protein GI258515814 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATA ACAAAACGGC TTTGGTGCCG AGGCTACGGT TCCCGGAATT TGCAATCAAT 
GGCGCAATTA ACTTCGAAAC CGGAGATGTA ATATTTGAGC CCATTAGCAA TAAGAACCAT
AATTCTGATC TTCCGGTTCT CGCCATAACG CAGGAACATG GTGCCATCCC AAGAGATCAG
ATTGATTATA ATGTGTCGGT AACGGATAAA AGCCTTGAAA GTTACAAGGT TGTGGAAATA
GGAGATTTCA TCATTAGCCT AAGATCTTTT CAAGGCGGAA TAGAATATTC TTTATATCAT
GGTATTTGTA GCCCGGCATA TATTATTTTG CGCAAGAGGG TTCCAATTGT TGATCAATAC
TACAAACATT ATTTTAAAAC GGGGAGATTT ATAAAAGATC TAAATAAGGA CTTAGAGGGC
ATAAGAGATG GTAAAATGGT AAGTTATAGA CAGTTCTCTG CTATAATGCT GCCAAAGCCG
GATAGAAAAG AGCAACAAAA AATCGCCGAT TGCCTATCTT CCATTGACGA CCTTATCGCC
GCAGAAGACA AAAAGCTTGA GGCTCTCGGA GCACATAAAA GGGGGCTTAT GCAGAAGCTG
TTTCCCGCCG AGGGGAAAAC TTTGCCTGAG TGGAGGTTCC CAGAGTTTAG GGGTAGTGGA
GAGTGGGTAA TTAGTCCATT GAGTGAAGTA TGCGAAAACT TGGATTCAAG AAGGATTCCA
ATAACGGAGA AGGATAGAAA AAAAGGCTTT ACGCCTTATT ATGGTGCATC TGGTATTGTC
GATTATGTTG ACGGTTTTAT TTTTGACGAA GTATTACTGT GCGTGTCGGA AGATGGAGCT
AATTTAGTAG CGCGTACATA TCCCATCGCA TTTAGTATTT CGGGCAAAAC TTGGGTAAAC
AATCATGCAC ATGTACTCAA ATTCCAAAAT AGCAATACAC AAGTTATGGT TAAAAACTAT
ATAAATAGCA TTAACTTGGA GGACTTCCTT ACAGGCATGG CTCAGCCAAA ATTGAACAGA
GCAAAGCTGG ATATCATACC CATTCCATTG CCGAGCGAAA AGGAACAACA GAAAATCGCT
GATTGTCTTT CCAGCATTGA TGACCTTATA GCCGGGCAAG TCAAAAAGCT TGAGGCTCTG
AGAACCCACA AAAAAGGCTT AATGCAAGGC CTGTTCCCTT CTATTGAGGA GGTGGGCGAG
TGA
 
Protein sequence
MNNNKTALVP RLRFPEFAIN GAINFETGDV IFEPISNKNH NSDLPVLAIT QEHGAIPRDQ 
IDYNVSVTDK SLESYKVVEI GDFIISLRSF QGGIEYSLYH GICSPAYIIL RKRVPIVDQY
YKHYFKTGRF IKDLNKDLEG IRDGKMVSYR QFSAIMLPKP DRKEQQKIAD CLSSIDDLIA
AEDKKLEALG AHKRGLMQKL FPAEGKTLPE WRFPEFRGSG EWVISPLSEV CENLDSRRIP
ITEKDRKKGF TPYYGASGIV DYVDGFIFDE VLLCVSEDGA NLVARTYPIA FSISGKTWVN
NHAHVLKFQN SNTQVMVKNY INSINLEDFL TGMAQPKLNR AKLDIIPIPL PSEKEQQKIA
DCLSSIDDLI AGQVKKLEAL RTHKKGLMQG LFPSIEEVGE