Gene Dtox_3662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3662 
Symbol 
ID8430670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3849481 
End bp3850650 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content44% 
IMG OID645035889 
Productintegrase family protein 
Protein accessionYP_003192994 
Protein GI258516772 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000552191 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGCT GGGTTGAATC TCGTGGTAAA AACAAATGGC GATTAAATGT CCCTGATGGG 
ACTGGCCCGG ATGGAAAACG TATCACTCAC AGAAAAGTAG TTGAAGCTAC CAGTGAGCGC
GAAGCTAAAA AACTGCTGGA CGTTTTCTCT GCTGAAGTCC AAAAGGGCCA GTATATCGCA
CCATCAAAAT TAACTTTCAA AGAGTTCAGT CAAAAATGGC TTGAAAGCAA AAAGGACTTA
GCACCAAAAA CCTTATATCG GTATAAGGAG ATATTAAACT CCCGAATATT ACCGGCAATG
GGGCACCTTA AGATTGAAGA CATTAAGCCA TTTCATATAA TGCAGTTTTA TGGCAATTTA
CAAGAACCCG GTATAAGGGA GGATGGCAAA GAAGGTACAT TATCGCCAGC TACTGTCCTT
TATCACCACC GACTGCTGAC CAATATATTC AATGCAGCCG TTAAATGGCA AATAATCCTT
ACCAATCCCG CCCTACGTGT GGAGGCACCC AAGGCCAAAA AGCATAAGGC TACTTCCTAT
GAGGAAGAAG ACACTGCAGC TTTACTCAGT GCATTGGAGG AACAGCCGCT AAAGTTCCAG
GCAATTGTTT ATATTGCCCT TGGTTGTGGT CTTCGTCGCG GTGAGATCAT GGGCCTGGAA
TGGAAGGACA TTGATCTCAC AAAAGGTACA CTGGAGGTAA GACAGTCCAG CCAGTACCTA
CCCGGTCATG GTACGTTTGC AAAGTCACCT AAGAATGAAA GCTCAGAACG CATTATTGCC
GTTCCTACAG AAACAATGTC GCTATTAAAA CAGCACAGAG TACAGCAAAA TGAGCAGCGT
TTACAAGTAG GCGGCCTGTG GCAAGCCTCA GATAGACTAT TTACTACCTG GGACGGAAAA
CCGATGCACC CGGACAGTAT AACAAAATGG TTTAGTGGTT TTCTAAAAAA CAACAACCTG
TCTCCATTGC CTTTTCATGG TTTGCGCCAT ACTGCAGCCA GCTACATGAT TAAGGCCGGT
ATCCCGCTTA AAAATATAGC CAGCCGCCTG GGTCATAGTT CACCCAACAC AACCCTAAAT
ATTTATGCCC ACAGTTTTAA GTCTGTTGAT GCAGAAGCAG CCAATAAAAT GAATGATATT
TTAACCACAC GAAAAAAAGG ACAGGCTTAA
 
Protein sequence
MAGWVESRGK NKWRLNVPDG TGPDGKRITH RKVVEATSER EAKKLLDVFS AEVQKGQYIA 
PSKLTFKEFS QKWLESKKDL APKTLYRYKE ILNSRILPAM GHLKIEDIKP FHIMQFYGNL
QEPGIREDGK EGTLSPATVL YHHRLLTNIF NAAVKWQIIL TNPALRVEAP KAKKHKATSY
EEEDTAALLS ALEEQPLKFQ AIVYIALGCG LRRGEIMGLE WKDIDLTKGT LEVRQSSQYL
PGHGTFAKSP KNESSERIIA VPTETMSLLK QHRVQQNEQR LQVGGLWQAS DRLFTTWDGK
PMHPDSITKW FSGFLKNNNL SPLPFHGLRH TAASYMIKAG IPLKNIASRL GHSSPNTTLN
IYAHSFKSVD AEAANKMNDI LTTRKKGQA