Gene Dtox_3303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3303 
Symbol 
ID8430297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3505920 
End bp3508289 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content51% 
IMG OID645035538 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_003192657 
Protein GI258516435 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.78547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAGC CGTTGGTTTT AATCACATTA ATCTATATAA CAGGCATCTT AGCAGGCGTT 
ATGATACAGG CGCCTGCTTA TTTGCTGTTA GGAGCAGCCT CCCTCTTATT TATACTTGGC
CTGGCCGGCT ATTTAATGAA CTGGCGGCAT AACGGCAAGC TAATTTTACT GGTCTTCTTC
TGCCTGGGCT TTTATTTTTG CCGTCTTTCC GTAGAGAGTA TAGATACTCC CTTGAATAAT
TTTGCCGGGC ACTATGTTGT GCTGACAGGA ACAGTCGTGC AGGAAGCCGA CCGGCGTGAG
GACAGGGTAA ACTATGTCCT CAAGTCGCAT AGTGCCAGGC TGGGCAGCCG GGAAAATCAT
TCAGCCGCGC TGGTCCTGGT AACTGTCTGG CAGCCCGGAT ACCTTTACGG CTACGGTGAT
GTTTTAACAG TCAAAGGAAA GCTCGGCTTG CCGGAGCAGC CGGGCAATCC GGGTGCCTTT
AATTACAGAG AATATCTGGC CAGGCAGGGG ATTGGCCTGG TGATGACAGT CAATGGCGAG
CAAAACGTGG CTTACTCGGG CACAGGGGAG AGCAGCCGTT TAGTGGCTTT TGCCCTGTCT
GTCAAGAAAA AGCTGCTTTT GGTTGTGGAC AAGACCTTTT CAGCCGAACA GGCCGCACTG
GTGAAAGGCA TCTTGTTTGG CAGCTGCGGG CAGATCAACG CGGAAGTTAC GGAAGCCTTT
CAACAGACAG GGTTAATTCA TATTTTAAGT GTTTCCGGCC TGCATGTGGG TTTTGTATTG
CTGGCGGTGT TGGCCCTGGC CTCGGCCTTT AAGCTGCCTT TACGGTACAA CCTGCCCGTC
GCTTCTTTTA TTCTCATCTT TTATGCCGTA CTGACCGGTA TGAGTCCCCC GGTTATAAGA
GCTGTGGTTA TGGCTTTGCT TCTCCTTTCT GCCCGTTACC TGGGGCGGCA GAGAGACTGG
GCGGTTGCCC TTTGCCTGGC GGCACTGCTT AATCTGCTCT ATAAGCCGTT AAATCTTTTC
AATCCCGGAT TTCAGCTTTC TTTTTTGGCT ACCTGGGGTA TCTTCTATCT GGTTCCGGCC
ATAGTTGCCT TTTTGCAGCA AAAATTGGGA ATAAGTAAAA GGATGGCAGC GGTGCTGGCT
GTTCCTCTGG CCGCTGAAGC GGTTACCCTG CCGCCGGTGG CATTGCATTT TAACCTTATA
GCACTGGTTG CACCCCTGGC TAATGTTGTT TGTGTTCCTC TTGTGGGTGC AATCACTTTG
TTTTCAGCCC TGGGTACAAC AGCCGGGTTG CTGTCTCTAA GCCTGACTCA ATTTATCAAT
ATTACTACAG CTTGTCTCCT GGATATCTTT CTCTGGCTGG TGAAACTTTT TCAGCACCTT
CCCGGTGCCT TTGTACATAC AGCCGCCCCG CCGGTTTGGG CAATTATTCT CTGGTACCTG
CTGGTTATAT TGTCAGTTAA TAAAGAATGG CGGGGCTGGT TGGGCCGTAG ATTGCCTGAT
CGTATGCTAA GGCCGGCCGG TGTACTTCTC TTTGTTGCCA CCATGCTTAG TTTTTCTTCC
GGCCTTCATC CCGCAAATCT TGACGGCCGG TTGGCAGTGC ATTTTATTGA TGTGGGACAG
GGAGACAGCA TATTGCTGCA GACGCCGGCC GGAAAAAATA TTCTGGTGGA CTGCGGTGGG
CATAAAGGAG AACTGGCAAG CGGCACAGGT GTGGGTGATA AAGTAGTGCT GCCCTACCTG
CGCCGCCTGG GGGTGCAGAA ACTGGATTTG CTTGTATTAA CCCACTATCA CGAAGATCAT
ATGGGAGGAG CGGCAGCCGT GATCAGGAAC CTGCCGGTGG CTTTGCTGCT GGTCCCCCCG
CAGGACAAGC CACCGGGTGC GGAGTTCAGC GGGCTGACAG AACAGATTAA GGGGGCCGGG
ATCGACATGA GAACGGCTGT TGCCGGTGAC TGTCTGAATT TAGACCCAGG GCTGGAGATT
GATGTTTTGA GCCCTCCCCG TGATATGAAC GGCGAGAACA ATGATTCCCT GGTGCTGCGT
GTTTCTTTTG GCCGGGAGGA TTTTTTACTG ACCGGGGATA TTGAAAAGGA AGCCCAGGAC
TTCCTGCTGC AGCAAAGATA TAATCTGGCC TGCGAAGTAT TAAAGGTGCC GCATCACGGC
AGCAAGTACT TTTTGCCGGA GTTTCTGGAG AAGGTGAAAC CTTTGGCTGC AGTTATTACC
GTGGGAAAGA ATAATTTCGG TCACCCGTCC TTGGAAACCC TGAAACTTTT ACAGGAGGTG
GGAGCGGCTG TTTATCGCAC GGATAGAGAC GGCGCGGTAA TTTTCAGGAC TGATGGAAAC
AGTATAAGAG TTGAGACAGG CAGGAAGTGA
 
Protein sequence
MDKPLVLITL IYITGILAGV MIQAPAYLLL GAASLLFILG LAGYLMNWRH NGKLILLVFF 
CLGFYFCRLS VESIDTPLNN FAGHYVVLTG TVVQEADRRE DRVNYVLKSH SARLGSRENH
SAALVLVTVW QPGYLYGYGD VLTVKGKLGL PEQPGNPGAF NYREYLARQG IGLVMTVNGE
QNVAYSGTGE SSRLVAFALS VKKKLLLVVD KTFSAEQAAL VKGILFGSCG QINAEVTEAF
QQTGLIHILS VSGLHVGFVL LAVLALASAF KLPLRYNLPV ASFILIFYAV LTGMSPPVIR
AVVMALLLLS ARYLGRQRDW AVALCLAALL NLLYKPLNLF NPGFQLSFLA TWGIFYLVPA
IVAFLQQKLG ISKRMAAVLA VPLAAEAVTL PPVALHFNLI ALVAPLANVV CVPLVGAITL
FSALGTTAGL LSLSLTQFIN ITTACLLDIF LWLVKLFQHL PGAFVHTAAP PVWAIILWYL
LVILSVNKEW RGWLGRRLPD RMLRPAGVLL FVATMLSFSS GLHPANLDGR LAVHFIDVGQ
GDSILLQTPA GKNILVDCGG HKGELASGTG VGDKVVLPYL RRLGVQKLDL LVLTHYHEDH
MGGAAAVIRN LPVALLLVPP QDKPPGAEFS GLTEQIKGAG IDMRTAVAGD CLNLDPGLEI
DVLSPPRDMN GENNDSLVLR VSFGREDFLL TGDIEKEAQD FLLQQRYNLA CEVLKVPHHG
SKYFLPEFLE KVKPLAAVIT VGKNNFGHPS LETLKLLQEV GAAVYRTDRD GAVIFRTDGN
SIRVETGRK