Gene Dtox_0807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0807 
Symbol 
ID8427745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp819553 
End bp820992 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content43% 
IMG OID645033163 
ProductVWA containing CoxE family protein 
Protein accessionYP_003190338 
Protein GI258514116 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0395672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATC AAAAGTATAC TCAAATTCTT TCAGAAATAA GCAGCAAGGA TAATAAATCT 
TTGGAGTATA TGGTAGCCAG GTTTGCACAT ATCCTGAGAC ATTTGGATGT CCGGGTGAGC
GCCTCAGAAA CCATAGATGC TTTAAGGGCT TTATCCATAA TTAACATCAT GGACAGAGAT
CAAGTTAGAG CAGCTTTAAG AGGCACTCTG GTCAAAGGGG AAATGGAACA TAGAATTTTT
GACCTGGCTT TTAATAATTT CTTTCTCCCG CCTGAAGAAA AAGCCAGCCT GCGCCTGGAA
GAGAAGTTGG CCGAGCAGGA TCGTCTGGCC AGCCTGCAGG AAGCAGAGGA GGACTTTTTG
GCCAGTATGC AGGACGGAGA GTTTCCCTGG AGTGAGGAAT TATTGAAGAA TATCAGGCTG
ACCAGGGAGC AGAAAGAAAC TTATGCTCAC CTGCCGGAAA AAGAAAAGCA AAGATTAAAA
GAAATCCTGA CCAGTTTCCA GGGCAATAAT ATCAACAATC CTGATACTTT GATAGCTCAG
GTGGCCGAAT CTTCCTTAAA TTTCTGGCGT TATCATATGC TGAAGAATAA TGAGGATTTC
GATGAGCCGG AGCCTCTTGC TCCAGACAGG TTAACCGGCG AGGAAGAGAT GGATGAGGTA
ATCGAAAGAG TAAGCGCCGA GTTTTTCCGT GACGCGGGCG ATAATATAAT GTATCAGGAT
ATGAAGAATA TTTCGGATGA AAACTTGCCC CGTGTCATGT CCCTGATCAA AAAGATGACC
AAGAAGCTGG TAACAAGAGT TTCCCGCCGT ACCAGGTTCA GTAAAATGAA AAAAACCATA
GACATCCGGC GCAGCATTCG CCAGAATATA AGTTACGGGG GCATTCCTCT GGAACTGCGC
TACCGGGCCA AAAGGATTCA AAAGCCGCGC CTGTTATTGA TTTGTGACGT ATCTGCCTCC
ATGGCCCGCT ACGCCAGGTT TGTGATCCAG TTTATATACG GTCTTTCCAA CGCGGTGAAA
GATATTGAAA GTTTTATTTT TTCTGAGGAT CTGGAACGCA TAACCCCCAT GTTTAAAAGA
AAAAAAGGTT TTGCTGATAC CATGACTGAA ATCATCAACC AGAGCGGCAT ATGGGGTCAG
GCAACCGATT TTAACCGGTC ATTAGAGACT TTTGGGCAGA GATATCAAAA TTTATTAACA
AGTGAAACAT ATTTGATAAT TATGAGCGAT ACAAAAACTC TGGCGGTTGA ACAGGCTGCT
TTTCGCCTGA AGCAGATGAA AAAGAACCTC AGGGGTGTAA TATGGCTGAA TACTTTGCCC
AGAAATGAAT GGATACAATA TAAATCAGTC TTTATTTTTC AACAGCAGTC CCGTATGTTT
GAGTGCAATA CGCTGGCTCA CCTGGATAAA GTTATGCGCA GTCAAATTTT CTCTGTTTGA
 
Protein sequence
MNNQKYTQIL SEISSKDNKS LEYMVARFAH ILRHLDVRVS ASETIDALRA LSIINIMDRD 
QVRAALRGTL VKGEMEHRIF DLAFNNFFLP PEEKASLRLE EKLAEQDRLA SLQEAEEDFL
ASMQDGEFPW SEELLKNIRL TREQKETYAH LPEKEKQRLK EILTSFQGNN INNPDTLIAQ
VAESSLNFWR YHMLKNNEDF DEPEPLAPDR LTGEEEMDEV IERVSAEFFR DAGDNIMYQD
MKNISDENLP RVMSLIKKMT KKLVTRVSRR TRFSKMKKTI DIRRSIRQNI SYGGIPLELR
YRAKRIQKPR LLLICDVSAS MARYARFVIQ FIYGLSNAVK DIESFIFSED LERITPMFKR
KKGFADTMTE IINQSGIWGQ ATDFNRSLET FGQRYQNLLT SETYLIIMSD TKTLAVEQAA
FRLKQMKKNL RGVIWLNTLP RNEWIQYKSV FIFQQQSRMF ECNTLAHLDK VMRSQIFSV