Gene Dtox_4228 details


Gene Information

Locus tag: Dtox_4228
Symbol:
ID: 8431242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4398482
End bp: 4400266
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 47%
IMG OID: 645036420
Product: phage uncharacterized protein
Protein accession: YP_003193518
Protein GI: 258517296
COG category:
COG ID:
TIGRFAM ID: [TIGR01630] phage uncharacterized protein (putative large terminase), C-terminal domain
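
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the gene sequence below does end in TGA). The function names `span_length` and `coding_codons` are illustrative and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4398482
END_BP = 4400266
GENE_LENGTH_BP = 1785
PROTEIN_LENGTH_AA = 594


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 4400266 - 4398482 + 1 gives 1785 bp, and 1785 / 3 - 1 gives 594 aa, which matches the reported gene and protein lengths.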


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.76146
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAG GCCGCAAGGC TGATATCGTT TCACTGCTGT ATGAGGCCAT GGAGGAAAAT 
GAAAAACGTA GGCACGGCCA TGAGAAGCAG CAAAACGATG AAGCCAGAAT GCTTTTTGAA
GAGTATGTAA AACGAGATTC AAGCCCGGCA AGATTGGAAC TATGGAAATC TTATCAGGCT
GAGGCTCCGC TTATTGGTTC GGACGGGCTG AGGAAAAAGC TCGGGGCCAT GGATCTAGAG
TATTTTGGCA GGGCTTATCT TCATCATTAT TTCACTCGGG AAACTCCTGA ATTTCACCGG
GAGTTAGACC GGATTTGGCA ACAAGGCGTG CTTAAAGGGA TACTGCAGCT GACAGAGAAA
ACGGTGGCCA AGATCCGGCG GTTGCCAGGC TGCCGCCGGG CGGTGGCCGC ACCCAGGGGC
CACGCCAAGA GTACCAACCT GACCTTTAAG GATACCTTGC ACGCCATAGT CTATGAATAT
AAGCCCTATA TACTGATACT GTCAGATTCA TCCGATCAGG CGCAGGGATT CTTGTCAGAT
ATCCGGGGGG AATTGGAAGA GAACCTGGCC ATCAGGGAAG ATTTCGGAGA CCTTCAAGGG
AAGAAAGCCT GGCGTGAAGA TGTACTGATG ACCTCCACGG ATGTGAAGAT TGAGGCCATC
GGCAGCGGTA AGAAAATCCG GGGCCGACGC CATAAAAACT GGCGGCCTGG GCTGATTGTA
TTGGATGATA TTGAAAACGA TGATAATGTC CGGACGCCGG AACAAAGAAA GAAGCTGGAA
AATTGGTTCT TTAAAGCAGT GAGTAAGGCT GGTGACGACT ACACTGATAT TGTGTACATC
GGCACCATTT TGCATTACGA CTCCCTTCTT TCCAAGGTGC TTAAGAATCC GGCCTATAAG
TCAGTAAAAT ACCGGGCGAT CATCTCCTGG TCCGAACGCA AAGACCTGTG GGAAAAATGG
GAAGACATTT ATATTGATCT GGACAATGAA AATCGGGAGC AAGATGCCAG GGCATTTTTT
GAGGCCACTA AAGATGAAAT GCTAAAAGGT ACCCGGGTTT TATGGGAAGA TAAGCTTTCC
TATTATGCTC TTATGGTGAT GCGGGTTTCT GAGGGTGAAG CCAGCTTTAA CTCTGAGGAA
CAAAACGAGC CTATTAATCC AGAAGACTGC CTGTTCAACG AAGAGTGGTT CGAATATTAT
AACGAGGCTG CCATTGATTT CAGGGAAAAA CGTTTCCGTT TCTTTGGCTT TGTTGACCCC
TCTTTGGGGG GCAAGGGCAA GAAGAAGAAA AGCGACTTTT CCACAATCAT TACTTTGGTC
AAGGATGGCC AGACCGGTTA TATGTATGTG CTTGATGCCG ATATCGAAAG ACGCCACCCG
GACAGGATCA TCGAAGACAT TATGGAAAAG GAACGCTGGC TGAAGCTGAC ATTTGGCCGG
GGATATTTCC AATTCGGCTG TGAGACAAAC CAGTTTCAAT GGTTTTTAAA AGAAGAATTG
GCCAGGCGCA GCGCTGAAGC CGGTATTTAC CTCCCCATCG AGGAGGTAAA TCAAACCAGC
GATAAATATG GACGAATCCA GACTTTGCAG CCTGATATAA AAAACAGGTA CATTAAATTT
AACATCCGGC ATAAGCGTCT TTTGGAGCAA CTCAGGCAAT TTCCCATGGC GGCCCATGAT
GATGGGCCGG ATGCCCTGGA AGCATGCCGA ACTCTGGCCA GATCTAAACA ACAGGTTGAC
CAGGGCTTGC TGAATGTATT TAAAAAACTT CGGATATATG GGTGA
 
Protein sequence
MKKGRKADIV SLLYEAMEEN EKRRHGHEKQ QNDEARMLFE EYVKRDSSPA RLELWKSYQA 
EAPLIGSDGL RKKLGAMDLE YFGRAYLHHY FTRETPEFHR ELDRIWQQGV LKGILQLTEK
TVAKIRRLPG CRRAVAAPRG HAKSTNLTFK DTLHAIVYEY KPYILILSDS SDQAQGFLSD
IRGELEENLA IREDFGDLQG KKAWREDVLM TSTDVKIEAI GSGKKIRGRR HKNWRPGLIV
LDDIENDDNV RTPEQRKKLE NWFFKAVSKA GDDYTDIVYI GTILHYDSLL SKVLKNPAYK
SVKYRAIISW SERKDLWEKW EDIYIDLDNE NREQDARAFF EATKDEMLKG TRVLWEDKLS
YYALMVMRVS EGEASFNSEE QNEPINPEDC LFNEEWFEYY NEAAIDFREK RFRFFGFVDP
SLGGKGKKKK SDFSTIITLV KDGQTGYMYV LDADIERRHP DRIIEDIMEK ERWLKLTFGR
GYFQFGCETN QFQWFLKEEL ARRSAEAGIY LPIEEVNQTS DKYGRIQTLQ PDIKNRYIKF
NIRHKRLLEQ LRQFPMAAHD DGPDALEACR TLARSKQQVD QGLLNVFKKL RIYG
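
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard elongation codon assignments, which translation table 11 shares with the standard code, and it ignores table 11's alternative start codons because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative. Only the first and last blocks of the gene sequence are used as inputs.

```python
# Codon table built from the NCBI-style compact layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAAAAAAG GCCGCAAGGC TGATATCGTT"))  # MKKGRKADIV
print(translate("CGGATATATG GGTGA"))                  # RIYG
```

The two outputs match the first ten and last four residues of the protein sequence listed above.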