Gene Dtox_4250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4250 
Symbol 
ID8431264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4418636 
End bp4419766 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content44% 
IMG OID645036442 
ProductBaseplate J family protein 
Protein accessionYP_003193540 
Protein GI258517318 
COG category[S] Function unknown 
COG ID[COG3299] Uncharacterized homolog of phage Mu protein gp47 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACCC TACCCGATTT TTTAACAGAT CAAACTGAGG AAGTTATTTT GCAACGCATG 
TTAAGCAATG TCCCAGCCGA TTTGGATACA AGCGAAGGCA GTTATATTTG GGACTCATTA
AGTCCCGTTG CTATTGAGCT TGCTCTTGCC TACATCCAGG CGCAAGAAAT CCTAAAAAGA
GGTTTTATTG CAACAACATA CGGGGAATAC CTGAAGCTTA AGGCTGCTGA AGATGGTATT
GAGACTCGAT CTGCCGTTAG TGCTACGGGC ACAATAGAAA AAGGAAATCC ACTTAAGATT
GTAGGTACCC CTGGAGCTAA TTTCTCAGTA GGCATTGCTG TGGCTACGCC GGCAGATCTT
GCTACCGGTA CGGTATCAAT AGAGTTTACA ACTATTGGTG AGGTAACCTT AGATGTAAAC
GGGATAGGTT ATGCTGATAT TAAAGCAGTG GTGCCTGGTA AATCCGGTAA CGTTTCGGTT
GGTGCCATTA GTATTTTAAC AAAACCAATA TCAGGCATTA AAAGTGTTAC CAATGAAAAA
CCTACAACAG GTGGTTTAGA TGAGGAAGAT AAAGAGTTGC TGAGAGAGCG CATCTTAAAA
GAATGCCAAA AAGACGAAGG AGACGGCAAC TCAGCTGATT ATGAAATATG GGCTAAAGAA
GTGGCTGGCG TTGGCAATGT ATTAGTTGAG CCACTCTGGC AGGGAGAGGG CACTGTTAGG
GTTGTAATAT TGGACCCTGA TGGAAGAGAT GCGCCCAAAG CTACCGTTGA CGCAGTGCAA
AATCACCTTG ATCCCGGCAG TCTAGGACTG GGCGAAGGAA AAGCCCCTAT CGGTGCACGC
GTCACAGTTG TGACAGCTGA AGTAATAACC ATAAACGCCA CAATTCCAGG GTTAACAGTT
GGAGCCGGGT ATACACTCGA TCAAGGAAAA ACCAATGCAG AAATTTCCCT TAGTAACTAT
TTTAAAAAGA TTAATCCAGG TGGAATCATC AGAACGAAGA AGGCCGAGGC GGAAATTACA
AACGCTCTGG GAGTGCTTGA CATGGGCGAT CTATTACTTG ACGGAAAAAG AGATAATATT
GTTCTTGGAA TTACCCAATT AGCCGCCCTG GGGAGTGTGA TTTATGTATG A
 
Protein sequence
MATLPDFLTD QTEEVILQRM LSNVPADLDT SEGSYIWDSL SPVAIELALA YIQAQEILKR 
GFIATTYGEY LKLKAAEDGI ETRSAVSATG TIEKGNPLKI VGTPGANFSV GIAVATPADL
ATGTVSIEFT TIGEVTLDVN GIGYADIKAV VPGKSGNVSV GAISILTKPI SGIKSVTNEK
PTTGGLDEED KELLRERILK ECQKDEGDGN SADYEIWAKE VAGVGNVLVE PLWQGEGTVR
VVILDPDGRD APKATVDAVQ NHLDPGSLGL GEGKAPIGAR VTVVTAEVIT INATIPGLTV
GAGYTLDQGK TNAEISLSNY FKKINPGGII RTKKAEAEIT NALGVLDMGD LLLDGKRDNI
VLGITQLAAL GSVIYV