Gene Dtox_4291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4291 
Symbol 
ID8431305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4458917 
End bp4460983 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content38% 
IMG OID645036483 
Producthypothetical protein 
Protein accessionYP_003193581 
Protein GI258517359 
COG category 
COG ID 
TIGRFAM ID[TIGR02591] CRISPR-associated protein, Csh1 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTTGC CCCAACTTAC CGCCTGCATA GGTGACTCTC TTAAAAATGC AAATCATAAT 
TTATTAACTT CATTAATTAA AAGGCTAAAA AAAGTTAAAG AAACGGAAAC TGCTTATTCC
GTTTTTTTAA TTTTTGATTT AGAAGCGAGG GAAATACGCT TTTGTCTGAA CAAGGAGCTT
TCAGAAAGCT GCGTTGATAA GTACTTCTAT TTTGGCAATA ACTCGGCGGC CTCATCTCAA
TACTATCTTG CTCGCGAGAC GAAATCCCTT AATTACCTGC TTACATCTGT TCTCAGTGAT
CTCTACTCAA TGCTTGGTAA ATACAATATG CCAATCGGTG AATTAGCCTC AATCATAAAG
AGAATGGAAT CCTCCGGCTT AATACAGTTA GCTCCCAAAA AAGGAGATGG TAGGGTTAAT
TTGGGTGAGT TTTCAATAGT TAAGGGGAAC GACAAATTAA ACGTAGCCTT TGATAATAAA
GCTTTAACTA TCGATGGGCA CAATTATGGT TTTGAAGCGG TTATCCGGTT GTTTATTAAT
GATGATAATA AAAAGAACCG GTATGTTCTG GTAATTCCTG TGGTAAAGCT GGAGAACGGT
GAGGAAATAA TTCTTTCTAC CCACCCGGAG TATTTAGAGC TTGTTCGGCA GGCCAATAAT
TTAGGGGACG AGCCTCAAAC CGGAAAAGAC GGCGGGCGCG TTTGCTATGT TTGTGGCAGC
AAAAAGTCCG GTGTTTCCAG TGATTACTCT GCAAAATTCA GCCGTTCAGG AATCAATAAA
ATATTCACTA CTACTACCAT CAACACATCC CCCTACCTGC AAAACAATAA TTATGATCAA
ATCTATTCGA TGTGTACAAG TTGTTACCAA AAAATATTGC ATGGTGAAAA AATAATCACG
GAACAATTCC GCAGTAGAAT AGCCGGTGAA GATGTCTTCA TTATCCCCGA GGGTTTGACC
GCTTCATTTA ATTATAACTT TTTATTCAGG TTGAAAAATG GAGTGGATCT TGCTTTCAAT
ACTAATATTG CAAATAAGTG GTTAGATGAT TTGGAAGGTG CGCTTGATTT TGATCAGGTG
CAATTATATT CGTTAAATTT TTTGTTTTAC CGCACTGACG GAAATTCCGT TTCCATTTTG
GAAACCATTG AGGATGTACC GACCTTACGT TTTGTTAAAG TGATGCGAAT TTTGGCAGAA
AAAACATTCG ACATGGAACA TCAATTAAGG GAATTTTCCA TTGGTCAGAT ATACCGAATA
ATACCTGTGC GAACGAATAA AAAAGGTGAA CAGTTGGATA TCGGCAATGT ATTGTCTCTG
TATAAAGCCC TTTTGTTAGG CGAGCAGATC AGAAGTGCGG CTTTAATTAA CTACGCTGCC
GATGCTCTGG ATAAGGGAAT GCGCCAGTTA AGTAAGGATA AAGCTGACAA TTATCAAAAT
ATGGGTTTAA GGTATTATGC CGGGGGACGG GAAGACTTCT TTATTAAAAG AATAATCATG
AGTTACCTGG TTTTGATTGA GACCTGTCAA CAGCTTAATA TTCTGGATAA GCCGGTTTTT
GATTTTAATG GGGAGGGGGC AAACCACTTG GATAAAATTA ATACTGCATC GGAGAAAGTT
AATTCGTCTA TCGAAGCAAT GGAAAAATTT CTTGATGATA GAAAATTTGA TAAAGAAGCA
AGAGCATTAT TTTACCTGGG GGTATTGATT AACCGTGTAG CAATTGCGCA GTTAGAAAAG
GAGCATAAGA AAAAACCGGT ATTAAAGAAA ATACAGTTTC AAGGAATGAA AAGCAAAGAA
GTTTACCGGT TATATCTGGA TGTGCTGGAA AAACTGCAGC AGTATGATAG GTTTTCACTT
TTTGCTGAAG CTGTGTTAAA CCGTTTACAT TACTATGGTT CTTTTAATCA TACAGAGATG
CTTGGCGAGC GGGAAAATGT CTTTTTCATA ATGTCAGGGT ACGCTTATCT GGTGGGAACA
AAGACTCCGG ATATTACTAA GGGCGAAGAA GACATAATGG CGGACAGTAT TGAAGAATCT
GATCATAATG AAACTCCAAT CAGTTAA
 
Protein sequence
MNLPQLTACI GDSLKNANHN LLTSLIKRLK KVKETETAYS VFLIFDLEAR EIRFCLNKEL 
SESCVDKYFY FGNNSAASSQ YYLARETKSL NYLLTSVLSD LYSMLGKYNM PIGELASIIK
RMESSGLIQL APKKGDGRVN LGEFSIVKGN DKLNVAFDNK ALTIDGHNYG FEAVIRLFIN
DDNKKNRYVL VIPVVKLENG EEIILSTHPE YLELVRQANN LGDEPQTGKD GGRVCYVCGS
KKSGVSSDYS AKFSRSGINK IFTTTTINTS PYLQNNNYDQ IYSMCTSCYQ KILHGEKIIT
EQFRSRIAGE DVFIIPEGLT ASFNYNFLFR LKNGVDLAFN TNIANKWLDD LEGALDFDQV
QLYSLNFLFY RTDGNSVSIL ETIEDVPTLR FVKVMRILAE KTFDMEHQLR EFSIGQIYRI
IPVRTNKKGE QLDIGNVLSL YKALLLGEQI RSAALINYAA DALDKGMRQL SKDKADNYQN
MGLRYYAGGR EDFFIKRIIM SYLVLIETCQ QLNILDKPVF DFNGEGANHL DKINTASEKV
NSSIEAMEKF LDDRKFDKEA RALFYLGVLI NRVAIAQLEK EHKKKPVLKK IQFQGMKSKE
VYRLYLDVLE KLQQYDRFSL FAEAVLNRLH YYGSFNHTEM LGERENVFFI MSGYAYLVGT
KTPDITKGEE DIMADSIEES DHNETPIS