Gene Information |
Locus tag | Dtox_1770 |
Symbol | |
ID | 8428741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1872804 |
End bp | 1874960 |
Gene Length | 2157 bp |
Protein Length | 718 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 645034105 |
Product | RNA binding S1 domain protein |
Protein accession | YP_003191247 |
Protein GI | 258515025 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
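The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical, not part of the record.

```python
# Values copied from the record above.
start_bp = 1872804
end_bp = 1874960

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2157
print(protein_length)  # 718
```

Both results match the `Gene Length` (2157 bp) and `Protein Length` (718 aa) rows.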
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.133357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATTG AACAATTGAT AATAAGGCAT GTAGCCGGCA ATACCGGACT GCCTTTAAAA AAAGTTGAGA GCACTGTTCA GTTGTTGGAT GCAGGCAATA CTGTACCTTT TATCTCTCGT TACCGTAAAG AAATGACCGG TGAATTAAAT GAGGTTGAGA TACGCAGTAT TGAAGAACAA TTAAAATACC AGCGCAATCT CCAGCAGCGT AAGGAAGAGG TTGTCAGGCT GATAGAAGAA CAGGGTAAAT TGACGGATGA ACTTCAGCAA AAAATTATGA GTGCCGGTAA ATTAAACATA GTGGAGGATT TATACAGGCC TTATCGCCAG AAGCGCAAAA CAAGAGCCGG TGTAGCCAGA GAGAAAGGAC TTGAGCCTCT TGCCCTTTAT ATTCTGTCTC TTCCTAAAAC CGGTGATCCG GGCTTGGAAG CTCAAATATA TGTTAATGAA GAAAAAGGAA TTCTGACAGC GGAGGATGCT CTGCAAGGTG CCTGTGATAT TATAGCCGAA ACAGTTGCTG ACGATCCGGA GATCAGAGGC TGGGTTAGAA GGTATATTTT CAGAAATGGT TTGCTGGTAA CCGAGGCTAA AGAAAAAGAA ACTCGTTCGG TGTACGAGAT GTATTATGAA TACAGTGAGG CTGTAAGAAC CGTGGCGCCT CATCGCACAC TGGCCGTTAA CCGGGGTGAA AAGGAAGGCA TTTTGAAAGT TAGGATTGAG GTGCCGGAAG AAGAGATATA TGGATACTTG AACCGTCGCT GGTTAAAAAA TCCCGTCTCA GTTACTGTTG CTTATGTACA GAGCGCGGTT ATTGACGGTT ATAAAAGATT GCTCTTTCCT GCCGTGGAAA GAGATGTCCG TAATGAATTA ACTGAAAAAG CTGAGGAGCA GGCAATAAAA ATATTTTCCA AAAACTTGCG CCAACTTCTA CTGCAGCCAC CGGTTAAAGG GGAAGTTGTG TTGGGAGTTG ACCCTGGTTT TCGCACTGGT TGCAAATGGG CAGTGGTTGA TGATACTGGA AAACTATGGG AGGTGGGAGT GGTTTATCCT ACTCCGCCGC AGAAAAAAAT TACAGAAACC AAAGAAATCT TTCGCCAACT GGCTGATCGT TACGGTATAA CTGTTATTGC AATTGGCAAC GGTACTGCTT CCCGTGAGAC AGAACTGGTA GTTGTTGATT TCATTAAAGA ATATGTGAGA CCCTTGCAAT ATATAATTGT CAGTGAGGCA GGTGCCAGTG TTTATTCGGC CTCTGAACTG GCTGCCAGGG AGTTCCCAAA ACTGGATGTG GCTGAACGCA GCGCAGTCTC CATTGCCAGG CGTTTGCAGG ATCCTCTGGC CGAACTGGTA AAAATAGACC CTAAAGCAGT TGGTGTCGGC CAGTATCAGC ATGACGTAGC TCATAAGCGT CTGGAGGAAA GCCTGGCCAG TGTTGTGGAA TCGGTAGTTA ACCATGTCGG GGTAGATTTA AATACGGCTT CTCCGTCTCT ATTGTCACAT GTAGCCGGAG TTAATATGAC CGTAGCTCAA AAAATAGTTG AATTCCGTGA AAATGAGGGT AAATTTAAAA ACCGCAGCCA GTTAAAGAAA GTTCCCCGGT TAGGACCGAA AACCTTTGAA CAATGCGTTG GTTTTCTGAG AATTAGTGAT GGTGAAAATA CCCTGGACAA GACACCCATA CACCCTGAGT CATATGATCA GGTAAAAAAG TTTTTGAAAG AAATTGGCTG TTCTATGATG GAAGTCGGTT CCGGTAAAAT GAAAGAAAGG CTCAAAAATA TAGTTATAGA CGAAATGGCG 
CTTAAATTAG ATATAGGATT ACCAACATTG AAGGATATAA TCGAGAGTTT GGAAAAACCC GGACGTGACC CGCGTGAGGA ATTACCCAAA CCTATCTTTA GAACAGATGT TCTAAAGATA GAAGATTTGC AGGTAGGCAT GGAACTCAAA GGGACAGTAC GAAATGTAGT GGATTTCGGC GCATTCGTAG ATATTGGAGT AAAGGTTGAC GGTATGGTTC ACAAATCTGA ACTGGGTACC AGGCGTTTCA GTCATCCTAT GGATGTGTTG TCTGTAGGAG ATATTGTAAC TGTAAAAGTT CTGTCGGTAG ATCTGGAAAG GCAAAGAGTG GCTCTGACAT TAAATGAAAA ACACTGA
|
Protein sequence | MDIEQLIIRH VAGNTGLPLK KVESTVQLLD AGNTVPFISR YRKEMTGELN EVEIRSIEEQ LKYQRNLQQR KEEVVRLIEE QGKLTDELQQ KIMSAGKLNI VEDLYRPYRQ KRKTRAGVAR EKGLEPLALY ILSLPKTGDP GLEAQIYVNE EKGILTAEDA LQGACDIIAE TVADDPEIRG WVRRYIFRNG LLVTEAKEKE TRSVYEMYYE YSEAVRTVAP HRTLAVNRGE KEGILKVRIE VPEEEIYGYL NRRWLKNPVS VTVAYVQSAV IDGYKRLLFP AVERDVRNEL TEKAEEQAIK IFSKNLRQLL LQPPVKGEVV LGVDPGFRTG CKWAVVDDTG KLWEVGVVYP TPPQKKITET KEIFRQLADR YGITVIAIGN GTASRETELV VVDFIKEYVR PLQYIIVSEA GASVYSASEL AAREFPKLDV AERSAVSIAR RLQDPLAELV KIDPKAVGVG QYQHDVAHKR LEESLASVVE SVVNHVGVDL NTASPSLLSH VAGVNMTVAQ KIVEFRENEG KFKNRSQLKK VPRLGPKTFE QCVGFLRISD GENTLDKTPI HPESYDQVKK FLKEIGCSMM EVGSGKMKER LKNIVIDEMA LKLDIGLPTL KDIIESLEKP GRDPREELPK PIFRTDVLKI EDLQVGMELK GTVRNVVDFG AFVDIGVKVD GMVHKSELGT RRFSHPMDVL SVGDIVTVKV LSVDLERQRV ALTLNEKH
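As a quick sanity check, the first and last codons of the gene sequence can be translated by hand and compared with the ends of the protein sequence. The sketch below is a minimal illustration, not a full implementation of translation table 11: the `CODONS` dictionary is a hypothetical helper that holds only the codons occurring in the two fragments, and the fragments are copied from the start and end of the gene sequence above with the spacing removed.

```python
# Partial codon table: only the codons that occur in the two fragments below.
# These assignments are the same in translation table 11 and the standard code.
CODONS = {
    "ATG": "M", "GAT": "D", "ATT": "I", "GAA": "E", "CAA": "Q",
    "TTG": "L", "ATA": "I", "TTA": "L", "AAT": "N", "AAA": "K",
    "CAC": "H", "TGA": "*",  # "*" marks a stop codon
}

def translate(dna: str) -> str:
    """Translate a DNA fragment codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "")
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

head = translate("ATGGATATTGAACAATTGATA")  # first 21 bases of the gene
tail = translate("TTAAATGAAAAACACTGA")     # last 18 bases of the gene

print(head)  # MDIEQLI
print(tail)  # LNEKH*
```

The head agrees with the start of the protein sequence (`MDIEQLI...`), and the tail agrees with its end (`...LNEKH`) followed by the stop codon.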