Gene Information |
Locus tag | Dtox_1328 |
Symbol | |
ID | 8428277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1351979 |
End bp | 1354768 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 645033667 |
Product | pentapeptide repeat protein |
Protein accession | YP_003190831 |
Protein GI | 258514609 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
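The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1351979, 1354768)
print(length, protein_length_aa(length))  # prints: 2790 929
```

Both results match the Gene Length and Protein Length fields reported in the record.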
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000290536 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000145927 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACTTTA TCGGAAAATG GAACTTTTCC ACCCCGGATA ATGCTTTTAT AGGTATTGAC TCTGGGGGAA AGCTGATAAT CTATCCGAAA CCGGTGGATG GAATGTTGAA TTTTAATGCT TATGGCAGCA ACAGCAAATT TATGCTTCAG TCTGCCGTTT TATCCGGAAC AGGCGCCGGA AAATATGTGG AGGGAGCAGG TGATAATTAT CAGGCTTCTC TGGAGCGTGA AGGTGCGGTG TATAACTTTG CGCTGGAGGA TGCAGGAAAT GACAAGATCC GGATTGTGGA TTTCGGTATA AACGGACAAG GAACGGACGT TTATTATCTC AATGTTTCCG GTCAAACCCT TGGGCGTGTC AAAAAAGACA GCAATCCGCC TTCAACCACT CTATTTACTC AAACTGTTGT GACAAACGGT TTGGATTTTA TCCAGAAGTG GGGGGGCGAA GGAGCAGATT TTACATGGGT ATATTTCGGG GGAGTACAAT TTTCAAGCGG GGTCAGGTTT TCCAATGCAA CATTGTCCTA TTGTAATTTT GAAGCTGCCG TTCTGCTCAA TGCCCGTTTT CAGACTACCC AGTTGGATCA TACGAATTTT AAGAAAGCTG CGCTGACAGG TGCTAATTTA AATAATGCCT GCTTAAATAA CAGCGACCTT ACCGGTGCTT TCCTTTCAAA TGCAGTGATG AATCAGATAA AATGCCGGGA GGCAGTTCTT GAAGAAGCGA ACCTTAGCGG GACTTCGTTG AATGATGCAG ACTTCACTTC TGCCAGACTA AGAAAAGCGG ACTTCACCAA GGCGATAGTC AACGGGGTCA ATTTGACCGA TACCGATCTC AGAGAGGTTA AGATGAGCAA CCCCAAAGAT CCCGGACAAT GGACCATCTA TATGAAAAAT GCTGTTATCA GTTCAAAGAC CAACTTTGCC GGAGCGCAAA TGCAGTATTT AAACCTCACA GGTCAAAATC TGGACAACGT TGTTTTTGCC CATACGGATC TGACAGGGTC GGCAATGGAC AATACAAGGC TCAATCATGC AGACCTTAGT TATGCAAACC TGTCAAATGT CAGTATTACC GGAAATATTC CAATGCATGG AGCAAATCTT TCCAATTCAG TTTTAAGCTC AACCAACCTG ACGGGTGCTC AAATGGGCTC TTTGAGCGTG TTGTTCCGTA TTACGGATCA AACGGGGGTT GCAGGGTTTA AAAAAGCGCT GCAGAATTCC GATACGGTAA CGGTGAAAAA GATATTTGCC GATAACGGTG TACCTTTAGA GGGAGATATA GTAATAAACT CATCCCAGTA TGCCCCGGAA AGGGTCTGGG AGGTCCGTAC CGCAACAAAA ACCTATACGG TCAGGCTGGA AATAATAAAT AACGCTGAGG CAATGGTGGT GTATGAGACA GTTACTGCCG CTATAATTGA AAATGCCTAT ATGATGAATG CGGTGCTTAC CTCTGCTAAT CTTTATAATG TCAGAGCCTC AGGAATACAG CTGTTCGGAC CCAAAGCAAA GCTGGACGGG AATGCCATTT TGGAAAAAGC CCAATTTGAC GCCTCAAATT TAAGCGGACT TAATTTAAAA CAAGCCCTGC TTTACGGCAT TAATTTGAAC TATAGCAATC TGGTCGGCGC GCAGCTTCAA GGAGCAAAGC TTGGGCTGGA TTCCGACGGT GGGCAGGCTA CCTTGAAAAA TTCCAATTTA CAGGGAGCGG ATTTCTCGGA TGCGTCCCTT GATTATGCCA TGTTCACCGA TGCCGCTGTT TCTGTGGGCA GTGATGATGG CATGGGTAGA 
CCTGACGGTG TCTGGCTTTT TAGTGCCCCG TCAGACCAGA CCGAGGTTTG CCTGCAAGAG TTGGAAAACA GCAAAAAGCT GTTCAATATG CCGATAGAAA TGGAACAAAA TTTGCAGCCG GGAAAGGTGA GCCCGGAGCT GAGGGATGCA TTTGCACAGC ATGACGTTGA ATTGAGTGCC GATGCGCTTG TTTGCGGTCA GGAGATAGGC TTTCAGTGGC AGATAACCGA TGGAAATGAG CAATATCTTA TTTACGAAGG CTGCGATCAG CAAAAATATA CACCTGCGTT GATCGTAGAT GTCATTTCTA CGTCCGAGAG CTTTCCGATT CCTCTTAGCC TTAAAAGTGA TCTAAAGAAC GGTCCTGTGT CCAATTTAAT TAAAGTTGCT TTCGAAACAT ACGGCTCTAT TAATTTGACC GACCGGGCCT GGCTGACCGT ACAGCCGATA TCTGTTGTAT GGCAGGTGGT GGATTCCAAC ATAAATTATA CTTTGTGGCG CGGTTTGGAT ATGTCCTGTT CTTTAGCCCT GTTTGCCCGC CCTAGTCTCT CAAACGTTAC GGCGTTGTTC GGCTCCCGTT CAGTGGTGCT TAGTCAGCGT GCCCAGGTAA CTGCAAACGG AACCGGTAAA TATATGCTTG ACAATGACAG CAATAATCCG TACAATCCGC TAAATGGTTA TATACGGTTT AATGTGATTA AAAATGGTAA TGTTCTGGAT GTTTATGGGT ATGCTGTTCG AATTTTGGCT ACCGGTGCCG ATAATCAGCA GGAATATAAA AACATTATAT GTGAGGTTAC CAAGCTGTCT GAAAATGAAT TGACGGATCA AACTGTATGT CCGAACAGTG CATTCACCAG GACAAACAAA GCCAATATGC TGCCCTTTCG CCAATGGATG CGTGCCCGTG TACTTCCGCG GCCACCCGTA TGTATACCGT CTCTGGACGG AACCTATTAT TGTCCCAATA GCGTGGAAGG CGGTATGTAG
|
Protein sequence | MNFIGKWNFS TPDNAFIGID SGGKLIIYPK PVDGMLNFNA YGSNSKFMLQ SAVLSGTGAG KYVEGAGDNY QASLEREGAV YNFALEDAGN DKIRIVDFGI NGQGTDVYYL NVSGQTLGRV KKDSNPPSTT LFTQTVVTNG LDFIQKWGGE GADFTWVYFG GVQFSSGVRF SNATLSYCNF EAAVLLNARF QTTQLDHTNF KKAALTGANL NNACLNNSDL TGAFLSNAVM NQIKCREAVL EEANLSGTSL NDADFTSARL RKADFTKAIV NGVNLTDTDL REVKMSNPKD PGQWTIYMKN AVISSKTNFA GAQMQYLNLT GQNLDNVVFA HTDLTGSAMD NTRLNHADLS YANLSNVSIT GNIPMHGANL SNSVLSSTNL TGAQMGSLSV LFRITDQTGV AGFKKALQNS DTVTVKKIFA DNGVPLEGDI VINSSQYAPE RVWEVRTATK TYTVRLEIIN NAEAMVVYET VTAAIIENAY MMNAVLTSAN LYNVRASGIQ LFGPKAKLDG NAILEKAQFD ASNLSGLNLK QALLYGINLN YSNLVGAQLQ GAKLGLDSDG GQATLKNSNL QGADFSDASL DYAMFTDAAV SVGSDDGMGR PDGVWLFSAP SDQTEVCLQE LENSKKLFNM PIEMEQNLQP GKVSPELRDA FAQHDVELSA DALVCGQEIG FQWQITDGNE QYLIYEGCDQ QKYTPALIVD VISTSESFPI PLSLKSDLKN GPVSNLIKVA FETYGSINLT DRAWLTVQPI SVVWQVVDSN INYTLWRGLD MSCSLALFAR PSLSNVTALF GSRSVVLSQR AQVTANGTGK YMLDNDSNNP YNPLNGYIRF NVIKNGNVLD VYGYAVRILA TGADNQQEYK NIICEVTKLS ENELTDQTVC PNSAFTRTNK ANMLPFRQWM RARVLPRPPV CIPSLDGTYY CPNSVEGGM
|
| |
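The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before use. The sketch below is one possible way to strip the spacing, compute GC content, and translate the gene sequence. It assumes the codon assignments of translation table 11 match the standard code (the tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). All helper names are hypothetical.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the Gene sequence field.
head = "ATGAACTTTA TCGGAAAATG GAACTTTTCC"
print(translate(head))  # prints: MNFIGKWNFS
```

The translated prefix agrees with the first block of the Protein sequence field. Passing the full Gene sequence value to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.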