Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_1963 |
Symbol | |
ID | 8428945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 2091509 |
End bp | 2094538 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 645034291 |
Product | S-layer domain protein |
Protein accession | YP_003191422 |
Protein GI | 258515200 |
COG category | [S] Function unknown |
COG ID | [COG1520] FOG: WD40-like repeat |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000481253 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0221131 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAAAA AGAGGATTTT ACTTCTATTT TTTGCACTTT TGTTTGTTTG CCTGCTGATT CCGGTTGCGG GATGGGCCGC CGGAGGCACA TGGCCGCAGT TTCAAAATGA TCTCTACAAT GATGGACTGA CCACTGATTC GGCTCCGCTG AGCAGCGCTT CTGTAGCCTG GCAGCAGCAG GTTGGGAACA CCTCTATGGC GGGCATTAAC CATACCCCCC TGGTTGCCGG GGGCAGTGTT TTTGCCATGG ATGCATTTGG AAAGTTATGG TCATTTGATA TGAAAACAGG GACTGAGAAA TGGTCCACTC AATTGAGCTG CTCTATAAAG CAATTTCAAC TTTCCACGCC GGCCTATGAC AGCGGCAAGC TGTATGCGGC CACCAACGAT GGGCATGTTT ATGCTCTGGA CGCCGGCAGC GGGAGTATTT TGTGGGATAT ACCGCTGCAG TTGGCATCAA AGTACAGCCA ACTGAACACG CCGGTTAAGT ATGTTGACGG GAAGATTTAC ATTGGAGCCT GGAATCCTGA TTCCGCTTCT AACGAGTTTT ATTATTGCCT GGATGCTGTT ACCGGTGCTC CGGGAATTGG CGGTCAGTAC CAGATGCAGA ATTCGGCTTG CGGAGGTGGA TATTACTGGA CCGGCGCTTG TATTGTGGGT AAGTACATGA TTTTTGGTTC CGAGCGGTCG ACACTTACCT GTCTTGATAA AGATACCGGT GCCTTGATTT CTTCCGTGAG TCTGAAGGTA TACAGCAGCG GAGCCAAGGA AATACGCTCC TCCGTAAGCT ATGATTCCAC AACAGGCATG GTCTTTCTGA CTGATCAGGG CGGTAATTGC TGGTCCTTTA AATTCGACTC GGACAGCGGT AATCTGATCT ATCTGTGGAA CAAGACTATT GATAAAACCA GCACTTCGAC CCCGGCGGTA TATGACGGCA AGCTTTACGT GGGCAGCGGA ACCTATGCTT CCAGGGGCGG TCTTTACTGT CTGGACGAGC AGACAGGAAG CGAGCTTTGG AACTTCATGC CCACCGGTGA CGGGGAAGTT TCGGTACCGG GCGTACAGGC ATCACCTGCA ATAGCCGTGC AAAATGGAAC CCCGTACATT TATTTTGCCT CAATTTGTGA GAATTCTCTG GTTTATTGCC TGGATCAAAA CGGTAATCAA GTCTGGCAGT TTGCCGTTCC CAATTATACT TACACTACTA CCAGCATCGC CGTGGCTGAT GGTTGGTTGT ATTTGGGCAA TGACTACGGG TGGTTGTATG CTTTAAAGGG AACAGCAGTT CCGGTCACGG GAGTTTCTCT TAACAAAACC ACCGACACCA TTACTGTCGG AGCAACAGAC ACGCTCATAG CCAATATCGC TCCGGTTAAT GCCACAAATC AGAGCGTGAC CTGGGTTTCT GACAATACTG CAGCAGCTGC CGTGGATCAG AGCGGAAAGG TTACGGCGGT GGCTCAGGGG ACAGCCAATG TTACTGCTAC CACGGCGGAC GGTGGTTATA CAGCAACTTG TGTTGTAACA GTCACCGGTG GTGGTGGCGG TGGCGGCTCA TCGACAACGA GCAAGGTGAA TCTTGTCATT AAAGATAAAA ACGGAAGTAC TCGCTTTAAT AACAATATAA GTGTTCAGGC GGGAGATACC GTTATGGATG TCTTATTTGC CGCGGCAGAG AAGGATTCGG CTATCGATCC CCAGGTGGAA TGGGAAAACG CCTACATTAT GGGGGGTTAT GTATGGAGCA TCTACGGTGT GGAAAGTCCC TGGGGGAAAA TGTCTGAAGG ATGGGTATTT CAGGTAAACG GTGTCATGTC AAATAAAGGT GCGGCCAAAT ATATCGTCAG CGATGGCGAC AACATTTTAT GGGAATGGAG TGCAATGGAA CCGGTTACGG GAATTACTCT GGACAAGACC AGCAGCACTG TCAATGTGGG CGGCACTGTC CAATTGAATG CCAATATTAA ACCAGCAAAT GCCTCCAATA GGAGTATTAA CTGGACCTCC GACAACACTG CAGTAGCTAC CGTCGATAGT GACGGCAAGG TTACCGGCGT GTCCGCCGGC TCCGCCAAAA TCACTGCCGC CACTGCTGAC GGCGGTTATA AGGCGACCTG TGTGGTTACT GTTCAGGCCG CAGCCGGAGG ATCGGCCTCA ACGACAGGGT CAGGAATAAG CTTGAACAAG ACCACTGACA CGATTAAAAT TGGAGCCACA GATCAATTGA CAGTGACGAT CTCTGCAACC GGTGTTTCAG ATCAGGATAT CAAATGGGCA TCCGACAATA CCACAGTGGC TACCGTCGAT AGCAAGGGCA TGGTTACCGG AGTTTCCGTC GGTACCGCTA AAATCACCGC CGCAACGGCA GACGGCAGAT ATACAGCGAC TTGTGTGGTT ACTGTTCAGT CTGTTGAGCC GGCGCAGCAG GTTCAACAGA CCGTTCAGCC GCAATCTCAA ACTCAATCCC AGTCTGCAGT TGCATTTGAA GACCTGCAGG CAGGCTATTG GGCCAGGGAA GCTATTGAAT ATATGGTTGC CGGAGGTTAT CTGAAGGGAT ATGAGGATGG TACTTTCAGG CCTGATCAGC CCATAACCAG GGCGGAATTT ACTGCTTTAA CGGTGAAAGT AATGGGTTTG CAGGAAGCAG ATGGCAGAGA CATATTTAAG GATGTGCATT CCGGTGACTG GTATTACGAT ATTGTGAACA TCGCCTTTAC ACATGATTTG GTTTCCGGCT ACGGGGATGG CATGTTTGGC CCTAACGAAC CGGTTACCCG GGAACAGGTA GTGTCAATGA TCAGCCGTGT TTTAGCGCAA AAAGAGGGCC AGCAGAAGGA GACAGCAGTA AAAGATGAAA TATTGCAGCA ATTCAATGAT GCCGGGGAGA TTTCCGATTG GGCCCGGCCT GCTGTGGCCA TAGTGATCAA CAAGGGTATA GTCAATGGAT ATGAAGACGG TACCTTCAGG CCGAATTCGC CCGCTACCAG GGCCGAATGT GTAGTAATGC TCAGAAAGTT GCTGCCCTAG
|
Protein sequence | MSKKRILLLF FALLFVCLLI PVAGWAAGGT WPQFQNDLYN DGLTTDSAPL SSASVAWQQQ VGNTSMAGIN HTPLVAGGSV FAMDAFGKLW SFDMKTGTEK WSTQLSCSIK QFQLSTPAYD SGKLYAATND GHVYALDAGS GSILWDIPLQ LASKYSQLNT PVKYVDGKIY IGAWNPDSAS NEFYYCLDAV TGAPGIGGQY QMQNSACGGG YYWTGACIVG KYMIFGSERS TLTCLDKDTG ALISSVSLKV YSSGAKEIRS SVSYDSTTGM VFLTDQGGNC WSFKFDSDSG NLIYLWNKTI DKTSTSTPAV YDGKLYVGSG TYASRGGLYC LDEQTGSELW NFMPTGDGEV SVPGVQASPA IAVQNGTPYI YFASICENSL VYCLDQNGNQ VWQFAVPNYT YTTTSIAVAD GWLYLGNDYG WLYALKGTAV PVTGVSLNKT TDTITVGATD TLIANIAPVN ATNQSVTWVS DNTAAAAVDQ SGKVTAVAQG TANVTATTAD GGYTATCVVT VTGGGGGGGS STTSKVNLVI KDKNGSTRFN NNISVQAGDT VMDVLFAAAE KDSAIDPQVE WENAYIMGGY VWSIYGVESP WGKMSEGWVF QVNGVMSNKG AAKYIVSDGD NILWEWSAME PVTGITLDKT SSTVNVGGTV QLNANIKPAN ASNRSINWTS DNTAVATVDS DGKVTGVSAG SAKITAATAD GGYKATCVVT VQAAAGGSAS TTGSGISLNK TTDTIKIGAT DQLTVTISAT GVSDQDIKWA SDNTTVATVD SKGMVTGVSV GTAKITAATA DGRYTATCVV TVQSVEPAQQ VQQTVQPQSQ TQSQSAVAFE DLQAGYWARE AIEYMVAGGY LKGYEDGTFR PDQPITRAEF TALTVKVMGL QEADGRDIFK DVHSGDWYYD IVNIAFTHDL VSGYGDGMFG PNEPVTREQV VSMISRVLAQ KEGQQKETAV KDEILQQFND AGEISDWARP AVAIVINKGI VNGYEDGTFR PNSPATRAEC VVMLRKLLP
|
| |