Gene Information |
Locus tag | Dtox_1965 |
Symbol | |
ID | 8428947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 2097401 |
End bp | 2098771 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 645034293 |
Product | hypothetical protein |
Protein accession | YP_003191424 |
Protein GI | 258515202 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
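The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2097401
end_bp = 2098771

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as stated in the record) and ends with
# one stop codon that does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1371, matching "Gene Length"
print(protein_length)  # 456, matching "Protein Length"
```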
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000122492 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000114295 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACGAT CTCCCAGGAG GTTAATCGTT GTATGTATGT TGTTTATCTT GTTAGTATTA ATGTCCTCAC TGGCGGCTGC AGTTGAAGCT TTTGTACAGC TTCCTTCTCA AGAGTTGATG AAGGTGCGTG TCACTGGTAC CAAAAACTTC GGTAATGAAG TTGTGTTCGA CAAAGAGGTG GAGACAAGAG TCAATTCAAC CGCGGGGGAT GCTCTGGGGC AGGCGGCAGA AATCGAAATG TCCGGTGACT ATGTGGAGAC CGTCGCCGGG ATCAAAGGAA ACCAGCAGGT GTACTGGTTT TATTATATTA ACGGTCTGAT GTCAAAGGCT TTTGCCTATG GCTATAAACT GCGTCCGGGG GATGTGGAGA ATTGGGATTT TCACGATTGG ACTTTTTATA TGATGGGACC GTCAGCGATG CTGGGAGCCT TCCCGGAGCC CTGCCTGCAT GGTTACGGGG GCAAGGTGGC GCCTACTATG GTGGTTTTTG CCCCGGGGTT TGAGGAAGAG GCCGCCGGGC TCAGGGACCG GTTGATCGCT CTTGGTGTGT CAGGTGTGAA AATGAAGAAT CAGAACGATT TGACTATTGA TGAGAAGAAA AACGATAATC TGTTTATTAT CGCTACGGCG GACCAGCCGC TTATAGCAAG CATGAATGAG CAGTTCAAGA TTCATGAGCC GGTTTATTTT TCCGACGGTA AAATCAAGAC GCGTGATTTC AGCGGAAATG ACAGTCAAAC ATTTGGGGCG GGGTATGGAG TGCTGAATGT GATCCAAAAC CCATGGAATC CTAAGGGCAG CTGGGCCTGT CAAGGTGCGG TTTGGGCGAT AACCGGTCTT GACGAAACGG GTGTGCGTCG CGCCGCCAAG GTACTGACCG GTTTTCCGAA AGAATTGAGT CATTCTTTTG CTCTTGTTAT CGGCAACGGG GAGGTAATTA AGACCCCGGT GGGGCCCGGT GGGGCCAAAA CAGTGGCTGT CAACACCGAG TCCGGTCTCT CTCCAGTGTC CGGTCCCTCT CCGGTATCCG GGGATTCCCA GACACCGGCT GTAAGCGCGG GGACTGTCGC CCCGGCCCAG GAGCCTGCTA AGCAGGATAA TCAAGAAGAG AATAAAGCTG CGCAAAAAAC CGACGCAGCA GATAAGACAG AGACTGCTGA TAGTTCGGAT ATCTCAAAAT CAAGTGATGA AAATCAATCC TCAGAACTTC CTACTGCTTC AGTTCTTCCC ACACTGAAGG AAAATGTCGC GCGCCATTGG TGGGTTCTGT TTCCGACGGT AGGAGTGGCT GCTGTTCCTG CCTGCTACTA TATAAAGAGG CACCGCAAAC TAAAAGAGAC TGACAATGCC GAGGAGCAGG AGTTGATATG A
|
Protein sequence | MKRSPRRLIV VCMLFILLVL MSSLAAAVEA FVQLPSQELM KVRVTGTKNF GNEVVFDKEV ETRVNSTAGD ALGQAAEIEM SGDYVETVAG IKGNQQVYWF YYINGLMSKA FAYGYKLRPG DVENWDFHDW TFYMMGPSAM LGAFPEPCLH GYGGKVAPTM VVFAPGFEEE AAGLRDRLIA LGVSGVKMKN QNDLTIDEKK NDNLFIIATA DQPLIASMNE QFKIHEPVYF SDGKIKTRDF SGNDSQTFGA GYGVLNVIQN PWNPKGSWAC QGAVWAITGL DETGVRRAAK VLTGFPKELS HSFALVIGNG EVIKTPVGPG GAKTVAVNTE SGLSPVSGPS PVSGDSQTPA VSAGTVAPAQ EPAKQDNQEE NKAAQKTDAA DKTETADSSD ISKSSDENQS SELPTASVLP TLKENVARHW WVLFPTVGVA AVPACYYIKR HRKLKETDNA EEQELI
|
| |
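The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the database's own method: the helper names (clean, translate, gc_fraction) are hypothetical, the codon table is the amino acid assignment shared by NCBI translation tables 1 and 11, and alternative start codons permitted by table 11 are not handled because this gene begins with ATG.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout, last base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the record's sequences into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAAACGAT CTCCCAGGAG G"))  # MKRSPRR
print(translate("GAGTTGATATGA"))             # ELI
```

The two fragments reproduce the first seven and last three residues of the listed protein sequence. Passing the full gene sequence to translate and gc_fraction would allow the 456 aa length and the 51% GC content to be compared against the record in the same way.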