Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_3940 |
Symbol | |
ID | 8430955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 4115635 |
End bp | 4116921 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645036158 |
Product | protein of unknown function UPF0052 and CofD |
Protein accession | YP_003193256 |
Protein GI | 258517034 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000112442 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.62962 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAACAG CTAAATGGTT GTACCCCGGT ATGAAGGTTA AGAGATGGTT ACTTCTGTCT CTGGTGGGAA TTGTTTTAGT TTTAGCCGGT TTTTCCTTGA TGCACGGATC GGTATTTAAC CGGCCGGAAA TACTTGTCTC AAAAATAATT CATGATATGC TGGATTTTGT GCCTTCTCCG AGGAGCGGTT TGTTTATTTT TCTTGCCGGC TTGATTTTTC TGATCTGGGG TTTGCTGCAG GTCTTTAAGT CGGTATTGAG TGCCGTTATG ACGGACAGAG AGGGCAGGTT GGTGGATATT ATTTTTTTCA GGCGCTATTT AAGGCGCGGG CCAAAGATTG TGGCTATAGG AGGCGGCACC GGTCTTTCGG TAATGCTCAG GGGGCTGAAA AACTATACCA GTAATATTAC GGCAATTGTT ACTGTGGCGG ATGACGGCGG GAGTTCCGGT CGTTTGCGGG GTGATCTGGG TATTTTGCCG CCGGGTGATA TCCGCAGCTG TCTGGCGGCT TTAGCCGATA AGGAAGACTT GATGGAGCAA ATGCTGCGGT ACCGCTTTAA TTCGGGTGAA CTGGCCGGTC ATAATTTGGG CAATTTATTT TTGGCGGCTT TGAATGATAT GTCAGGCGGG TTTGACAGCG CCGTGCGCAG TTTGAGTAAA GTCCTGGCTA TCAGGGGGCA GGTGTTGCCG GTAACTCTGC AGAATGTTAA CCTGGCAGCT GATTTGGAGG ACGGCACCAC TATTTACGGG GAGTCTTCGA TTTGCAAGAG CCAAAAAAGA ATTAAAAGGG TATACCTGTA TCCCGCCAAT TGTTTGCCCT TGCCTGAGGC TCTGGAAGCT ATTAAAGAAG CTGATGCCAT TATACTGGGA CCGGGCAGTC TGTATACCAG TATTATACCT AATTTATTGG TTATAGGCAT TCCGGATGCC ATTATGGAAT CTGAGGCCGT TAAAATTTAT GTCAGTAATG TGATGACACA GCCGGGTGAA ACGGACGATT TTTCTGCTAC GGATCACTTG CAGGCGATTA TAAGTCATGG CGGGCCGATT ATCGATTATA TGATTGTCAA CCGGCAGGAA ATCCCGTCGC ACTTATTGAA GAAATACCGC ATGGAGGGTT CACAGCCTGT AAGGTGTAAT ATTAAAGAGG CGGAGAAACT GGGTGTTAAA GTTGTGATAG ATAAATTGGT GCATGAGACT GATGTGGTCA GACACCACCC TGACAAGTTG GCTGCTGCGA TTATGAGACT GCTTCTGGTG TTTAGAAAGA AAAATAAGTT CAGGTAA
|
Protein sequence | MITAKWLYPG MKVKRWLLLS LVGIVLVLAG FSLMHGSVFN RPEILVSKII HDMLDFVPSP RSGLFIFLAG LIFLIWGLLQ VFKSVLSAVM TDREGRLVDI IFFRRYLRRG PKIVAIGGGT GLSVMLRGLK NYTSNITAIV TVADDGGSSG RLRGDLGILP PGDIRSCLAA LADKEDLMEQ MLRYRFNSGE LAGHNLGNLF LAALNDMSGG FDSAVRSLSK VLAIRGQVLP VTLQNVNLAA DLEDGTTIYG ESSICKSQKR IKRVYLYPAN CLPLPEALEA IKEADAIILG PGSLYTSIIP NLLVIGIPDA IMESEAVKIY VSNVMTQPGE TDDFSATDHL QAIISHGGPI IDYMIVNRQE IPSHLLKKYR MEGSQPVRCN IKEAEKLGVK VVIDKLVHET DVVRHHPDKL AAAIMRLLLV FRKKNKFR
|
| |