Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_3544 |
Symbol | |
ID | 8430539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 3738192 |
End bp | 3739532 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 645035764 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003192882 |
Protein GI | 258516660 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGATG ATACGTCGTT TAGTATTATT GTTGCAGTAC AAATTATATT CGCGTTGTTT TTGGTTTTTT TAAATGGTAT CTTTGTGGCT GCGGAATTCT CCTTTGTTAA AGTAAGACCT ACACGTCTAG CGCAACTGGC AGATGAAGGA AACCGGAAAG CAAATATTGC TCAAACTATC ATAAGCAATA TAGACGCATA TCTTTCAGTG TGTCAACTTG GTATAACCCT GGCAAGCCTT GGCTTGGGTT GGCTCGGGGA ACCAGTGGTA GCTAAAATTA TAGAACCTGT TTTGGGTTAC TTAGGGGTAT TTTCGTCAGA TGTGCTGCAT TACATTTCTT TTGTAATTGC ATTTAGTTTA GTTACGTTAA TGCATGTAGT GTTTGGTGAA CTGGTACCAA AATCTTTGGC AATCCAGAGA GCAGAAAAAA TAGCCCTGTA TCTGGCAACA CCTATGCGTA TATTTTATTA TCTTTTTTAT CCGGGGATAA TCGTTTTTAA CGGTACAGCC AATTCAATAT TGCATATCAT TGGTATTCAA CGTACCAGTG AACATGAGGC AAGTCATAGT GAGAAAGAAC TGCAAATGCT TGTTTCCGAA AGTTACAAAT CCGGACATTT AGATAAAGAT GAGTGGAGAT TACTTCAAAA TGTTTTTGAA TTTGAGAAAA GAATTGCCAG AGAAATATTG GTTCCACGTC CGGAAGTAGT TTTTTTAGAT AGAAGAAAAA CCCTACAGCA AAATATTGAG ATAGCACGGC AATCGGAACA TACTCGTTTT CCTCTTTGTG ACGGAGATAA TGACAATGTG GTTGGTCTTA TACACATCAA AGACCTTTTT AAGCTAAAAG ATGAAACTAG CATTAATGAT GTTAAACGCA ATATTATGAT GGTACCAGAG GGAATTCCAT TAGACAGATT ACTCAAGCAA TTCCAACAAT GCCGCCAGCA GATGGCTTTG GTAGTAGATG AATACGGCGG TACAAGTGGT ATAGTTACTA TGGAAGATGT TTTGGAAAAA TTGGTAGGTG AGATTCATGA TGAGTTTGAC AATGAGATTC CAAAAATAAT CCCAGAAAAA GAAGGGACTT TTCTTGTAGA GGGTCGATTG CTTCTGGAAG AAGCTAAAGA AATGTTTCAC CTACCTGTAA CTGAGGATAC AGAATATGAT ACTATTGGCG GTTATGTTTT TGGTGAACTC GGCAAACGCC CCAAGGTTGG AGATATTGTG GAACTACCTA ATCACCGGCT AGAGGTAACC AGAATTCAAG GACTCCGTAT CCAACAAATT CGTTTAAATA TCCTTGATAA TAAGTTAAAT AGAGATATTC ATGCAGTATA A
|
Protein sequence | MGDDTSFSII VAVQIIFALF LVFLNGIFVA AEFSFVKVRP TRLAQLADEG NRKANIAQTI ISNIDAYLSV CQLGITLASL GLGWLGEPVV AKIIEPVLGY LGVFSSDVLH YISFVIAFSL VTLMHVVFGE LVPKSLAIQR AEKIALYLAT PMRIFYYLFY PGIIVFNGTA NSILHIIGIQ RTSEHEASHS EKELQMLVSE SYKSGHLDKD EWRLLQNVFE FEKRIAREIL VPRPEVVFLD RRKTLQQNIE IARQSEHTRF PLCDGDNDNV VGLIHIKDLF KLKDETSIND VKRNIMMVPE GIPLDRLLKQ FQQCRQQMAL VVDEYGGTSG IVTMEDVLEK LVGEIHDEFD NEIPKIIPEK EGTFLVEGRL LLEEAKEMFH LPVTEDTEYD TIGGYVFGEL GKRPKVGDIV ELPNHRLEVT RIQGLRIQQI RLNILDNKLN RDIHAV
|
| |