Gene Information |
Locus tag | Dtox_3939 |
Symbol | |
ID | 8430954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 4114652 |
End bp | 4115599 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645036157 |
Product | protein of unknown function DUF199 |
Protein accession | YP_003193255 |
Protein GI | 258517033 |
COG category | [S] Function unknown |
COG ID | [COG1481] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00647] conserved hypothetical protein |
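
The coordinates and lengths listed above agree with each other, and the arithmetic can be sketched in a few lines. The function names below are hypothetical, and the sketch assumes inclusive 1-based coordinates and one terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4114652, 4115599)
print(length)                     # 948
print(protein_length_aa(length))  # 315
```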
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000093648 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.595989 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTTT CGGTAGTGAC CAAGGATGAA CTTGCCCGCA TAATTGCTAA AAAGGATTGC TGCAGGTTAG CGGAACTGGC AGCCCTGGTG AAAATGGACG GCAGTATAGA AATAAGCGGC AACCAGAGAC TGGGTTTAAG TGTAGTGACA GAGAGTGCCG GAGTGGCGCG CAAAATACTA ACGCTGATTA AGCAGTTTTT TGATCATAAT ACCCAGGTAA TAGTACGGCG CAAGGCCAGA CTGAAGAAGA ATAATATTTA CCAGGTAAAG GTCAGCTCGG CTCCCGGTAT AAGAGATATT CTGGTTCGTT TGGGTATGCT GGATGAGGAC GGACAGCTGA GTGAGAAAAT AAAAGCCGGG TTAATTACAC GTACCTGCTG TAAAAAAGCG TACCTGAGAG GCGCTTTTTT AGGAGGTGGC TCGGTAAATA ATCCCGAGGG AGATTATCAC CTGGAGATTA TCACTAACAA AGAACAGCAT GCTCTGGATA TAGCCGGTGT AATGCGTAAA TTCGACCTTT CGGCCAAAGT CAGTACCAGA AAGAACTGGT ATGTAGTATA CTTAAAGGAA AGTGAGCAGA TAATTACCTG CCTTAATGTT ATGGGCGCTC ACACAGCCTT GTTGGATTTT GAAAACGCGC GTATATACAA AGACATGCGC AATCAGGTAA ACCGGCTGGT GAATTGTGAG ACAGCCAACT TAAACAAAAC TGTTGACGCA GCGGTTCGCC AGTTAGAGTG CATTAAACTG ATTGAAGGTC TTGTCGGGTT GGAGAAACTG CCGAAAAGCC TGCGCAGGAC AGCGGAACTG AGGCTGCAAA ACCCAGAGGT AAGTCTGAGA GAGCTGGGAG ATTTATTGGA GCCTAAAGCG GGAAAATCTT GCGTTAACCA CCGTATGCGT AAATTAGAGA AGATAGCCGA CGCATTGCGG GCTAAGCGGC AGTATTAG
|
Protein sequence | MSFSVVTKDE LARIIAKKDC CRLAELAALV KMDGSIEISG NQRLGLSVVT ESAGVARKIL TLIKQFFDHN TQVIVRRKAR LKKNNIYQVK VSSAPGIRDI LVRLGMLDED GQLSEKIKAG LITRTCCKKA YLRGAFLGGG SVNNPEGDYH LEIITNKEQH ALDIAGVMRK FDLSAKVSTR KNWYVVYLKE SEQIITCLNV MGAHTALLDF ENARIYKDMR NQVNRLVNCE TANLNKTVDA AVRQLECIKL IEGLVGLEKL PKSLRRTAEL RLQNPEVSLR ELGDLLEPKA GKSCVNHRMR KLEKIADALR AKRQY
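
The protein sequence can be related to the gene sequence with a small sketch like the one below. It is illustrative only: the helper names are hypothetical, the sequence is assumed to be given in coding orientation (it starts with ATG even though the gene lies on the minus strand), and the codon assignments are those of the standard code, which translation table 11 shares (table 11 differs in its permitted start codons). Only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; table 11 uses the same
# codon-to-amino-acid mapping and differs only in alternative start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "ATGTCCTTTT CGGTAGTGAC CAAGGATGAA"
print(translate(fragment))  # MSFSVVTKDE
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the listed protein sequence and GC content to be cross-checked.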