Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_1684 |
Symbol | |
ID | 8428650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1771802 |
End bp | 1773097 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 645034017 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003191164 |
Protein GI | 258514942 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000247869 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATTATA CTACTCAAAT GGACGCTGCC CGTCAAGGAA TTGTCACCGG GGAAATGGAA GAGGTGGCTC GCAAGGAGTT AATGGATGTA TCAGTTTTGC GGGAACTGAT CGCGGAGGGT AAGGTGGTTA TACCCGCCAA TAAAAATCAT ACTTCCCTAA AAGCTTGCGG GATTGGGCAG GGCTTAAAAA CAAAAATTAA CGTTAATCTA GGTGTCTCCA AAGACTGCTG CAGTATTGAG TCTGAGATGG AAAAGGTTCG GCGTGCCATT GAACTGCAGG CGGATGCCAT TATGGATCTT AGCTGTTACG GAAAAACCGA AGAATTTAGG CGCAGACTGG TGGAAATATC ACCGGCGGCT GTTGGCACCG TGCCTGTCTA TGATGCTGTC GGTTTTTACG ATAAGGAATT GAAAGAAATA ACAGCCGGGG AGTTTCTGGG AGTGGCTGAG AAACATGCCC AGGACGGGGT TGACTTTATG ACCGTACACG CAGGGATTAA TCGGGAGACT GCCGGCCGGT TCAAGAAAAA CCCGCGTTTG ACCAATATTG TTTCCAGAGG TGGGGCATTG CTTTATGCCT GGATGGAACT TAATGATCGA GAAAACCCTT TCTTTGAGTA TTATGACGAA CTTCTGGATA TTTTCAGAAA GTATGATGTC ACTATCAGCC TGGGTGATGC CTGCCGGCCG GGCAGTATTA AAGATGCTAC TGATGCCAGC CAGATTCAGG AATTGATTAT TCTTGGTGAA TTGACTAAGC GGGCCTGGGA GAAAGATGTT CAGGTGATGA TAGAAGGACC CGGTCATATG GCTTTAAACG AAATTGTCCC GAACATGCTT TTGGAGAAGA AGTTATGCCA CGGTGCTCCT TTTTACGTCC TGGGACCGCT GGTTACCGAT GTAGCTCCCG GTTACGACCA TATCACCAGT GCCATCGGCG GGGCCATCGC TGCTGCCAAT GGGGCGGATT TCCTCTGTTA TGTAACTCCG GCGGAGCACC TGCGGCTGCC CACCCTGGAA GATATGAAAG AGGGTATCAT CGCCTCCCGT ATTGCTGCCC ACGCGGCCGA CATAGCCAAG GGAGTTCCCT GTGCCAGGCA GTGGGATGAT AATATGAGCG AAGCCAGGCG CAATTTGGAC TGGCAGAGAA TGTTTGAGTT GGCCCTGGAT CCGGAGAAGG CCAGGAACTA CAGGTCACAA TCCCAGCCTG AGAACGAGGA CACCTGCACC ATGTGCGGCA AAATGTGTGC TGTACGTAAT ATGAATAAGG TGTTGGACGG GTCGGAGCCT ATTTAG
|
Protein sequence | MNYTTQMDAA RQGIVTGEME EVARKELMDV SVLRELIAEG KVVIPANKNH TSLKACGIGQ GLKTKINVNL GVSKDCCSIE SEMEKVRRAI ELQADAIMDL SCYGKTEEFR RRLVEISPAA VGTVPVYDAV GFYDKELKEI TAGEFLGVAE KHAQDGVDFM TVHAGINRET AGRFKKNPRL TNIVSRGGAL LYAWMELNDR ENPFFEYYDE LLDIFRKYDV TISLGDACRP GSIKDATDAS QIQELIILGE LTKRAWEKDV QVMIEGPGHM ALNEIVPNML LEKKLCHGAP FYVLGPLVTD VAPGYDHITS AIGGAIAAAN GADFLCYVTP AEHLRLPTLE DMKEGIIASR IAAHAADIAK GVPCARQWDD NMSEARRNLD WQRMFELALD PEKARNYRSQ SQPENEDTCT MCGKMCAVRN MNKVLDGSEP I
|
| |