Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_1597 |
Symbol | |
ID | 8428561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1674405 |
End bp | 1675685 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 645033930 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003191079 |
Protein GI | 258514857 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAGG TATTAAAGGC CCGGGCCGGA AAAATAACGG AAGAAATGGA AGCCGTTGCG CTGTATGAAC AAGTCGATGT GGAATTTGTT AGAAAAGGAG TGGCAGAGGG ACGCGTTGTC ATTCCCAGAA ATATTAACCG TAAGCCCTTC AGATATTGCG GTATCGGGGA GGGTATGCGG GTTAAGGTCA ATGCTCTGAT CGGCACCTCC AGCAACCGTG ATGATATGGC CATGGAAGCC AGAAAATTGC AAGCAGCACA GGATGCCGGC TGCGACAGCT TCATGGACTT AAGTACCGGC TCCAACATTG ATGACATGCG CAGGCAAACC TTGGACATGG CCAAGGTAGC CGTTGGCTGT ACCACTATTT ATCAAGCGGG ACAGGAAGCC ATTGAAAAAT ATGGCAGCGT GGTAGAAATG CGTCCCAAGG ATATTCTGGA CAATATCGAA AAACAGGCAG CGGAAGGTAT GGATTTTATG GCGATCCACT GTGCATTTAA TAATTCTGTG TTGAAAGTAA TGCAAAAAAC AGGTCGTGTA ACCTGGGTTG TCAGTCGCGG CGGTGCATTC ATCACCGGCT GGATGCTGCA CAACAAAAAG GAAAATCCCC TCTTTGAGCA CTATGATCGT ATTCTGGAGA TATTAAAAGC CTATGACGTC ACTCTGAGCT TGGGCGATGC CATTCGTCCC GGCGCAACGG CAGACTCTCT GGATGGTGCT CAGATGCAGG GACTGCTGGT GCAGGGTGAT CTGGCTAAAC GCGCTCAGGC CGCCGGCGTT CAGGTTATGG TAGAAGGACC GGGGCATGTG CCCCTTAATC ATGTAGAAGC TACCATGAAG CTCCAAAAAA GAATATGCAA TAACGCGCCT TATTTTATCC TGGGTACATT GGCGACGGAT GTTGCCCCGG GCTACGACAA CATTACCGGT GCCATCGGAG GTGCCTTTGC CGGTTCCTGC GGTGCAGATT TCCTCTGCTA CCTGACTCCG GCAGAGCATT TGGGCCTGCC TTTGGAGGAA GATGTGCGTG TCGGCGTCAT CACCACCAGA ATAGCGGCAC AGATTGCCGA TGTAGCCAGA GGTCACAAAC AGGCTATTGC AAGAGAAAAT GAAATGGCCA GTGCTCGTGT AGCCATGGAT ATTGACCGTC AGATCAAAGC GGCTCTGGCG CCGGATAAAT TGATTGCCGC TAAGGAAAAG GGCTGCGGAC AGCACTTGTG TGCAGCCTGC GGTAAAGACT GCGCTGTTCA GGAAGCCGCC CGTTATTTTG GTATCTCGTA G
|
Protein sequence | MTQVLKARAG KITEEMEAVA LYEQVDVEFV RKGVAEGRVV IPRNINRKPF RYCGIGEGMR VKVNALIGTS SNRDDMAMEA RKLQAAQDAG CDSFMDLSTG SNIDDMRRQT LDMAKVAVGC TTIYQAGQEA IEKYGSVVEM RPKDILDNIE KQAAEGMDFM AIHCAFNNSV LKVMQKTGRV TWVVSRGGAF ITGWMLHNKK ENPLFEHYDR ILEILKAYDV TLSLGDAIRP GATADSLDGA QMQGLLVQGD LAKRAQAAGV QVMVEGPGHV PLNHVEATMK LQKRICNNAP YFILGTLATD VAPGYDNITG AIGGAFAGSC GADFLCYLTP AEHLGLPLEE DVRVGVITTR IAAQIADVAR GHKQAIAREN EMASARVAMD IDRQIKAALA PDKLIAAKEK GCGQHLCAAC GKDCAVQEAA RYFGIS
|
| |