Gene Information |
Locus tag | Cthe_0952 |
Symbol | |
ID | 4811245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1141213 |
End bp | 1142493 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106371 |
Product | dihydroorotase |
Protein accession | YP_001037379 |
Protein GI | 125973469 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
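
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 1141213
end_bp = 1142493

# Assumption: coordinates are inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be the stop codon and does not contribute an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields.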
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0556673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTAATTA AAGGCGGACA TGTTGTTGAC CCGAAAACCA ACACAAACGG TATTATGGAT ATTTTGGTTG AAGACGGTAT AATAACGGAG ATAGGCAAAG ATATTGAAAT TTCAAACGGT GATATAATTT ATGCCGAGGG CAAGCTGGTA CTGCCGGGAC TGGTGGATGC CCATTGTCAT CTGAGAGATC CGGGTTTTGA ATACAAGGAG GATATAGAAA CAGGGACCAT GAGTGCTGCA ATGGGAGGAT TTACTTCCAT AGCGTGTATG CCTAATACGG ATCCTGTCTG TGACAATAAG GCTGTGGTGA AATATATTAT AAACAAGGCA AAACAGGACG GGTATGTTAA TGTATATCCC ATCGGGGCCA TATCAAAAGG ACAAAAAGGC GAAGAGCTTT CAGAGATAGG TGAACTTAAA TTTGCCGGAG CGGTGGCAAT TTCCGACGAC GGAAAGCCGG TAAAAAGTTC TTCACTGATG AAAAGGGCTT TGGAATATTC ATCCATGTTT GACATAGCTG TAATATCCCA TTGCGAAGAT CTGGACCTTG CAGACGGCGG TGTGATGAAC GAGGGCTACT GGTCCACAGT TATGGGACTT AAGGGTATAC CTTCGGCGGC TGAGGAAATA ATGGTGGCAA GGGATATTAT ACTGTCTGAG TACACAAAGG TTCCGATACA CATAGCCCAT GTGAGTACCG AACTTTCGGT GGAGCTTATA AGGAATGCCA AAAAGCGCGG GGTAAAAGTT ACATGTGAGA CTTGTCCTCA CTACTTTGTT CTTACCGATG AGGCTTGCAA AGATTTTAAC ACCCTTGCAA AAGTAAATCC TCCGCTGAGG ACGAGAAGAG ATGTTGAGGC CGTGATTGAA GGACTGAAGG ACGGCACGAT TGACATAATA GCAACGGACC ATGCTCCGCA TCATGCCGAT GAGAAAAATG TTGAATTTAA TTTGGCCGCA AACGGCATGG TCGGATTTGA AACGGCATTG CCTCTGGCGA TAACCTATCT TGTAAAACCG GGGCACCTTA CCATCAGCCA GCTGGTTGAA AAGATGTGCG TAAATCCTTC GAAACTTTTG GGTATCAACA AAGGTACGCT GGAGACAGGC AGAAGCGCGG ATATAACTAT TGTTGACCTG AATGAAGAAT TTGTGGTGGA TGTCAACAAA TTCAAGTCAA AAAGCAAGAA CTCACCTTTT CATGGGTTCA AGCTGAATGG AAGTGTATAT TATACCTTGG TAAACGGCAA TGTTGTTGTC AGAGAAAAGG TGCTGCTTTA G
Protein sequence | MLIKGGHVVD PKTNTNGIMD ILVEDGIITE IGKDIEISNG DIIYAEGKLV LPGLVDAHCH LRDPGFEYKE DIETGTMSAA MGGFTSIACM PNTDPVCDNK AVVKYIINKA KQDGYVNVYP IGAISKGQKG EELSEIGELK FAGAVAISDD GKPVKSSSLM KRALEYSSMF DIAVISHCED LDLADGGVMN EGYWSTVMGL KGIPSAAEEI MVARDIILSE YTKVPIHIAH VSTELSVELI RNAKKRGVKV TCETCPHYFV LTDEACKDFN TLAKVNPPLR TRRDVEAVIE GLKDGTIDII ATDHAPHHAD EKNVEFNLAA NGMVGFETAL PLAITYLVKP GHLTISQLVE KMCVNPSKLL GINKGTLETG RSADITIVDL NEEFVVDVNK FKSKSKNSPF HGFKLNGSVY YTLVNGNVVV REKVLL
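
The sequences above are printed in space-separated blocks of ten letters. A minimal sketch, using hypothetical helper names, of how such a blocked string could be joined and its GC fraction computed. Only the first three blocks of the gene sequence are used here, so the result is not the whole-gene GC content reported in the record.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace that separates the ten-letter blocks."""
    return "".join(blocked.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from the record (excerpt only).
excerpt = join_blocks("GTGTTAATTA AAGGCGGACA TGTTGTTGAC")

# The CDS begins with GTG rather than ATG.
first_codon = excerpt[:3]

print(len(excerpt), first_codon, gc_fraction(excerpt))
```

Under translation table 11, GTG is an accepted start codon and is read as methionine at initiation, which is consistent with the protein sequence beginning with M.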