Gene Information |
Locus tag | Cthe_0947 |
Symbol | |
ID | 4811240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1133998 |
End bp | 1134921 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106366 |
Product | dihydroorotate oxidase B, catalytic subunit |
Protein accession | YP_001037374 |
Protein GI | 125973464 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein |
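The listed lengths can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 1133998
end_bp = 1134921

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.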
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.302447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACTGAGA AAAGTATTGA TTTAAGTGTT GATATTGCAG GTTTGAGGCT TTCAAATCCT GTTATAGCAG CTTCCGGTAC TTTCGGATTT GGCAGGGAGT TTGTTGATTA CGTGGATTTA AATAAAATTG GCGGAATATC GGTAAAAGGA CTTACTCTGG AAAAAAGGCA GGGGAACAGG CCTCCGAGGA TTGCCGAGAC TCCGGCCGGT ATTCTTAACA GTGTGGGGCT TCAAAATCCG GGCGTTAGAG CCTTTATAGA AAATGAAATT CCTTTTTTGA GAAAGTATAA TACAAAAATA ATTGCCAATA TTGCGGGCAA TACTATAGAG GATTACTGCA AGATGGCAGA ACTTTTGTCA GATGCGGATA TTGACGCAAT AGAACTTAAT GTTTCCTGTC CCAATGTAAA GAAGGGATGT GTTGCTTTTG GGAATTCTCC TGCAGGAATA AGCGAGATTA CGAGCAAAGT GAAAAAATAC TGCAAAAAGC CGCTTATTGT TAAGCTTACT CCCAATGTTA CCGATATTAA AGAAATAGCT GTCGCCGCCG AAGCAGCCGG AGCCGATGCT CTTTCCCTTA TAAACACGAT TCTCGGGATG GCCATTGACA TACACAGAAA AAGGCCGATA CTTGCCAACA ATGTGGGGGG ACTTTCGGGA CCTGCGGTAA AGCCCATTGC AGTGAGGATG GTTTATGAAG TTTGCAGTGT TGTCAAAATA CCCGTTATTG GAATGGGCGG AATATCAAGC GGTGAGGATG CGGTGGAATT CATGCTGGCA GGTGCAAGCG CAGTGATGGT GGGGACGGCC AATTTTATAA ATCCTGCGGC ATGCATTGAT GTTGTGGAAG GAATAAAAAA TTACCTTAAA ATGTATAATC ACGGCAGTGT TTATGAAATA ATAGGAAAGT TACAGCTCAA CTGA
Protein sequence | MTEKSIDLSV DIAGLRLSNP VIAASGTFGF GREFVDYVDL NKIGGISVKG LTLEKRQGNR PPRIAETPAG ILNSVGLQNP GVRAFIENEI PFLRKYNTKI IANIAGNTIE DYCKMAELLS DADIDAIELN VSCPNVKKGC VAFGNSPAGI SEITSKVKKY CKKPLIVKLT PNVTDIKEIA VAAEAAGADA LSLINTILGM AIDIHRKRPI LANNVGGLSG PAVKPIAVRM VYEVCSVVKI PVIGMGGISS GEDAVEFMLA GASAVMVGTA NFINPAACID VVEGIKNYLK MYNHGSVYEI IGKLQLN
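The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly even though the gene lies on the minus strand. The following is a minimal sketch of such a check, not the method used to produce this record. The `translate` helper and the compact codon table are written for illustration. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits, so the standard amino-acid string is used here.

```python
# Compact codon table: codons are ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    # Drop the spaces that split the sequence into blocks of ten bases.
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(ch) for ch in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence above.
head = translate("ATGACTGAGA AAAGTATTGA TTTAAGTGTT")
tail = translate("AAGTTACAGCTCAACTGA")
print(head, tail)
```

The two fragments translate to the first ten and last five residues of the listed protein, followed by the stop codon. Passing the full gene sequence to the same helper would allow the whole protein to be compared.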