Gene Cthe_0947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0947 
Symbol 
ID4811240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1133998 
End bp1134921 
Gene Length924 bp 
Protein Length307 aa 
Translation table11 
GC content43% 
IMG OID640106366 
Productdihydroorotate oxidase B, catalytic subunit 
Protein accessionYP_001037374 
Protein GI125973464 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.302447 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAGA AAAGTATTGA TTTAAGTGTT GATATTGCAG GTTTGAGGCT TTCAAATCCT 
GTTATAGCAG CTTCCGGTAC TTTCGGATTT GGCAGGGAGT TTGTTGATTA CGTGGATTTA
AATAAAATTG GCGGAATATC GGTAAAAGGA CTTACTCTGG AAAAAAGGCA GGGGAACAGG
CCTCCGAGGA TTGCCGAGAC TCCGGCCGGT ATTCTTAACA GTGTGGGGCT TCAAAATCCG
GGCGTTAGAG CCTTTATAGA AAATGAAATT CCTTTTTTGA GAAAGTATAA TACAAAAATA
ATTGCCAATA TTGCGGGCAA TACTATAGAG GATTACTGCA AGATGGCAGA ACTTTTGTCA
GATGCGGATA TTGACGCAAT AGAACTTAAT GTTTCCTGTC CCAATGTAAA GAAGGGATGT
GTTGCTTTTG GGAATTCTCC TGCAGGAATA AGCGAGATTA CGAGCAAAGT GAAAAAATAC
TGCAAAAAGC CGCTTATTGT TAAGCTTACT CCCAATGTTA CCGATATTAA AGAAATAGCT
GTCGCCGCCG AAGCAGCCGG AGCCGATGCT CTTTCCCTTA TAAACACGAT TCTCGGGATG
GCCATTGACA TACACAGAAA AAGGCCGATA CTTGCCAACA ATGTGGGGGG ACTTTCGGGA
CCTGCGGTAA AGCCCATTGC AGTGAGGATG GTTTATGAAG TTTGCAGTGT TGTCAAAATA
CCCGTTATTG GAATGGGCGG AATATCAAGC GGTGAGGATG CGGTGGAATT CATGCTGGCA
GGTGCAAGCG CAGTGATGGT GGGGACGGCC AATTTTATAA ATCCTGCGGC ATGCATTGAT
GTTGTGGAAG GAATAAAAAA TTACCTTAAA ATGTATAATC ACGGCAGTGT TTATGAAATA
ATAGGAAAGT TACAGCTCAA CTGA
 
Protein sequence
MTEKSIDLSV DIAGLRLSNP VIAASGTFGF GREFVDYVDL NKIGGISVKG LTLEKRQGNR 
PPRIAETPAG ILNSVGLQNP GVRAFIENEI PFLRKYNTKI IANIAGNTIE DYCKMAELLS
DADIDAIELN VSCPNVKKGC VAFGNSPAGI SEITSKVKKY CKKPLIVKLT PNVTDIKEIA
VAAEAAGADA LSLINTILGM AIDIHRKRPI LANNVGGLSG PAVKPIAVRM VYEVCSVVKI
PVIGMGGISS GEDAVEFMLA GASAVMVGTA NFINPAACID VVEGIKNYLK MYNHGSVYEI
IGKLQLN