Gene Cthe_0952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0952 
Symbol 
ID4811245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1141213 
End bp1142493 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content44% 
IMG OID640106371 
Productdihydroorotase 
Protein accessionYP_001037379 
Protein GI125973469 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0556673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTAATTA AAGGCGGACA TGTTGTTGAC CCGAAAACCA ACACAAACGG TATTATGGAT 
ATTTTGGTTG AAGACGGTAT AATAACGGAG ATAGGCAAAG ATATTGAAAT TTCAAACGGT
GATATAATTT ATGCCGAGGG CAAGCTGGTA CTGCCGGGAC TGGTGGATGC CCATTGTCAT
CTGAGAGATC CGGGTTTTGA ATACAAGGAG GATATAGAAA CAGGGACCAT GAGTGCTGCA
ATGGGAGGAT TTACTTCCAT AGCGTGTATG CCTAATACGG ATCCTGTCTG TGACAATAAG
GCTGTGGTGA AATATATTAT AAACAAGGCA AAACAGGACG GGTATGTTAA TGTATATCCC
ATCGGGGCCA TATCAAAAGG ACAAAAAGGC GAAGAGCTTT CAGAGATAGG TGAACTTAAA
TTTGCCGGAG CGGTGGCAAT TTCCGACGAC GGAAAGCCGG TAAAAAGTTC TTCACTGATG
AAAAGGGCTT TGGAATATTC ATCCATGTTT GACATAGCTG TAATATCCCA TTGCGAAGAT
CTGGACCTTG CAGACGGCGG TGTGATGAAC GAGGGCTACT GGTCCACAGT TATGGGACTT
AAGGGTATAC CTTCGGCGGC TGAGGAAATA ATGGTGGCAA GGGATATTAT ACTGTCTGAG
TACACAAAGG TTCCGATACA CATAGCCCAT GTGAGTACCG AACTTTCGGT GGAGCTTATA
AGGAATGCCA AAAAGCGCGG GGTAAAAGTT ACATGTGAGA CTTGTCCTCA CTACTTTGTT
CTTACCGATG AGGCTTGCAA AGATTTTAAC ACCCTTGCAA AAGTAAATCC TCCGCTGAGG
ACGAGAAGAG ATGTTGAGGC CGTGATTGAA GGACTGAAGG ACGGCACGAT TGACATAATA
GCAACGGACC ATGCTCCGCA TCATGCCGAT GAGAAAAATG TTGAATTTAA TTTGGCCGCA
AACGGCATGG TCGGATTTGA AACGGCATTG CCTCTGGCGA TAACCTATCT TGTAAAACCG
GGGCACCTTA CCATCAGCCA GCTGGTTGAA AAGATGTGCG TAAATCCTTC GAAACTTTTG
GGTATCAACA AAGGTACGCT GGAGACAGGC AGAAGCGCGG ATATAACTAT TGTTGACCTG
AATGAAGAAT TTGTGGTGGA TGTCAACAAA TTCAAGTCAA AAAGCAAGAA CTCACCTTTT
CATGGGTTCA AGCTGAATGG AAGTGTATAT TATACCTTGG TAAACGGCAA TGTTGTTGTC
AGAGAAAAGG TGCTGCTTTA G
 
Protein sequence
MLIKGGHVVD PKTNTNGIMD ILVEDGIITE IGKDIEISNG DIIYAEGKLV LPGLVDAHCH 
LRDPGFEYKE DIETGTMSAA MGGFTSIACM PNTDPVCDNK AVVKYIINKA KQDGYVNVYP
IGAISKGQKG EELSEIGELK FAGAVAISDD GKPVKSSSLM KRALEYSSMF DIAVISHCED
LDLADGGVMN EGYWSTVMGL KGIPSAAEEI MVARDIILSE YTKVPIHIAH VSTELSVELI
RNAKKRGVKV TCETCPHYFV LTDEACKDFN TLAKVNPPLR TRRDVEAVIE GLKDGTIDII
ATDHAPHHAD EKNVEFNLAA NGMVGFETAL PLAITYLVKP GHLTISQLVE KMCVNPSKLL
GINKGTLETG RSADITIVDL NEEFVVDVNK FKSKSKNSPF HGFKLNGSVY YTLVNGNVVV
REKVLL