Gene Cthe_0332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0332 
Symbol 
ID4808481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp421697 
End bp423370 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content40% 
IMG OID640105746 
Productphosphoribulokinase/uridine kinase 
Protein accessionYP_001036763 
Protein GI125972853 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0572] Uridine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAA ACAATCTAAA TATGATTAAA GTGGTATTTC CGGATAACAG CGAAAGAGAA 
GTGTATGAAG GAATATCTTT GCAGGAATTG AGTGAAAGCT GCAAAAACCA ATATAAATCA
ACCATTGTGG CGGCAAAGGT TAACAACGAT ATAAAGGAAT TAAGCTATCG TCTTAATGAA
AGTTGCCGGG TGGAGTTTAT CGACCTTACG GATGATGACG GAATGAGGAT ATATAAAAGA
AGCCTCAGCT TTATTCTCAT TAAGGCCGTA AATGACCTTT TTCCCGACAG AAAGGTTATA
ATTTGCCATT CCATCAGCAA GGGAATTTAC TGTGAGGTTA AAGGCGACAC ACCTCTTACT
GTTGAAGAGG TAGACATGAT AAAAAACAGA ATGAAAGAAA TTGTCAATTT GAAAATTCCT
TTCATAAAGA AGATAATGTC TCTTGATGAG GCAAGGGAAG TATTCAGAAA AATCGGAAGA
ATGGACAGGT TCCGTTCCAT AGAATACAGA AAAAAGCCCT ATGTGACTTT ATACGAATGC
GATGGGTTCC AGGACTATTT TTATGGATAT ATGGTGCCTC ATACGGGGTA TCTGGATAAA
TTTGATTTAA AATATTATCA GCCCGGCCTG ATATTGATGA GTCCGGAAAA AACCAGTCCG
GATGCTATAC CGCAATTCAA AGAGCAAAAG AAGCTTTTCA GCATATTTGC GGAATACAAA
AAATGGGGAA AAATACTTGG TGTCGAAGAT GTGAGTGCGC TAAATGACAT TGTAAAGGAA
GGCAAAATAA ATGAGCTTAT AAGAGTTGCA GAGGCTTTGC ATGAGAAGAA AATTGCGCAG
ATTGCGGATA TGATAGCCTT TAATGAGCAT AAGAAGAAAG TCGTTTTGAT TGCCGGTCCG
TCTTCATCGG GAAAGACAAC CTTTGCCCAC AGACTTTCGA TACAGCTTAA GGTAAATGGT
TTGAGGCCCG TTACCATATC TCTGGATGAT TATTTTGTTG ACAGGGAGCT TACTCCCAAG
GATGAAAACG GAGAATACGA CTTTGAGGCC CTGGAGGCTA TTGATATCAA ACTTTTCAAC
CGGCATCTTG CGGAGCTGAT AGAAGGGAAA GAAGTTGACG TTCCGATTTT CAATTTCCCT
AAAGGATGCA GGGAAAGCTT TTGCAGGAAG CTTAAGATTG ACGAAGACCA GATAATCATA
ATTGAAGGCA TACACGGATT GAATGAAAAA CTGACGGCCT CAATTCCAAA AGAGAACAAA
TTCAAGATAT ATGTGAGCGC ACTTACCTCA ATGAATATTG ATGAGCATAA TCGTATACCT
ACTACGGATA CACGAATCAT CAGAAGGATT GTAAGAGATT ACCAATTCAG AGGCTGCAGT
GCGGCAAACA CTATAAAACG CTGGCCTTCT GTAAGAAGGG GCGAGGAGAG AAACATATTC
CCGTTCCAGG AAGAAGCGGA TGTAATGTTT AATTCAGCGC TTATGTTTGA ACTGGGAGTT
TTAAAAACCT ATGCGGAACC GCTGCTGATG GAGATAGATT CTTCTGAGCC TGAATATTCC
GAGGCAAGGA GACTTATAGA ATTTTTGAAC AATTTCTTGC CGATAGACTC GAAAGAGATT
CCTGCAAATT CAATAATAAG GGAGTTTATT GGCGGAAGTT GTTTTTACCA GTAA
 
Protein sequence
MNENNLNMIK VVFPDNSERE VYEGISLQEL SESCKNQYKS TIVAAKVNND IKELSYRLNE 
SCRVEFIDLT DDDGMRIYKR SLSFILIKAV NDLFPDRKVI ICHSISKGIY CEVKGDTPLT
VEEVDMIKNR MKEIVNLKIP FIKKIMSLDE AREVFRKIGR MDRFRSIEYR KKPYVTLYEC
DGFQDYFYGY MVPHTGYLDK FDLKYYQPGL ILMSPEKTSP DAIPQFKEQK KLFSIFAEYK
KWGKILGVED VSALNDIVKE GKINELIRVA EALHEKKIAQ IADMIAFNEH KKKVVLIAGP
SSSGKTTFAH RLSIQLKVNG LRPVTISLDD YFVDRELTPK DENGEYDFEA LEAIDIKLFN
RHLAELIEGK EVDVPIFNFP KGCRESFCRK LKIDEDQIII IEGIHGLNEK LTASIPKENK
FKIYVSALTS MNIDEHNRIP TTDTRIIRRI VRDYQFRGCS AANTIKRWPS VRRGEERNIF
PFQEEADVMF NSALMFELGV LKTYAEPLLM EIDSSEPEYS EARRLIEFLN NFLPIDSKEI
PANSIIREFI GGSCFYQ