Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0332 |
Symbol | |
ID | 4808481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 421697 |
End bp | 423370 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105746 |
Product | phosphoribulokinase/uridine kinase |
Protein accession | YP_001036763 |
Protein GI | 125972853 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0572] Uridine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAA ACAATCTAAA TATGATTAAA GTGGTATTTC CGGATAACAG CGAAAGAGAA GTGTATGAAG GAATATCTTT GCAGGAATTG AGTGAAAGCT GCAAAAACCA ATATAAATCA ACCATTGTGG CGGCAAAGGT TAACAACGAT ATAAAGGAAT TAAGCTATCG TCTTAATGAA AGTTGCCGGG TGGAGTTTAT CGACCTTACG GATGATGACG GAATGAGGAT ATATAAAAGA AGCCTCAGCT TTATTCTCAT TAAGGCCGTA AATGACCTTT TTCCCGACAG AAAGGTTATA ATTTGCCATT CCATCAGCAA GGGAATTTAC TGTGAGGTTA AAGGCGACAC ACCTCTTACT GTTGAAGAGG TAGACATGAT AAAAAACAGA ATGAAAGAAA TTGTCAATTT GAAAATTCCT TTCATAAAGA AGATAATGTC TCTTGATGAG GCAAGGGAAG TATTCAGAAA AATCGGAAGA ATGGACAGGT TCCGTTCCAT AGAATACAGA AAAAAGCCCT ATGTGACTTT ATACGAATGC GATGGGTTCC AGGACTATTT TTATGGATAT ATGGTGCCTC ATACGGGGTA TCTGGATAAA TTTGATTTAA AATATTATCA GCCCGGCCTG ATATTGATGA GTCCGGAAAA AACCAGTCCG GATGCTATAC CGCAATTCAA AGAGCAAAAG AAGCTTTTCA GCATATTTGC GGAATACAAA AAATGGGGAA AAATACTTGG TGTCGAAGAT GTGAGTGCGC TAAATGACAT TGTAAAGGAA GGCAAAATAA ATGAGCTTAT AAGAGTTGCA GAGGCTTTGC ATGAGAAGAA AATTGCGCAG ATTGCGGATA TGATAGCCTT TAATGAGCAT AAGAAGAAAG TCGTTTTGAT TGCCGGTCCG TCTTCATCGG GAAAGACAAC CTTTGCCCAC AGACTTTCGA TACAGCTTAA GGTAAATGGT TTGAGGCCCG TTACCATATC TCTGGATGAT TATTTTGTTG ACAGGGAGCT TACTCCCAAG GATGAAAACG GAGAATACGA CTTTGAGGCC CTGGAGGCTA TTGATATCAA ACTTTTCAAC CGGCATCTTG CGGAGCTGAT AGAAGGGAAA GAAGTTGACG TTCCGATTTT CAATTTCCCT AAAGGATGCA GGGAAAGCTT TTGCAGGAAG CTTAAGATTG ACGAAGACCA GATAATCATA ATTGAAGGCA TACACGGATT GAATGAAAAA CTGACGGCCT CAATTCCAAA AGAGAACAAA TTCAAGATAT ATGTGAGCGC ACTTACCTCA ATGAATATTG ATGAGCATAA TCGTATACCT ACTACGGATA CACGAATCAT CAGAAGGATT GTAAGAGATT ACCAATTCAG AGGCTGCAGT GCGGCAAACA CTATAAAACG CTGGCCTTCT GTAAGAAGGG GCGAGGAGAG AAACATATTC CCGTTCCAGG AAGAAGCGGA TGTAATGTTT AATTCAGCGC TTATGTTTGA ACTGGGAGTT TTAAAAACCT ATGCGGAACC GCTGCTGATG GAGATAGATT CTTCTGAGCC TGAATATTCC GAGGCAAGGA GACTTATAGA ATTTTTGAAC AATTTCTTGC CGATAGACTC GAAAGAGATT CCTGCAAATT CAATAATAAG GGAGTTTATT GGCGGAAGTT GTTTTTACCA GTAA
|
Protein sequence | MNENNLNMIK VVFPDNSERE VYEGISLQEL SESCKNQYKS TIVAAKVNND IKELSYRLNE SCRVEFIDLT DDDGMRIYKR SLSFILIKAV NDLFPDRKVI ICHSISKGIY CEVKGDTPLT VEEVDMIKNR MKEIVNLKIP FIKKIMSLDE AREVFRKIGR MDRFRSIEYR KKPYVTLYEC DGFQDYFYGY MVPHTGYLDK FDLKYYQPGL ILMSPEKTSP DAIPQFKEQK KLFSIFAEYK KWGKILGVED VSALNDIVKE GKINELIRVA EALHEKKIAQ IADMIAFNEH KKKVVLIAGP SSSGKTTFAH RLSIQLKVNG LRPVTISLDD YFVDRELTPK DENGEYDFEA LEAIDIKLFN RHLAELIEGK EVDVPIFNFP KGCRESFCRK LKIDEDQIII IEGIHGLNEK LTASIPKENK FKIYVSALTS MNIDEHNRIP TTDTRIIRRI VRDYQFRGCS AANTIKRWPS VRRGEERNIF PFQEEADVMF NSALMFELGV LKTYAEPLLM EIDSSEPEYS EARRLIEFLN NFLPIDSKEI PANSIIREFI GGSCFYQ
|
| |