Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0681 |
Symbol | |
ID | 4810299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 837592 |
End bp | 839085 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106098 |
Product | inosine 5-monophosphate dehydrogenase |
Protein accession | YP_001037109 |
Protein GI | 125973199 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0516] IMP dehydrogenase/GMP reductase |
TIGRFAM ID | [TIGR01302] inosine-5'-monophosphate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000239611 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTATA TCTATGAAGA GGTGTCAAGA ACTTTCAGCG AATACTTACT AATACCAAAT CTTACAACGG AAAAATGTAC TCCGGATAAT ATAGACCTTA GCACTCCTTT GGTAAAATTC AAAAAAGATG AAGAGTGTAG TTTAAAACTC AATATTCCTA TAGTATCCGC CATTATGCAG TCGGTGTCCA ACGACACACT GGCAATAGCT CTTGCCAGAT GCGGAGGATT ATCTTTTATA TATGCTTCTC AGCCAATTGA AAGCCAGGCT GAAATGGTTA AAAGAGTAAA AAAGTACAAG TCGGGATTTG TTGTCAGCGA CTCCAATCTC ACCATCGACA GCACATTGAA GGATGTCATC GAGTTAAAGA ACAGAACCGG CCATTCCACT ATCGCAATAA CGGACGACGG TACAGCTTCC GGAAAGCTTC TTGGATTGGT CACTACAAGG GACTATAGAA TAAGCAGGGA TCCTTTGGAT AAAAAAGTAA AGGATTTCAT GACACCCTTC TCAAAACTCG TTGTAGGTAA ATTGGGTATC AGCTTAAGCG AAGCAAATGA CATAATATGG GAAAACAAGC TTAATTGTCT TCCTATTGTT GACGATGAGC AAAGGCTTCA CTATTTAGTT TTCAGAAAAG ACTATGACGA CCATAAGCAG AACCCTTATG AACTGCTTGA CAGCAACAAA AGACTTAGAG TCGGAGCAGG AATAAACACA AGAGACTACA AAGAAAGAGT GCCGGCGCTG GTAGATGCGG GAGTTGACGT CCTGTGTATT GATTCGTCCG ATGGATTTTC CGTATGGCAG AAATATACCC TGGATTATAT AAAATCAAAT TACAATATAA AAGTCGGTGC GGGAAATGTG GTTGACAGGG AAGGTTTCTT GTACCTTGCC GAGGCCGGAG CCGACTTTGT AAAGGTTGGA ATCGGTGGAG GCTCCATTTG TATAACCCGT GAACAAAAAG GAATCGGAAG AGGACAAGCC ACGGCCGTTA TTGAAGTGGC AAAAGCCAGA GATGAATATT TTGAGAAAAC CGGCGTTTAC ATTCCGATTT GCTCAGACGG CGGTATTGTT CACGACTATC ACATTGTTCT GGCCCTGGCA ATGGGTGCTG ATTTCGTAAT GATGGGAAGA TATTTTGCAA GATTCGATGA AAGTCCCACG AAGAAAGTTA AGAGCGGAAA CGGTTATGTT AAAGAATATT GGGGGGAAGG CTCAAACAGA GCCAGGAACT GGCAGCGTTA CGACCATGGA GGGGAAAGTA CCAATCTGAA ATTTGAAGAA GGTGTTGACA GCTACGTACC TTATGCGGGT AAACTGAGAG ACAACCTTGA AATTACACTG AGCAAAATAA AAGCTACAAT GTCAAGCTGC GGCGCAGCTT CCATAAGCGA GCTTCAGAAA ACCGCAAGGC TGACTGTGGT ATCTTCCACA AGCATAATAG AAGGCGGGGC TCACGACGTT ATATTAAAGG ACAAGGATTA TTAA
|
Protein sequence | MAYIYEEVSR TFSEYLLIPN LTTEKCTPDN IDLSTPLVKF KKDEECSLKL NIPIVSAIMQ SVSNDTLAIA LARCGGLSFI YASQPIESQA EMVKRVKKYK SGFVVSDSNL TIDSTLKDVI ELKNRTGHST IAITDDGTAS GKLLGLVTTR DYRISRDPLD KKVKDFMTPF SKLVVGKLGI SLSEANDIIW ENKLNCLPIV DDEQRLHYLV FRKDYDDHKQ NPYELLDSNK RLRVGAGINT RDYKERVPAL VDAGVDVLCI DSSDGFSVWQ KYTLDYIKSN YNIKVGAGNV VDREGFLYLA EAGADFVKVG IGGGSICITR EQKGIGRGQA TAVIEVAKAR DEYFEKTGVY IPICSDGGIV HDYHIVLALA MGADFVMMGR YFARFDESPT KKVKSGNGYV KEYWGEGSNR ARNWQRYDHG GESTNLKFEE GVDSYVPYAG KLRDNLEITL SKIKATMSSC GAASISELQK TARLTVVSST SIIEGGAHDV ILKDKDY
|
| |