Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1079 |
Symbol | |
ID | 4811377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1285501 |
End bp | 1287963 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106501 |
Product | nucleotidyl transferase |
Protein accession | YP_001037504 |
Protein GI | 125973594 |
COG category | [G] Carbohydrate transport and metabolism [J] Translation, ribosomal structure and biogenesis [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1109] Phosphomannomutase [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) |
TIGRFAM ID | [TIGR01208] glucose-1-phosphate thymidylylransferase, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCTG TCATCATGGC GGGAGGAGAA GGTACCAGAC TTAGACCGCT TACGTGTAAC AGGCCAAAAC CAATGGTACC TGTTGTCAAC AAACCTGTTA TGGAGCATAT TATTGAACTT CTGAAAAAAC ATGGATTTAC GGATATTGCA GTTACTTTGC AATATTTACC GGATATGATA AAGGACTATT TTGGAGATGG AAGTGATTTT GGTATAAATC TTAGATATTA TGTGGAAGAT AAACCGATGG GAACGGCGGG AAGTGTAAAA AACGCCGAAG AATTTTTGGA TGATACATTT CTGGTGATAA GTGGAGATGC CTTGACTGAT ATTGATTTGG GAAAGGCGGT AGAGTACCAC TACAGCAAAG GGTCGATGGC AACGCTGGTA CTGAAAAAGG TCGACATACC TTTGGAATAC GGCGTTGTTG TTACCGATGA AAACGGCAGA ATAACAAGGT TTTTGGAAAA GCCAAGCTGG GGCGAAGTGT TCAGCGATAC TGTAAATACC GGTATCTATA TACTGTCTCC TGAGGTTTTA AAATATTTTA ACAAAAATGA AATGTTTGAT TTCAGCAAGG ATTTGTTTCC TATGCTGCTT AAGGAAAATA AACCCATGTA TGGATACATC ACCGATGAAT ACTGGTGCGA CATAGGAGAC CTCATGGCAT ACAGCAAAGC CCATATGGAT GTTCTGGACG GCAAAGTGAA AATAAACATA CCGGGAAATA AAATAAAAGA CAGAGTATGG GTGGGAGAAG GAACCGTAAT AGAGGAAAAC GTTGTTATTG AAGAGCCCTG TGTTATTGGA GCAAATACCC GGATCAAGAA AGATTCTGTT ATCGGCAGTT ATTCCGTTTT GGGAGACAAT AATATTATTG GTGAGAGGAG TGGTATAAAG AGGAGCATTC TTTGGAAAAA CAACGTGCTT GAAACCAATA CCCAACTGAG AGGTACCGTA GTTTGCAGCA AGGTCAATAT CAAAGAAGGG GTTTTTGCTT TTGAAAATTC CGTTATAGGT GATGATACTC AAATAGGAAA AAATGCGGTG ATAAAGCCCA GCATTAAAAT ATGGCCTAAC AAGATTGTGG AGGGAGGAAC TGAGGTAAAT TCAAACCTTG TATGGGGTTC AAAATTTGTC CGTTCCATAT TTGGTTTCAG GGGAGTTGCC GGAGAAATAA ATGTGGATAT TACTCCGGAA TATGCTTCAA AGCTTGGTGC TGCCTACGGA GCTATTTTTA AGGGCAAGGG TAAAATAGGG GTAAGCTCTG ACGACTCGGC CGCTGCCAAA ATGCTGAAGA TGTCTTTTAT GTCGGGGCTG CTTTCCGCAG GACTTAAGGT TTTGGACTTT GGAGTCCTCC ATTTGCCAGT TGCAAGATCC GCAGTAAGGT TTTACGGAGC TGACGGAGGA ATTCATATAA GCACGTCAAG CACAAATTTC GGCCGGCTTA CCGTGGACTT TTTGGACAAA AACGGCAGCA ATATAAAAAG AGAAGTGGAG AGAAAAATTG AGAATGCTTT CCAGAGGGAA GACTTCAGCA GATGTGAGGG AGAGGCCATA AAAGATGTTG AAGTAATACC TGACTATACC AAATACTACT TGAGAAATAT AATAAACAAC ACCCGGTCAA AAGACTTCGG TTATCGTATA GCTCTTAATT CTTTGTCCGG TTTCATTCTC GAAACAGTGG GCAATTTGCT TGAAACTTTT GGCTGTACGG TAGAAAAAAC TTCGCTTGAT TTGCGTAATG CCAAAACAAT GAGAAATGGC AATAAATCTG CGGAAATGAC CTATTTTACC GACATGATAA AAATGGGAAA CTACGATTTA GGTGTGTCAA TTGAAGATAC TTCTGAAAAA ATGATGCTGG TGGACAATAA TGGCCGGATT GTATCGGATG ACATGTTTAC GGCCCTTATT TCATTGATTG TGTTTAAAAC CGTTCAGGGC GGAACAGTGG TGGTGCCGAT TTCCACAAGC TCGGTTGTGG ACAAACTGGC AGAGCAAAAC AACGGCAAGG TAATAAGAAC GAAAACGTCT CCTCAGGATA TTATGAACAA GATGCTTGGC AACGAAACAA AGGAAAATGT GTATGAACAG TTTACGCTGC ATTTTGATGC CATTGCCGGA CTGGTGAAAA TTCTTGATTT CATGAAATCG GAAAACTACA GTCTTTCCGA CCTTGTAAAC ATGATACCGG ATTTCTATGT AAGCGAAAAA GAGGTTGAAT GTTCGTGGAA CGTAAAAGGA AAAGTAATAA GACAGATTAT CCAGGAAAAT GAGGGACAAA GCATTGAAAC CCTTGAAGGA GTAAAGATAT TCAAAGACGG CGGATGGGTA TTGGTTTTGC CTGATGCCGA ACAGCCCGTC TGCAGAATTA AAAGCGAAAG TTATTCCGCT GAATTTGCAG AAGAACTTAC GAATTTCTAC GTCAATAAAG TAAGAGAGAT AAGTCAGAAA TAA
|
Protein sequence | MKAVIMAGGE GTRLRPLTCN RPKPMVPVVN KPVMEHIIEL LKKHGFTDIA VTLQYLPDMI KDYFGDGSDF GINLRYYVED KPMGTAGSVK NAEEFLDDTF LVISGDALTD IDLGKAVEYH YSKGSMATLV LKKVDIPLEY GVVVTDENGR ITRFLEKPSW GEVFSDTVNT GIYILSPEVL KYFNKNEMFD FSKDLFPMLL KENKPMYGYI TDEYWCDIGD LMAYSKAHMD VLDGKVKINI PGNKIKDRVW VGEGTVIEEN VVIEEPCVIG ANTRIKKDSV IGSYSVLGDN NIIGERSGIK RSILWKNNVL ETNTQLRGTV VCSKVNIKEG VFAFENSVIG DDTQIGKNAV IKPSIKIWPN KIVEGGTEVN SNLVWGSKFV RSIFGFRGVA GEINVDITPE YASKLGAAYG AIFKGKGKIG VSSDDSAAAK MLKMSFMSGL LSAGLKVLDF GVLHLPVARS AVRFYGADGG IHISTSSTNF GRLTVDFLDK NGSNIKREVE RKIENAFQRE DFSRCEGEAI KDVEVIPDYT KYYLRNIINN TRSKDFGYRI ALNSLSGFIL ETVGNLLETF GCTVEKTSLD LRNAKTMRNG NKSAEMTYFT DMIKMGNYDL GVSIEDTSEK MMLVDNNGRI VSDDMFTALI SLIVFKTVQG GTVVVPISTS SVVDKLAEQN NGKVIRTKTS PQDIMNKMLG NETKENVYEQ FTLHFDAIAG LVKILDFMKS ENYSLSDLVN MIPDFYVSEK EVECSWNVKG KVIRQIIQEN EGQSIETLEG VKIFKDGGWV LVLPDAEQPV CRIKSESYSA EFAEELTNFY VNKVREISQK
|
| |