Gene Information |
Locus tag | Cthe_2643 |
Symbol | |
ID | 4808954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3124691 |
End bp | 3125737 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108056 |
Product | nucleotidyl transferase |
Protein accession | YP_001039035 |
Protein GI | 125975125 |
COG category | [J] Translation, ribosomal structure and biogenesis; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) |
TIGRFAM ID | |
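The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the stop codon. Neither convention is stated in the record, though both are consistent with the numbers it reports. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3124691
end_bp = 3125737
gene_length_bp = 1047
protein_length_aa = 348

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which encodes no residue.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1047 0 348
```

Under these assumptions the span equals the reported gene length, divides evenly into codons, and leaves 348 residues once the stop codon is excluded.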
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGTTG CAGATTTTTT AATAGATGAA GAGGCAACAA TGTTAAACGC GATGGAACAA CTTGATAAGG TCGCAAAAAA AGTGCTCTTT GTCATAAAAG GAGGCCGTTT TGTCGCAACT GTCACCGACG GAGATATAAG GCGCTGGATT TTAAAGAAGG GAAATCTTGA TGCAAAAGTC AGGGAAATTG CCAACTACAG TCCAAAATTT CTGTATGAAG AAGAAAAAAA CAGGGCTAAA GAATATATGA AAAAGTATTC TGTTGAGGCC TTACCAATAC TCGACAAAGA GTGCAATATT GTTTCCGTTG TACTCTGGAA CGATGAACAA ATAGGGCCAA AGAAAAATCT GGATATTCCG GTGGTAATCA TGGCCGGAGG ACTTGGCACC CGGCTTTATC CATATACCAA AATACTTCCA AAACCGTTAA TACCAATAGG TGAAATACCT ATCGCGGAAC ATATAATGAA CAGGTTTAAC AAATTTGGAT GCAGGCAGTT TTATCTCATT TTAAACCACA AGAAAAATAC CGTAAAAGCA TACTTTAACG ATATTGAAAA AAACTATTCC GTAAACTATG TGGAAGAAGA AAAACCTTTG GGAACAGGCG GCGGACTTAG CCTTTTAAAA GGGAAAATTA CTTCCACCTT TGTCCTTTCA AATTGCGATA TTTTGATAGA AGAGGATTAT GAGAAAATAT ACAGCTACCA TAAAAAAATG AACAATCTGA TAACCATGGT TTGCTCACTA AAGAATATCA AAATTCCCTA TGGTGTTGTT GAAATTAATG ACAAGGGTGA AATCGAAAAT ATAAAGGAAA AACCTGAGCT CGTGTACTTT GTCAACACAG GATTGTATTT TGCCGAGCCG AAAATCATTG AAGAATTGGA AGAAAACCGG CCCGTGGAAT TTCCGGACAT CATCAAGAAA TACAAATCCC GGGGAGAAAA AATCGGTGTC TATCCAATAA GCGAAAACAG CTGGATGGAC ATGGGGCAGT TTGATGAAAT GGAAAAAATG AAGAGAAGGT TGGAAAAAGA TGAATAA
|
Protein sequence | MDVADFLIDE EATMLNAMEQ LDKVAKKVLF VIKGGRFVAT VTDGDIRRWI LKKGNLDAKV REIANYSPKF LYEEEKNRAK EYMKKYSVEA LPILDKECNI VSVVLWNDEQ IGPKKNLDIP VVIMAGGLGT RLYPYTKILP KPLIPIGEIP IAEHIMNRFN KFGCRQFYLI LNHKKNTVKA YFNDIEKNYS VNYVEEEKPL GTGGGLSLLK GKITSTFVLS NCDILIEEDY EKIYSYHKKM NNLITMVCSL KNIKIPYGVV EINDKGEIEN IKEKPELVYF VNTGLYFAEP KIIEELEENR PVEFPDIIKK YKSRGEKIGV YPISENSWMD MGQFDEMEKM KRRLEKDE
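The gene and protein sequences above can be compared with a short script. This is a minimal sketch, not the database's own method. It assumes the gene sequence is listed in coding orientation, which fits its ATG start and TAA end even though the gene lies on the minus strand, so no reverse complement is taken. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Translation table 11 assigns the same amino acids as the standard genetic code and differs only in the permitted start codons, so a standard codon table is used here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces that group the record's sequences into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
fragment = "ATGGACGTTG CAGATTTTTT AATAGATGAA"
print(translate(fragment))  # MDVADFLIDE
```

The fragment translates to the first ten residues of the listed protein sequence. Passing the full Gene sequence string to `translate` and `gc_fraction` would allow the same comparison against the complete Protein sequence and the GC content field.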