Gene Information |
Locus tag | Cthe_3166 |
Symbol | glgC |
ID | 4809616 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3741555 |
End bp | 3742835 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108599 |
Product | glucose-1-phosphate adenylyltransferase |
Protein accession | YP_001039554 |
Protein GI | 125975644 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0448] ADP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR02091] glucose-1-phosphate adenylyltransferase |
|
|
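The coordinate fields above are related by simple arithmetic. With 1-based, inclusive coordinates, the gene length is End bp minus Start bp plus 1, and an unspliced CDS of that length encodes one residue per codon, less the terminal stop codon. The Python sketch below checks this against the values in this record. The function names are illustrative only and do not belong to any database API, and the inclusive-coordinate convention is an assumption inferred from the numbers listed here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One amino acid per codon; the final stop codon adds no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(3741555, 3742835)
    print(length, protein_length_aa(length))  # prints: 1281 426
```

Both results agree with the Gene Length and Protein Length fields.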
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAAAA AGGAGATTAT TGCCTTGCTG CTTGCCGGCG GTCAGGGCAG TAGACTGGGT GTACTGACAA AAAACATTGC AAAGCCTGCA GTTTTGTATG GAGGTAAGTA CAGGATAATC GATTTCTCTC TCAGCAACTG TGTTAATTCC GATATTGATA CGGTAGGAGT GCTGACCCAA TACCAACCCC TTGAGCTTAA CGCACACATA GGAATCGGAA AGCCGTGGGA TATGGACAGG ATAAACGGAG GAGTTACAAT ATTGTCGCCG TATCTTAAGG CGGAAATAGG TGAGTGGTAT AAGGGAACGG CAAATGCAGT TTTTCAGAAT ATCCATTATG TTGACAAGTA TTCTCCTAAA TATGTAATAA TCCTCTCGGG AGACCATGTT TACAAGATGA ACTATTCCCA AATGCTCGAT TTCCATAAAG AGAACAATGC TGATGCAACC ATATCGGTAA TAAATGTTCC ATGGGAAGAG GCAAGCAGAT ATGGAATTAT GAATACTTAT GAAAACGGAA AAATATATGA GTTTGAAGAA AAACCGCAAA ATCCGAAAAG TAATCTTGCT TCCATGGGAG TTTACATATT TAATTGGGAG GTTTTAAAAG AGTACCTTAT AAGAGATGAT CAGAACGAAG AATCAGCCCA TGACTTCGGT AAAAATATCA TCCCGATGAT GCTTAAAGAA GGCAGAAGCA TGTGGGCATA CAAATTCAAC GGATATTGGA GGGATGTCGG TACCATACAA GCTTACTGGG AGTCGAACAT GGATCTTATA AGCAGGGTTC CCGAATTCAA CCTTTTTGAT CCCGCATGGA AGATTTATAC TCCGAATCCG GTCAAACCGG CTCACTATAT AGGCCCCACC GGAAGTGTAA AAAAGTCCAT TGTCGCTGAA GGCTGCATGA TATACGGCAG TGTCAGGAAT TCTGTTTTGT TCCCCGGTGT TTATGTAAGT GAGGGAGCAG AAATTGTTGA CTCTATAGTA ATGAGCGACA GTGTTATTGG TGAGAATACC CAGATCTACA AATGTATTAT CGGTGAAGAA GTAAAGGTTG GGAAAAATGT GAGAATGGGA ATCGGCGAAA ACATACCCAA TGAACTTAAA CCCCATTTGT ATGATTCCGG CATAACGGTG GTCGGGGAAA AGGCTGTTGT ACCTGACGGA TGTCAGATAG GAAAAAATGT TGTTATAGAT CCGTACATAA CCGCGGAAGA ATTTCCCTCG CTTAATATCG AGTCTGCAAA AAGTGTTTTA AAGGGAGGAG AAACCGAATG A
|
Protein sequence | MHKKEIIALL LAGGQGSRLG VLTKNIAKPA VLYGGKYRII DFSLSNCVNS DIDTVGVLTQ YQPLELNAHI GIGKPWDMDR INGGVTILSP YLKAEIGEWY KGTANAVFQN IHYVDKYSPK YVIILSGDHV YKMNYSQMLD FHKENNADAT ISVINVPWEE ASRYGIMNTY ENGKIYEFEE KPQNPKSNLA SMGVYIFNWE VLKEYLIRDD QNEESAHDFG KNIIPMMLKE GRSMWAYKFN GYWRDVGTIQ AYWESNMDLI SRVPEFNLFD PAWKIYTPNP VKPAHYIGPT GSVKKSIVAE GCMIYGSVRN SVLFPGVYVS EGAEIVDSIV MSDSVIGENT QIYKCIIGEE VKVGKNVRMG IGENIPNELK PHLYDSGITV VGEKAVVPDG CQIGKNVVID PYITAEEFPS LNIESAKSVL KGGETE
|
| |
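The sequence fields are printed in blocks of ten characters, so the spacing has to be stripped before any computation. The sketch below shows one way to do that in Python, then computes a GC fraction and translates a coding sequence using the codon assignments of translation table 11. Table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits; since this gene begins with ATG, the sketch does not handle alternative start codons. All function names are hypothetical, and only the first 30 and the last 21 bases of the gene sequence are used in the demonstration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    print(translate("ATGCATAAAA AGGAGATTAT TGCCTTGCTG"))  # prints: MHKKEIIALL
    print(translate("AAGGGAGGAG AAACCGAATG A"))           # prints: KGGETE
```

The two translated fragments match the first ten and the last six residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field.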