Gene Cthe_3166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3166 
SymbolglgC 
ID4809616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3741555 
End bp3742835 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content42% 
IMG OID640108599 
Productglucose-1-phosphate adenylyltransferase 
Protein accessionYP_001039554 
Protein GI125975644 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0448] ADP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR02091] glucose-1-phosphate adenylyltransferase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAAAA AGGAGATTAT TGCCTTGCTG CTTGCCGGCG GTCAGGGCAG TAGACTGGGT 
GTACTGACAA AAAACATTGC AAAGCCTGCA GTTTTGTATG GAGGTAAGTA CAGGATAATC
GATTTCTCTC TCAGCAACTG TGTTAATTCC GATATTGATA CGGTAGGAGT GCTGACCCAA
TACCAACCCC TTGAGCTTAA CGCACACATA GGAATCGGAA AGCCGTGGGA TATGGACAGG
ATAAACGGAG GAGTTACAAT ATTGTCGCCG TATCTTAAGG CGGAAATAGG TGAGTGGTAT
AAGGGAACGG CAAATGCAGT TTTTCAGAAT ATCCATTATG TTGACAAGTA TTCTCCTAAA
TATGTAATAA TCCTCTCGGG AGACCATGTT TACAAGATGA ACTATTCCCA AATGCTCGAT
TTCCATAAAG AGAACAATGC TGATGCAACC ATATCGGTAA TAAATGTTCC ATGGGAAGAG
GCAAGCAGAT ATGGAATTAT GAATACTTAT GAAAACGGAA AAATATATGA GTTTGAAGAA
AAACCGCAAA ATCCGAAAAG TAATCTTGCT TCCATGGGAG TTTACATATT TAATTGGGAG
GTTTTAAAAG AGTACCTTAT AAGAGATGAT CAGAACGAAG AATCAGCCCA TGACTTCGGT
AAAAATATCA TCCCGATGAT GCTTAAAGAA GGCAGAAGCA TGTGGGCATA CAAATTCAAC
GGATATTGGA GGGATGTCGG TACCATACAA GCTTACTGGG AGTCGAACAT GGATCTTATA
AGCAGGGTTC CCGAATTCAA CCTTTTTGAT CCCGCATGGA AGATTTATAC TCCGAATCCG
GTCAAACCGG CTCACTATAT AGGCCCCACC GGAAGTGTAA AAAAGTCCAT TGTCGCTGAA
GGCTGCATGA TATACGGCAG TGTCAGGAAT TCTGTTTTGT TCCCCGGTGT TTATGTAAGT
GAGGGAGCAG AAATTGTTGA CTCTATAGTA ATGAGCGACA GTGTTATTGG TGAGAATACC
CAGATCTACA AATGTATTAT CGGTGAAGAA GTAAAGGTTG GGAAAAATGT GAGAATGGGA
ATCGGCGAAA ACATACCCAA TGAACTTAAA CCCCATTTGT ATGATTCCGG CATAACGGTG
GTCGGGGAAA AGGCTGTTGT ACCTGACGGA TGTCAGATAG GAAAAAATGT TGTTATAGAT
CCGTACATAA CCGCGGAAGA ATTTCCCTCG CTTAATATCG AGTCTGCAAA AAGTGTTTTA
AAGGGAGGAG AAACCGAATG A
 
Protein sequence
MHKKEIIALL LAGGQGSRLG VLTKNIAKPA VLYGGKYRII DFSLSNCVNS DIDTVGVLTQ 
YQPLELNAHI GIGKPWDMDR INGGVTILSP YLKAEIGEWY KGTANAVFQN IHYVDKYSPK
YVIILSGDHV YKMNYSQMLD FHKENNADAT ISVINVPWEE ASRYGIMNTY ENGKIYEFEE
KPQNPKSNLA SMGVYIFNWE VLKEYLIRDD QNEESAHDFG KNIIPMMLKE GRSMWAYKFN
GYWRDVGTIQ AYWESNMDLI SRVPEFNLFD PAWKIYTPNP VKPAHYIGPT GSVKKSIVAE
GCMIYGSVRN SVLFPGVYVS EGAEIVDSIV MSDSVIGENT QIYKCIIGEE VKVGKNVRMG
IGENIPNELK PHLYDSGITV VGEKAVVPDG CQIGKNVVID PYITAEEFPS LNIESAKSVL
KGGETE