Gene Cthe_3167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3167 
Symbol 
ID4809617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3742832 
End bp3743953 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content39% 
IMG OID640108600 
Productglucose-1-phosphate adenylyltransferase, GlgD subunit 
Protein accessionYP_001039555 
Protein GI125975645 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0448] ADP-glucose pyrophosphorylase 
TIGRFAM ID[TIGR02092] glucose-1-phosphate adenylyltransferase, GlgD subunit 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGCA CAATGGGTAT AATACTTACC GGTGGAAAAA ACGACAGGCT CAAAGAACTG 
GCAGAGATGC GATCCAGTAC TGCCGTGCCG ATAGGAGGAA AATACAGAGT CATAGATTTT
GTACTGTCGA ATATGGTAAA CTCCGGCATA AAAAATATTG GCGTGCTTAC CCAGTACAGT
TTCAGATCGC TTATGGACCA TTTGGGTTCA GGAAAAGAAT GGGATTTGGA CAGGAGAAAT
GAAGGTCTCT TTATATTCCC TCCTTATCTT GCAGGAGAAC ATTCAGGCTG GTACCAGGGA
AGTGCCGATG CCATGTACCA TAACATAACC TTTTTAAAGA GAAGTTATGA AGAATATGTT
CTGGTTGCTC AGGGAAACTG TGTTTATAAG ATGCTTTTTG ACGACATGCT GGACTACCAT
ATAAAGAAAA ATGCCGATAT TACCATAGCA TACAGGGATA TGAGCGATTT TCCGCGGGAA
GAGCTTTCAT TCATGGGCGT AATGACTATG GATGAAAACA GAAGAGTGGT TGATTTTAAG
GAGAAACACA AAAACCCTGA GTCAACCATT TGCTCAATGG GAATATACAT ATTGAAAAGG
GAGCTTCTGA TTGCACTTTT GGAAGAGTGC AATTCCCACG GGAAATATGA CTTTGTCAAG
GACGTTATTA TAAACAAGCT CCCAACCCTC AATATTTACG GATACAAGTT TGAAGGATAT
TGGAGAAATT TAAACAGCAT AAATGCTTAT TACAGGATTA ACATGGAGAT GCTGAATCCG
GAAATAAGAT ATCAGCTTTT TGAACAGCAT GGAAAAGTTT ATACAAAGGT AAAAGACGAA
CCTCCCGCAA AATATAACGA GGAAGCGGAG GTCAGAAACT CCATAATAGC CGACGGATGC
ATTATAGAGG GAACGGTTAT AAACTCGGTA CTTTTCCGCG GAGTCACAAT AAAAAGAAAT
GCCGTGGTAA AAGACTGTAT AATAATGCAA GATTCAGTAA TTGAGGAAGA TGTGGATATT
GAAAGCCTTA TCATAGACAA GAACGTGTAC TTGTCCAAAG GAATCAGGCT TAAGGGAATG
GTCAATTTCC CTATAACAAT AGGAAAAAAT GCGGTAATAT AA
 
Protein sequence
MKSTMGIILT GGKNDRLKEL AEMRSSTAVP IGGKYRVIDF VLSNMVNSGI KNIGVLTQYS 
FRSLMDHLGS GKEWDLDRRN EGLFIFPPYL AGEHSGWYQG SADAMYHNIT FLKRSYEEYV
LVAQGNCVYK MLFDDMLDYH IKKNADITIA YRDMSDFPRE ELSFMGVMTM DENRRVVDFK
EKHKNPESTI CSMGIYILKR ELLIALLEEC NSHGKYDFVK DVIINKLPTL NIYGYKFEGY
WRNLNSINAY YRINMEMLNP EIRYQLFEQH GKVYTKVKDE PPAKYNEEAE VRNSIIADGC
IIEGTVINSV LFRGVTIKRN AVVKDCIIMQ DSVIEEDVDI ESLIIDKNVY LSKGIRLKGM
VNFPITIGKN AVI