Gene Cthe_0357 details


Gene Information

Locus tag: Cthe_0357
Symbol:
ID: 4808434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 448123
End bp: 450690
Gene Length: 2568 bp
Protein Length: 855 aa
Translation table: 11
GC content: 42%
IMG OID: 640105771
Product: alpha-glucan phosphorylases
Protein accession: YP_001036788
Protein GI: 125972878
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0058] Glucan phosphorylase
TIGRFAM ID: [TIGR02094] alpha-glucan phosphorylases
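
The start, end, gene length, and protein length fields above can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and,
    by default, that the final codon is a stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(448123, 450690)   # 2568
length_aa = protein_length(length_bp)     # 855
```

Both results match the Gene Length and Protein Length fields of the record.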


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATCTTT TTGGAAAAAT TACCGTAACA GCTGTAATAC CTGACGAACT ATCCAAGCTC 
AAAGACATCG CGTACAACTT GTGGTGGTCA TGGAACTCTG AAGCTATCGA CCTTTTTAGA
GAAATCGACC TTGCTTTGTG GGAAAAGCTT GGAAAGAATC CGGTAAGATT CCTCAAGGAA
GTAAGCCAGA AAAAGCTTGA AGCCAAGCTT AAAGACCCTG ATTATATGCA AAGATACAAA
AAAGTAGTCA ATGATTTTGA AACTTACATG AATGAAACCG ATACATGGTT TTCCAGGAAC
TTCCCCGACA AAAAAGACCA TATGGTAGCT TACTTCTCTG CGGAATATGG ATTGAATGAA
GTACTCCCCA TATATTCAGG CGGGCTTGGC GTATTATCCG GTGACCACTG CAAATCGGCA
AGTGACCTCG GAATACCCTT TACAGCAATC GGCTTGTTCT ATAAAGAAGG ATATTTCAGC
CAGCGCATAA ACTCTGAAGG ATGGCAGGAA ACGATATTCA CTCCGTTAAA CCCTTCAAAC
CTTCCAATAC AACCGGCCTT AAATGACAAA GGTGAAGAGG TAATCATAAG TGTGGAACTA
CCCGGAAGAG TTGTATACGC AAAAGTCTGG GTGGTAAAGG TGGGCCGTGT GAATCTGTAT
TTAATGGACA CTGATATAGA GCAGAACAGT CCTTATGACA GAGGTCTTAC CGCCAGACTC
TACGGCGGAG ACCAGGAAAC AAGAATACAA CAGGAAATCT TCCTTGGAAT AGGCGGTGCA
AGAGTCCTTG ACGCGTTAGG CATAAAAGCC ACCGTATATC ACATGAATGA AGGACATTCG
GCTTTCCTTG GCCTTGAGCT TATCAGAAAG CTAGTACAAA ACCATAATCT TCCTTTCAAT
CAGGCCAAAG AAGTCGTCGC CTCATCCGTT ATATTCACAA CCCATACACC TGTACCTGCT
GGTAACGACG TGTTCCCGCT TGAAATGATA GACAGGTATT TCGGAAATTA CTGGCCGTCT
TTGGGCATAA ACAGACATGA GTTTTTAGAT TTGGGATTAA GAATAGGAGA GCATCACAAC
TTCAACATGA CCGTCCTTGC CTTGACACTG GCAGGACAGA AAAACGGCGT AAGCGAGCTT
CACGGCGCAG TATCAAGAAA CATCTTCAAA AATGTATGGC CTGGAATACC TGAGGACGAA
ATACCGATCG GGCACATCAC AAACGGTATT CATACTCTTA CATGGCTTTC TCCAAGCATT
AAATATCTTT ACGACAAATA TCTTGATAAA GACTGGAAAG AACGGCTTCA TGAAAAAGAA
GTCTGGGAGA AGGTCGACGA CATTCCCGAC GAAGAGCTTT GGAAAACCCA TTGCGTCCTG
AAAACAAAAA TGATTGGATA TGTACGCGAA AAACTTAAAG AGCAGAGAGC TGCAAACGGA
GAGTCAATCG AAAGAATCAA AGAGGTTGAC ACACTGCTTG ACTCCAATGC TTTAACTATA
GGATTTGCAA GAAGATTTGC AACTTATAAA AGGGCAAACC TTATATTCAG AGATCTTGCC
CGAATTCAAA AATTGCTCAA CAACCCGGAA AAACCGGTAC AGATAATATT TGCCGGAAAA
GCCCATCCTG CAGACGGACC TGCTCATGAA ATCATCAAAT ATATCAATGA CATTGCAAAG
CAGGAAGGAT TCAACGGTAA AGTTATTTTA GTGGAAAACT ACAATATGAC ACTTGCCCGC
AATTTGGTTC AGGGAGTGGA TATTTGGCTC AACAACCCGA GAAGACCTCT TGAAGCCAGC
GGAACCAGCG GACAAAAAGT GGCTATAAAC GGAATAATCA ACTTCAGCGT ACTGGACGGT
TGGTGGTGCG AAGGTTACAA CGGCAAAAAC GGTTGGGCAA TCGGAGACGA TACCTTCTAC
GACAACGAAT ATCATCAGGA TAATGCCGAC AGTGAATCAA TTTACAACAT ACTGGAAAAG
CAAATTATAC CTACTTTCTT TGACAGAAAT GAAAAAGGTG TACCCGAAAA GTGGGTAAAA
ATAATGAAGG AATCAATAAA ATCCATCGCT GCCCAATACA GCACGCACAG AATGGTTCAG
GATTATATAA ATAAGTATTA TATTCCTGCA ATGGAAAGAT ATGATAAGAT AAAAGCAAGC
AATTATCAAT TTGCAGCCAA TATCTCAGAA TGGAAGAAGA AGGTAGCGCA TCTGTGGCCT
CAGGTACAGA TAATAGCTGA AAAAACTGCA AACCAATTGA AGGAAAGAAA CTTTATATCC
GGTGAATCCA TACCGATATA CGCCACTGTC AATCTTGGAG GTCTTGAACC TTCGGACGTA
AAGGTTCAGG CCTACTACGG AAGCATCGGA AAAAACAACT CTATCGAAAA TCCTGTAATA
GTTGACATGG ATGTAGTGGA AAGAAACAGC GACGGAACTT ATCTCTACTC TGCAAACATC
ACTTTGTATG AAGGCGGAGA GTACGGATAT ACCTTCAGAG TGATTCCTAA TCATCCGGAT
ATTATCAATC CGTTTGACTT GGGACTTATC AGATGGATTG TACAGTAA
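
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch for joining such text into one continuous string and computing its GC percentage is given here. `clean_sequence` and `gc_percent` are hypothetical helper names, and the value for the full gene should be compared against the GC content field rather than assumed.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, as printed in the record.
first_blocks = clean_sequence("ATGTATCTTT TTGGAAAAAT")
```

Passing the whole sequence block through `clean_sequence` before `gc_percent` avoids counting the spacing characters as bases.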
 
Protein sequence
MYLFGKITVT AVIPDELSKL KDIAYNLWWS WNSEAIDLFR EIDLALWEKL GKNPVRFLKE 
VSQKKLEAKL KDPDYMQRYK KVVNDFETYM NETDTWFSRN FPDKKDHMVA YFSAEYGLNE
VLPIYSGGLG VLSGDHCKSA SDLGIPFTAI GLFYKEGYFS QRINSEGWQE TIFTPLNPSN
LPIQPALNDK GEEVIISVEL PGRVVYAKVW VVKVGRVNLY LMDTDIEQNS PYDRGLTARL
YGGDQETRIQ QEIFLGIGGA RVLDALGIKA TVYHMNEGHS AFLGLELIRK LVQNHNLPFN
QAKEVVASSV IFTTHTPVPA GNDVFPLEMI DRYFGNYWPS LGINRHEFLD LGLRIGEHHN
FNMTVLALTL AGQKNGVSEL HGAVSRNIFK NVWPGIPEDE IPIGHITNGI HTLTWLSPSI
KYLYDKYLDK DWKERLHEKE VWEKVDDIPD EELWKTHCVL KTKMIGYVRE KLKEQRAANG
ESIERIKEVD TLLDSNALTI GFARRFATYK RANLIFRDLA RIQKLLNNPE KPVQIIFAGK
AHPADGPAHE IIKYINDIAK QEGFNGKVIL VENYNMTLAR NLVQGVDIWL NNPRRPLEAS
GTSGQKVAIN GIINFSVLDG WWCEGYNGKN GWAIGDDTFY DNEYHQDNAD SESIYNILEK
QIIPTFFDRN EKGVPEKWVK IMKESIKSIA AQYSTHRMVQ DYINKYYIPA MERYDKIKAS
NYQFAANISE WKKKVAHLWP QVQIIAEKTA NQLKERNFIS GESIPIYATV NLGGLEPSDV
KVQAYYGSIG KNNSIENPVI VDMDVVERNS DGTYLYSANI TLYEGGEYGY TFRVIPNHPD
IINPFDLGLI RWIVQ