Gene Cthe_0639 details


Gene Information

Locus tag: Cthe_0639
Symbol:
ID: 4808168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 789712
End bp: 790914
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 38%
IMG OID: 640106053
Product: glycoside hydrolase family protein
Protein accession: YP_001037067
Protein GI: 125973157
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0296] 1,4-alpha-glucan branching enzyme
TIGRFAM ID:
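
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the gene information fields of this record.
start_bp = 789712
end_bp = 790914
gene_length_bp = 1203
protein_length_aa = 400


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_len_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    return gene_len_bp // 3 - 1


# Both checks hold for the values listed in this record.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = coding_length_aa(gene_length_bp) == protein_length_aa
```

Under these assumptions, 790914 - 789712 + 1 = 1203 bp, and 1203 / 3 = 401 codons, of which one is the stop codon, leaving 400 residues.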


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.401976
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATATC CGTTTTATTA TTGTTGCGAC AATCTTGATA TTAAAAGTAT ATCTGTTATA 
GGAGATTTCA ATAACTGGGA CGGCGGCAAA AATATTTTGA AAAGGCATGA TGACGGTACA
TGGATGACGG AAATCGAACT GTCTCCCGGC AAATATAGAT ACAAATTTTT AATAAATGAC
AGCATACTTT TTAATGACCC AAACGCCCTG ATTTACACCT ATGATGACAA GGGCGGCACT
GCGTCTTTTA TCATAATTGA TGAAAGCAAC AAGAGGATTA TCAATACGAA ACCCGCTTCG
CTCAATATTG AAAGTTACAG CTTTTTGAGC GTTGACAAAG AACTTTCATA CTGTGAAAAC
GGTGAAAGAA AATGCTATAA GGACAAACAC TCAAGTATTA TGCTGAAAAT CGGTTTTTCC
GACATAACGG GCTTGCATAC GGTAAACGCA ATATGGTATG CTCCCAATAA CGAAATCCAT
GAGATTCAGG AAAGTGTCCT AAACGACGGC GAAAAAAATG AAAACAGGAA AAATGAAGCT
TTATTTCAGA TCAATATAAA TGATGATACA ATTGCGGGCG AATGGAAACT GCAAATATTT
ATAAACGGCA GCCTCGCTTT AGAAGACAGT TTCACAGTAG AATTAAAACA ACCGGAAGTC
TTGGATAAAA CGGCTTTGGA CGATGCTGCC ATGCTGTCCG CGGACCTGGA AGTACCTGAT
TTCTTAATCG AAGAGGATAT GCAGAACGAG TTTGAAGCAC CTGAATTTGA AGTGGAATCC
AATTTGCAAA CCAACGCCGA AAAAGAAGAA ATTTCGGAGG AAGACTTGTT GGGACTCTTT
GATGATATTA AAGAAATAGA TATGCCAGAG AAAGATACAT CTGGCAATCA AAAAACTCCT
GCCCCGGATG AAACTGAAAT GCCGGCCGAC GGCAATATTA TGAATTTGTT TGATGAAATT
AGAATTTCGG AATTAAAAGA CGAAAAAGCA CAGGATACGG AAAACAATAC CGAAAGTACT
TCAAAAGAGG CGGAAGAATC CGACAGTGCT ATTGATGAGC TGCTTGAACT AAGAGAGTTT
ATTGATTCCT CTCAGAATTC CGAAAAAGAC TCAATTGAGT CCTCCGGCGG CAAATCCGGG
GAAGAAGCAA ATGAAGAAGA TATCAGTGAC ATATTTATAG ATATAAAAAA TAGAACTGAT
TGA
 
Protein sequence
MKYPFYYCCD NLDIKSISVI GDFNNWDGGK NILKRHDDGT WMTEIELSPG KYRYKFLIND 
SILFNDPNAL IYTYDDKGGT ASFIIIDESN KRIINTKPAS LNIESYSFLS VDKELSYCEN
GERKCYKDKH SSIMLKIGFS DITGLHTVNA IWYAPNNEIH EIQESVLNDG EKNENRKNEA
LFQININDDT IAGEWKLQIF INGSLALEDS FTVELKQPEV LDKTALDDAA MLSADLEVPD
FLIEEDMQNE FEAPEFEVES NLQTNAEKEE ISEEDLLGLF DDIKEIDMPE KDTSGNQKTP
APDETEMPAD GNIMNLFDEI RISELKDEKA QDTENNTEST SKEAEESDSA IDELLELREF
IDSSQNSEKD SIESSGGKSG EEANEEDISD IFIDIKNRTD
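
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons), so a standard codon table is used. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and written for this example only.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for non-start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above (30 bases, 10 codons).
first_line = "ATGAAATATC CGTTTTATTA TTGTTGCGAC"
first_residues = translate(first_line)  # "MKYPFYYCCD", the start of the protein
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the protein sequence and GC content fields listed in the record.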