Gene Cthe_0059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0059 
Symbol 
ID4808754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp90979 
End bp92439 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content40% 
IMG OID640105468 
Producttype 3a, cellulose-binding 
Protein accessionYP_001036493 
Protein GI125972583 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0360636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGAT TGGGAATAAT ATATGAAATT CAGGGCATGA AAGCTGTAGT TCTGACAAGC 
GAAGGCGAAT TTTTGATTAT TCGCAGGCGC AAAGATATGA AGGTTGGACA GCAGGTGAGT
TTTGAAAATG AGGATATATA TAATGTCAGG GGAAAGAGAT TTTTATATGT TGCTGCCGCC
GTTTCAAGTG TTGCGGCAGT GCTGGTTGTA ATGTTTCTGT ATTTTCAGTC TGCATTTTTG
AGTAATACCG ATAATATTTA CGGATATATA TGCGTTGATA TAAATCCCAG CGTTGAGCTG
GTAATTGATG AAACTTGCAG GGTTTTGGAA GTTAGACCTC AAAATAAAGA CGGAGAGCAA
TTGATTTCGG GATTGGAACT TTTGGACAAA AATGTTGAAG ATGTTGTTTA CGAGCTTATT
AACAGGTCCA TAAGTTTTGG TTTTGTCAAA GCTGATGATA ACAGAAAAAT TGTTCTTATC
AGCGGTGCTC TTAATGATAA ACGGAACGAA CTCAAAACGA AAAAAGAAAA CGATGAGGCT
GAATTGACGG AATTGCTTGA CAATATCAAA GCCAGGGTAG ATAGAATAGA TAATATTAAA
GTGCGCACCA TAACGGCAAC TTCCAGGGAA AGAAAAGATG CATTGAAATA CGGGCTTTCC
ATGGGAAAAT ACTGCCTATA CCTTGAAGCG CAGGAGTTGA ACGGCAGCAT TACCATTGAC
GAAGTGCATG ATATGAGTAT TTCAGATATG ATAGAGAAAT TGGAACAGAT GAAGCTGGCA
TTAAAAGATG AGGCAAGTCC AAAACTGCAA ACCACGCCGA CGCTTGGAGG GGAAACTGCA
CAAATATCGC CGGAATCCAT GCAACATTCC ACAGTGCCCG GGTTGCCGGA AACTCCATCA
TCTTCAGAGA AGACAATCGC ACCGACACTC CATGGAACTC CAGGTGTGCC TGATGAGAAA
ACATTACAGC CTTCAACGCC GACAGAAAGC TCAGAATATG TGCAAGACGG TACAAAAGGG
CTTAAAATAC AATATTACAG CAGAAAGCCC CATGATTCCG CAGGGATCGA CTTCAGCTTC
AGAATGTTTA ACACGGGAAA TGAAGCAATT GACCTTAAAG ATGTTAAAGT AAGGTATTAT
TTCAAAGAAG ATGTTTCGAT TGATGAAATG AACTGGGCGG TATACTTTTA CAGTTTGGGT
AGTGAAAAGG ATGTTCAGTG CAGGTTTTAT GAGCTTCCCG GAAAGAAAGA GGCAAACAAA
TATCTTGAAA TTACATTCAA ATCGGGGACG CTTTCTCCGA ACGATGTAAT GTATATCACA
GGTGAGTTTT ATAAGAATGA TTGGACAAAA TTCGAGCAAA GGGACGATTA TTCCTACAAT
CCTGCGGATT CCTATTCGGA TTGGAAAAGG ATGACTGCAT ACATTTCGAA CAAACTGGTA
TGGGGAATTG AGCCCAATTG A
 
Protein sequence
MNRLGIIYEI QGMKAVVLTS EGEFLIIRRR KDMKVGQQVS FENEDIYNVR GKRFLYVAAA 
VSSVAAVLVV MFLYFQSAFL SNTDNIYGYI CVDINPSVEL VIDETCRVLE VRPQNKDGEQ
LISGLELLDK NVEDVVYELI NRSISFGFVK ADDNRKIVLI SGALNDKRNE LKTKKENDEA
ELTELLDNIK ARVDRIDNIK VRTITATSRE RKDALKYGLS MGKYCLYLEA QELNGSITID
EVHDMSISDM IEKLEQMKLA LKDEASPKLQ TTPTLGGETA QISPESMQHS TVPGLPETPS
SSEKTIAPTL HGTPGVPDEK TLQPSTPTES SEYVQDGTKG LKIQYYSRKP HDSAGIDFSF
RMFNTGNEAI DLKDVKVRYY FKEDVSIDEM NWAVYFYSLG SEKDVQCRFY ELPGKKEANK
YLEITFKSGT LSPNDVMYIT GEFYKNDWTK FEQRDDYSYN PADSYSDWKR MTAYISNKLV
WGIEPN