Gene Cthe_0270 details

Gene Information

Locus tag: Cthe_0270
Symbol:
ID: 4808553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 333151
End bp: 334605
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 39%
IMG OID: 640105682
Product: glycoside hydrolase family protein
Protein accession: YP_001036702
Protein GI: 125972792
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3325] Chitinase
TIGRFAM ID:
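
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 333151
end_bp = 334605

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1455 484
```

Both results agree with the Gene Length (1455 bp) and Protein Length (484 aa) fields.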


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TACCGTTACT TATGCTGCTG TCAGCAATAA TATTTTTATC TTTACATCCC 
ACACTGTCGT ATGCACAGGA CGACTCTCTT CCGACAAAAA GGATAGTAGG CTACTTTGCT
GAATGGAATA TATATCTTGA AAACAATTAT TATGAAGTTT CAGATATTCC ATGGGATATG
GTAACGCATA TAAATTATGC CTTTGCCAAA ATTGAAAATG GAAGGATAGC AATTATTGAT
AAATGGGCTG CGATCCAAAA GCCTTTTGGC GACGATACCT GGGATACACC GATAAGAGGC
CATTTTGGAC AACTGATAAA GTACAAAGAA CAGTATCCGC ATGTAAAAAC TCTGATTTCA
GTGGGAGGCT GGACCGAATC CAAATATTTT TCGGATGTTG CATTAACGGA AGAATCAAGA
AATACATTTG CGGACAGTTG TGTTGAGTTC ATTCGTACAT ACCGGTTTGA TGGAGTGGAT
ATTGACTGGG AATATCCTGT GAGCGGCGGA ATGCCGGAAA ATATTAGAAG GCCGGAGGAC
AAGCAAAACT TTACTCTTTT GTTAAAATGT CTTAGAGAAA AACTTGATGC CGCAGGTGCT
GAAGACGGAA AGCACTACCT TCTTACAATA GCCGCACCGG CGGGAAGCTT TAACATAAAA
AACACCGAGC CTGAGATTTA TCATCAATAC CTGGATTTTA TCAATATTAT GACTTATGAT
TACAGTGGCT CATGGGAAAA TGTGGCGAAT CATTTGGCTC CATTGTATAT GAATCCAAAC
GACCCGTCCT ATCCTGAAAG AAAAGAGAAA TTCAATGTGG ATTGGACGGT AAAGGAATAC
TTAAGACTTG GTGTTCCCGC GGAAAAAATA AATGTGGGAG TACCGTATTA CGCAGCAGGA
TGGCAGGAAG TTAACGGCGG TATAAACGGA CTTTTCGGAA CGTCATCAAA ACCGCTCAGC
AGTACTCAGT TCCACTATAT AAATAGTTTG CTTAAGTCAC CGGACTTAGG TTTTACAAGA
TACTGGGACG AGTATGCAAT GGTTCCCTAT CTGTGGAATC CTGAAAGTGC AACGTTCTAC
AGCTATGAAG ATGAAATTTC CCTTAAAAAT AAATGTGATT ATGTCATTGA AAACAATCTT
GGAGGAATAA TGATTTGGGA ATTGAGCGGG GATTACCCGG CGGAAGGAGG AACTACTTTG
ACATCTGTTA TATACGACAG TTTTACATCA TTCAAAGATG AAATTTACGG TGATTTAAAC
GGTGACGGAA AAGTTAATTC AAGCGATCTT GCAATTTTAA AAAGATATAT GCTGAGGGCC
ATAAGTGATT TTCCGATTCC GGAAGGCAGG AAACTTGCTG ATTTGAACAG GGACGGCAAT
GTTAATTCCA CTGATTATTC GATTCTTAAA AGGTATATAC TGAAAGCTAT TGACAATATA
CCTGTTGATG ATTGA
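
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be stripped before computing any statistic such as GC content. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line of the sequence is used as sample input, so the printed fraction describes that line alone and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence (60 bases), used as sample input.
first_line = "ATGAAAAAAA TACCGTTACT TATGCTGCTG TCAGCAATAA TATTTTTATC TTTACATCCC"

seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # 0.3
```

Passing the full sequence text to the same two functions would give the whole-gene value to compare against the GC content field.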
 
Protein sequence
MKKIPLLMLL SAIIFLSLHP TLSYAQDDSL PTKRIVGYFA EWNIYLENNY YEVSDIPWDM 
VTHINYAFAK IENGRIAIID KWAAIQKPFG DDTWDTPIRG HFGQLIKYKE QYPHVKTLIS
VGGWTESKYF SDVALTEESR NTFADSCVEF IRTYRFDGVD IDWEYPVSGG MPENIRRPED
KQNFTLLLKC LREKLDAAGA EDGKHYLLTI AAPAGSFNIK NTEPEIYHQY LDFINIMTYD
YSGSWENVAN HLAPLYMNPN DPSYPERKEK FNVDWTVKEY LRLGVPAEKI NVGVPYYAAG
WQEVNGGING LFGTSSKPLS STQFHYINSL LKSPDLGFTR YWDEYAMVPY LWNPESATFY
SYEDEISLKN KCDYVIENNL GGIMIWELSG DYPAEGGTTL TSVIYDSFTS FKDEIYGDLN
GDGKVNSSDL AILKRYMLRA ISDFPIPEGR KLADLNRDGN VNSTDYSILK RYILKAIDNI
PVDD
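
The protein sequence can be reproduced from the gene sequence by codon translation. The record lists translation table 11, which uses the same codon-to-amino-acid assignments as the standard code and differs in the permitted start codons. The sketch below is a minimal pure-Python illustration under that assumption; the `translate` helper is hypothetical, and the sample input is only the first 30 bases and the last 9 bases of the gene sequence.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order (standard assignments,
# which table 11 shares; '*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAAAAAA TACCGTTACT TATGCTGCTG"))  # MKKIPLLMLL
# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GATGATTGA"))                         # DD
```

The two outputs match the first ten residues and the last two residues of the protein sequence shown above.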