Gene Cthe_1613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1613 
Symbol 
ID4809308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1942344 
End bp1944767 
Gene Length2424 bp 
Protein Length807 aa 
Translation table11 
GC content47% 
IMG OID640107029 
Productglycosyl hydrolase-like protein 
Protein accessionYP_001038030 
Protein GI125974120 
COG category[R] General function prediction only 
COG ID[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.5027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAATG CCCGAATGTA TGAAGCACTT AGAGACTACG GCGACCGCTT TGATACGGTA 
GGCATTTTTA CTTTTGAGGT TGACGAAACA GGCACAATCA CTGAAACCGG TACCAGCATC
AGCAGCATGC TTCCGTATAT TCAGAAATGG CCGCACATTA AGTGGCTGCT CACTATTATG
AATCATGGAA TAGCCAATAT TTTTACTGCA CTTCGCAACA ACGAAAACGG TGCAAAGGAT
AAGTTTCTCA CTGAAATCAT CCGAATAATG AACAAGTATC CATGGTGCGC TGGGGTAGAT
ATTGACCTGG AGCGCGGCGG CGGTTATGAA AACAAGGATG CGGCGAATGC ACTATTTAGG
GATATATACA ATACAGTTAA GTCTTATGAT GCAACAAAGC TTGTCAACAT CTGCCTGCCG
GGTATGATCG GTGTTCAAGG CTCGGTGGGC GGTGAAAACT GGTGTGTTTA TGCCGATCTT
AACGACTATT GCGATACCGC CGCCATCATG AGCTACGGCA TGGCATGGGC GGGTTCTGCT
CCCGGTCCGG TATCTCCCCG TGACTGGCTT GAGGGCATAT ATGATTATGC TGTTTCCGTT
ATGTCGCCGG ACAAGATATT CATGGGTTTG CCTGCTTATG GCTGGAACTG GAGGATCCAT
GATACGCCTG AAAACCTCGG AATAACCTAT CGAGGAGTGT CTAATACCTA CTATGCGGCT
AAATACTGGA TGACTGGGGT TTACAATTTC ACAGGTGATG CACCGCCCCA GCCGTTTATT
CCAATTGTGG CTTACTGGGA TGACTATAAC AAAGTACCTT GGGCTCTTCC TCATGTATAT
GACTATATGG AAGGATGGGA TGCTGTATCC TGGGAATATC CGCTGCTAAA AGGGGTTTAC
AACAGGCGAA GATATTTGAC AAGCTATGGC AAGGAGCAGA AAGCGGAGTT CGGAACCATT
TATATTGACA GGAACGGAGT TCCGGATGAA TACGAAGGAA ATGTCATTAT TACTGATGAG
ATGACCTCAC TGGGAGATGC CCAGGCGTCA GCAGAGTACC GTTTTGAGAT AAGAGAAGCG
GGATATTACG ATATTGCAGT ACAGCTTTGC TTTCCTTACT GGGACAAAAA TGCGATTATT
GTTTCCCTTG ATGGTATATC AAAGACTTTC AGCGAGAACC GTTTATGGTG GCCATACTGG
AGAAGAGTTT GCTGGTTGAC ACTTGCAAAA GGTGTATTTC TCCAAGAGGG AACGCATGCT
ATCAGCATAA GCGGTGGTGT GCCGGGAGTC CAGTTTTACG GTTTTAGGGT TTGCAGTGGA
TTTTCGGAGT ATCCCTTTGC CGGTGAAGCC AGCTTTATGC TCTCTCCTCG TCGGTTCAAG
GATGTAAATG GTGTGATGGT TGAGCCGGAT CGAGGTTTTA AACTGACCTT TGAAATGTTG
CGAAGAAAAC CCGACTCGGC GCTTATTTGG TATGAGGATT TTCGAGACAG GAACATCCTG
CCTGAAAACT ACTGGACTGT GCTGGATGGC GAATGGGATG TTTGGCAAGA CCCAGACAGC
ACAGAAAACC GTCCATATTC CCAGCTCGAG GGATATGGCA AACTTGCATG GAAATACGAC
GGGTTTTCCG ATATTCATAT CCGGGCAAGG CTGGCTTTCC CTCAAAATAG CAGTGGACGG
GCTGGGGTGT TCCTTGGGGA TATTTTCTGC TGCTTAAATT ATGACACGCA AAGAGTCGAG
CTTTATCAAG GTAATTCCTT GCTTGGTAGC TATTCCACCA GTTTCTCAAA AACTGTAGAT
GCCGATCTTC GTGCTAATCC GAATATGTAT ACTATAGAGA TGCGAAAACG CGGCAATAAG
GTAAGAGTAT ATTCAGGTGC AGCTTCAACC CTGCGCTTCA CAGTGAATGT AACCGGTGGT
AGTGGTTATG TAGGTTACTG CTCGGACAAC CGGACGGTAT GCGAGCTGCT GCGACTGGGC
GATGCATGGG TATATGAACC ATACGAGCGT TTTGATGTGG AACTTCCGGA TGGAAATATA
ACCAGCTTTG GCAGGCTTGC TCGCACTGGT GTCACGTGGG ATGATGAATT TCAGGTGTTT
TCAGTAAATA ACGATGTGGA GGAATCGATA ACTCGCAGTG AGGACATTTC GATGGACTAT
GACTTCTTCC ACTCACAACT TTTGGCTCTT TCTTGCGGTA ATGACTATGA AGTAAAGATT
ATACTGAAAG ACATCAATAT CTGGATATCC CGTCTCTTCC TCGGAGATGC AGATGGTTTT
TCTATTCTGT ATTATCAGGA TGTGGACAGC CTTGTTTACT GGGCAAACGA AGCGGCTTAT
CGATGGAAAC TGCGAGGTAT AGCCATCTGG TCTCTTGGGC AGGAGGATAT GCGGTTGTGG
GAGGCGCTTC CGAAGCAAAT ATAG
 
Protein sequence
MGNARMYEAL RDYGDRFDTV GIFTFEVDET GTITETGTSI SSMLPYIQKW PHIKWLLTIM 
NHGIANIFTA LRNNENGAKD KFLTEIIRIM NKYPWCAGVD IDLERGGGYE NKDAANALFR
DIYNTVKSYD ATKLVNICLP GMIGVQGSVG GENWCVYADL NDYCDTAAIM SYGMAWAGSA
PGPVSPRDWL EGIYDYAVSV MSPDKIFMGL PAYGWNWRIH DTPENLGITY RGVSNTYYAA
KYWMTGVYNF TGDAPPQPFI PIVAYWDDYN KVPWALPHVY DYMEGWDAVS WEYPLLKGVY
NRRRYLTSYG KEQKAEFGTI YIDRNGVPDE YEGNVIITDE MTSLGDAQAS AEYRFEIREA
GYYDIAVQLC FPYWDKNAII VSLDGISKTF SENRLWWPYW RRVCWLTLAK GVFLQEGTHA
ISISGGVPGV QFYGFRVCSG FSEYPFAGEA SFMLSPRRFK DVNGVMVEPD RGFKLTFEML
RRKPDSALIW YEDFRDRNIL PENYWTVLDG EWDVWQDPDS TENRPYSQLE GYGKLAWKYD
GFSDIHIRAR LAFPQNSSGR AGVFLGDIFC CLNYDTQRVE LYQGNSLLGS YSTSFSKTVD
ADLRANPNMY TIEMRKRGNK VRVYSGAAST LRFTVNVTGG SGYVGYCSDN RTVCELLRLG
DAWVYEPYER FDVELPDGNI TSFGRLARTG VTWDDEFQVF SVNNDVEESI TRSEDISMDY
DFFHSQLLAL SCGNDYEVKI ILKDINIWIS RLFLGDADGF SILYYQDVDS LVYWANEAAY
RWKLRGIAIW SLGQEDMRLW EALPKQI