Gene Cthe_2812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2812 
Symbol 
ID4809649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3319096 
End bp3320931 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content45% 
IMG OID640108232 
Productglycoside hydrolase family protein 
Protein accessionYP_001039204 
Protein GI125975294 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.283645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA ATCTTTTTGC AAAAAGAGCG GTGGCTTTCC TGCTCGGTAT TGTGATTACG 
GCTGCAGGGA TTGTCTCTTT CAACACCGTA AGCACCAGTG CCGCCGGAGA ATACAATTAT
GCAAAGGCGC TGCAGTATTC CATGTTCTTC TATGATGCGA ACATGTGCGG TACAGGTGTT
GACGAGAACA GCCTTTTGTC ATGGAGAGGA GACTGCCACG TATATGATGC AAGACTTCCT
CTGGATTCCC AGAACACCAA CATGTCCGAT GGTTTTATAA GCAGCAACAG AAGTGTGCTT
GACCCTGACG GAGACGGCAA AGTTGACGTG TCAGGCGGTT TTCATGACGC CGGCGACCAT
GTGAAGTTTG GTTTGCCTGA GGCTTATGCC GCTTCAACAG TGGGTTGGGG TTACTATGAA
TTTAAAGACC AGTTCCGTGC AACGGGACAG GCCGTCCATG CTGAAGTAAT TTTAAGATAC
TTCAATGACT ATTTTATGAG ATGTACTTTC AGAGACGCTT CCGGAAATGT TGTGGCGTTC
TGTCATCAGG TGGGCGACGG AGATATCGAC CATGCATTTT GGGGTGCTCC GGAAAATGAC
ACCATGTTCA GAAGAGGTTG GTTTATTACC AAAGAAAAGC CTGGAACTGA CATTATTTCG
GCAACAGCAG CTTCTTTAGC AATAAACTAC ATGAATTTTA AAGACACAGA CCCTCAATAT
GCGGCAAAAA GCCTTGATTA TGCAAAAGCT TTGTTTGATT TTGCGGAGAA AAATCCAAAA
GGGGTAGTTC AGGGAGAGGA CGGACCAAAA GGTTATTATG GTTCAAGCAA ATGGCAGGAT
GACTACTGCT GGGCTGCCGC ATGGCTTTAT TTGGCAACGC AGAATGAGCA CTATTTGGAT
GAAGCATTTA AATATTATGA TTATTATGCT CCGCCGGGAT GGATACATTG CTGGAATGAC
GTGTGGTCGG GAACCGCATG TATTTTGGCG GAAATAAATG ATTTGTACGA CAAGGACAGC
CAGAATTTCG AAGACAGGTA TAAAAGAGCT TCCAATAAGA ATCAGTGGGA GCAGATAGAC
TTCTGGAAAC CCATACAAGA TTTGCTTGAC AAGTGGTCGG GTGGCGGTAT TACAGTTACA
CCGGGCGGAT ACGTTTTCCT CAATCAGTGG GGTTCTGCAA GATACAATAC TGCCGCTCAG
CTGATAGCTC TTGTTTATGA CAAGCATCAT GGTGACACAC CGTCAAAATA TGCTAACTGG
GCACGGTCGC AGATGGATTA TCTGTTGGGT AAAAACCCGT TGAATCGCTG CTATGTTGTA
GGCTACAGCA GCAATTCGGT CAAATACCCG CACCACAGAG CGGCTTCCGG ACTGAAAGAT
GCCAATGATT CTTCTCCGCA CAAATATGTG TTGTATGGTG CCCTGGTCGG AGGGCCGGAT
GCAAGTGACC AGCATGTGGA TAGAACAAAT GATTATATTT ACAATGAGGT TGCCATTGAC
TATAATGCCG CTTTTGTGGG AGCATGTGCA GGTCTTTACA GATTCTTCGG GGATTCTTCA
ATGCAGATAG ACCCGTCAAT GCCGTCGCAT AACGTACCTG TACCACCGAC ACCCACACCT
CCTGATACGC AAATTGTATA TGGAGATTTG AACGGCGACC AGAAAGTGAC TTCCACAGAC
TATACGATGC TCAAGAGGTA TTTGATGAAA AGCATTGATA GGTTTAATAC TTCCGAACAA
GCTGCGGATT TGAACAGAGA CGGCAAAATC AATTCCACGG ACTTGACAAT ATTGAAAAGA
TATTTGCTTT ACAGCATACC GTCTCTCCCT ATATAA
 
Protein sequence
MRKNLFAKRA VAFLLGIVIT AAGIVSFNTV STSAAGEYNY AKALQYSMFF YDANMCGTGV 
DENSLLSWRG DCHVYDARLP LDSQNTNMSD GFISSNRSVL DPDGDGKVDV SGGFHDAGDH
VKFGLPEAYA ASTVGWGYYE FKDQFRATGQ AVHAEVILRY FNDYFMRCTF RDASGNVVAF
CHQVGDGDID HAFWGAPEND TMFRRGWFIT KEKPGTDIIS ATAASLAINY MNFKDTDPQY
AAKSLDYAKA LFDFAEKNPK GVVQGEDGPK GYYGSSKWQD DYCWAAAWLY LATQNEHYLD
EAFKYYDYYA PPGWIHCWND VWSGTACILA EINDLYDKDS QNFEDRYKRA SNKNQWEQID
FWKPIQDLLD KWSGGGITVT PGGYVFLNQW GSARYNTAAQ LIALVYDKHH GDTPSKYANW
ARSQMDYLLG KNPLNRCYVV GYSSNSVKYP HHRAASGLKD ANDSSPHKYV LYGALVGGPD
ASDQHVDRTN DYIYNEVAID YNAAFVGACA GLYRFFGDSS MQIDPSMPSH NVPVPPTPTP
PDTQIVYGDL NGDQKVTSTD YTMLKRYLMK SIDRFNTSEQ AADLNRDGKI NSTDLTILKR
YLLYSIPSLP I