Gene Cthe_0912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0912 
Symbol 
ID4810533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1089285 
End bp1092518 
Gene Length3234 bp 
Protein Length1077 aa 
Translation table11 
GC content42% 
IMG OID640106331 
Productglycoside hydrolase family protein 
Protein accessionYP_001037339 
Protein GI125973429 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3693] Beta-1,4-xylanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACA AGAGAGTTTT GGCAAAAATA ACGGCTCTTG TGGTATTGCT GGGAGTGTTT 
TTTGTATTAC CGTCAAACAT AAGTCAGCTA TATGCTGATT ATGAAGTGGT TCATGACACT
TTTGAAGTTA ACTTTGACGG ATGGTGTAAC TTGGGAGTCG ACACATATTT AACGGCAGTT
GAAAATGAAG GAAACAACGG TACAAGAGGT ATGATGGTAA TAAATCGCTC CAGTGCGAGT
GACGGTGCGT ATTCGGAAAA AGGTTTCTAT CTCGACGGTG GTGTAGAATA CAAGTACAGT
GTTTTTGTAA AACACAACGG GACCGGCACC GAAACTTTCA AACTTTCTGT GTCCTATTTG
GATTCGGAAA CAGAAGAAGA AAATAAGGAA GTAATTGCAA CAAAGGATGT TGTGGCCGGA
GAATGGACTG AGATTTCGGC AAAATACAAA GCACCCAAAA CTGCAGTGAA TATTACTTTG
TCAATTACAA CCGACAGCAC TGTAGATTTC ATTTTTGACG ATGTAACCAT AACCCGTAAA
GGAATGGCTG AGGCAAACAC AGTATATGCA GCAAACGCTG TGCTGAAAGA TATGTATGCA
AACTATTTCA GAGTTGGTTC GGTACTTAAC TCCGGAACGG TAAACAATTC ATCAATAAAG
GCCTTGATTT TAAGAGAGTT TAACAGTATT ACCTGTGAAA ATGAAATGAA GCCTGATGCC
ACACTGGTTC AATCAGGATC AACCAATACA AATATCAGGG TTTCTCTTAA TCGTGCAGCA
AGTATTTTAA ACTTCTGTGC ACAAAATAAT ATAGCCGTCA GAGGTCATAC ACTGGTTTGG
CACAGCCAGA CACCTCAATG GTTTTTCAAA GACAATTTCC AGGACAACGG AAACTGGGTT
TCCCAATCAG TTATGGACCA GCGTTTGGAA AGCTACATAA AAAATATGTT TGCTGAAATC
CAAAGACAGT ATCCGTCTTT GAATCTTTAT GCCTATGACG TTGTAAATGA GGCAGTAAGT
GATGATGCAA ACAGGACCAG ATATTATGGC GGGGCGAGGG AACCTGGATA CGGAAATGGT
AGATCTCCAT GGGTTCAGAT CTACGGAGAC AACAAATTTA TTGAGAAAGC ATTTACATAT
GCAAGAAAAT ATGCTCCGGC AAATTGTAAG CTTTACTACA ACGATTACAA CGAATATTGG
GATCATAAGA GAGACTGTAT TGCCTCAATT TGTGCAAACT TGTACAACAA GGGCTTGCTT
GACGGTGTGG GAATGCAGTC CCATATTAAT GCGGATATGA ATGGATTCTC AGGTATACAA
AATTATAAAG CAGCTTTGCA GAAATATATA AATATCGGTT GTGATGTCCA AATTACCGAG
CTTGATATTA GTACAGAAAA CGGCAAATTT AGCTTACAGC AGCAGGCTGA TAAATATAAA
GCTGTTTTCC AGGCAGCTGT TGATATAAAC AGAACCTCCA GCAAAGGAAA GGTTACGGCT
GTCTGTGTAT GGGGACCTAA TGACGCCAAT ACTTGGCTCG GTTCACAAAA TGCACCTCTT
TTGTTTAACG CAAACAATCA ACCGAAACCG GCATACAATG CGGTTGCATC CATTATTCCT
CAGTCCGAAT GGGGCGACGG TAACAATCCG GCCGGCGGCG GAGGAGGAGG CAAACCGGAA
GAGCCGGATG CAAACGGATA TTATTATCAT GACACTTTTG AAGGAAGCGT AGGACAGTGG
ACAGCCAGAG GACCTGCGGA AGTTCTGCTT AGCGGAAGAA CGGCTTACAA AGGTTCAGAA
TCACTCTTGG TAAGGAACCG TACGGCAGCA TGGAACGGAG CACAACGGGC GCTGAATCCC
AGAACGTTTG TTCCCGGAAA CACATATTGT TTCAGCGTAG TGGCATCGTT TATTGAAGGT
GCGTCTTCCA CAACATTCTG CATGAAGCTG CAATACGTAG ACGGAAGCGG CACTCAACGG
TATGATACCA TAGATATGAA AACTGTGGGT CCAAATCAGT GGGTTCACCT GTACAATCCG
CAATACAGAA TTCCTTCCGA TGCAACAGAT ATGTATGTTT ATGTGGAAAC AGCGGATGAC
ACCATTAACT TCTACATAGA TGAGGCAATC GGAGCGGTTG CCGGAACTGT AATCGAAGGA
CCTGCTCCAC AGCCTACACA GCCTCCGGTA CTGCTTGGCG ATGTAAACGG TGATGGAACC
ATTAACTCAA CTGACTTGAC AATGTTAAAG AGAAGCGTGT TGAGGGCAAT CACCCTTACC
GACGATGCAA AGGCTAGAGC AGACGTTGAC AAGAATGGAT CGATAAACAG CACTGATGTT
TTACTTCTTT CACGCTACCT TTTAAGAGTA ATCGACAAAT TTCCTGTAGC AGAAAATCCT
TCTTCTTCTT TTAAATATGA GTCGGCCGTG CAATATCGGC CGGCTCCTGA TTCTTATTTA
AACCCTTGTC CGCAGGCGGG AAGAATTGTC AAGGAAACAT ATACAGGAAT AAACGGAACT
AAGAGTCTTA ATGTATATCT TCCATACGGT TATGATCCGA ACAAAAAATA TAACATTTTC
TACCTTATGC ATGGCGGCGG TGAAAATGAG AATACGATTT TCAGCAACGA TGTTAAATTG
CAAAATATCC TTGACCACGC GATTATGAAC GGTGAACTTG AGCCTTTGAT TGTAGTAACA
CCCACTTTCA ACGGCGGAAA CTGCACGGCC CAAAACTTTT ATCAGGAATT CAGGCAAAAT
GTCATTCCTT TTGTGGAAAG CAAGTACTCT ACTTATGCAG AATCAACAAC CCCACAGGGA
ATAGCCGCTT CAAGAATGCA CAGAGGTTTC GGCGGATTCT CAATGGGAGG ATTGACAACA
TGGTATGTAA TGGTTAACTG CCTTGATTAC GTTGCATATT TTATGCCTTT AAGCGGTGAC
TACTGGTATG GAAACAGTCC GCAGGATAAG GCTAATTCAA TTGCTGAAGC AATTAACAGA
TCCGGACTTT CAAAGAGGGA GTATTTCGTA TTTGCGGCCA CCGGTTCCGA GGATATTGCA
TATGCTAATA TGAATCCTCA AATTGAAGCT ATGAAGGCTT TGCCGCATTT TGATTATACT
TCGGATTTTT CCAAAGGTAA TTTTTACTTT CTTGTAGCTC CGGGCGCCAC TCACTGGTGG
GGATACGTAA GACATTATAT TTATGATGCA CTTCCATATT TCTTCCATGA ATGA
 
Protein sequence
MKNKRVLAKI TALVVLLGVF FVLPSNISQL YADYEVVHDT FEVNFDGWCN LGVDTYLTAV 
ENEGNNGTRG MMVINRSSAS DGAYSEKGFY LDGGVEYKYS VFVKHNGTGT ETFKLSVSYL
DSETEEENKE VIATKDVVAG EWTEISAKYK APKTAVNITL SITTDSTVDF IFDDVTITRK
GMAEANTVYA ANAVLKDMYA NYFRVGSVLN SGTVNNSSIK ALILREFNSI TCENEMKPDA
TLVQSGSTNT NIRVSLNRAA SILNFCAQNN IAVRGHTLVW HSQTPQWFFK DNFQDNGNWV
SQSVMDQRLE SYIKNMFAEI QRQYPSLNLY AYDVVNEAVS DDANRTRYYG GAREPGYGNG
RSPWVQIYGD NKFIEKAFTY ARKYAPANCK LYYNDYNEYW DHKRDCIASI CANLYNKGLL
DGVGMQSHIN ADMNGFSGIQ NYKAALQKYI NIGCDVQITE LDISTENGKF SLQQQADKYK
AVFQAAVDIN RTSSKGKVTA VCVWGPNDAN TWLGSQNAPL LFNANNQPKP AYNAVASIIP
QSEWGDGNNP AGGGGGGKPE EPDANGYYYH DTFEGSVGQW TARGPAEVLL SGRTAYKGSE
SLLVRNRTAA WNGAQRALNP RTFVPGNTYC FSVVASFIEG ASSTTFCMKL QYVDGSGTQR
YDTIDMKTVG PNQWVHLYNP QYRIPSDATD MYVYVETADD TINFYIDEAI GAVAGTVIEG
PAPQPTQPPV LLGDVNGDGT INSTDLTMLK RSVLRAITLT DDAKARADVD KNGSINSTDV
LLLSRYLLRV IDKFPVAENP SSSFKYESAV QYRPAPDSYL NPCPQAGRIV KETYTGINGT
KSLNVYLPYG YDPNKKYNIF YLMHGGGENE NTIFSNDVKL QNILDHAIMN GELEPLIVVT
PTFNGGNCTA QNFYQEFRQN VIPFVESKYS TYAESTTPQG IAASRMHRGF GGFSMGGLTT
WYVMVNCLDY VAYFMPLSGD YWYGNSPQDK ANSIAEAINR SGLSKREYFV FAATGSEDIA
YANMNPQIEA MKALPHFDYT SDFSKGNFYF LVAPGATHWW GYVRHYIYDA LPYFFHE