Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0912 |
Symbol | |
ID | 4810533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1089285 |
End bp | 1092518 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106331 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001037339 |
Protein GI | 125973429 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3693] Beta-1,4-xylanase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA AGAGAGTTTT GGCAAAAATA ACGGCTCTTG TGGTATTGCT GGGAGTGTTT TTTGTATTAC CGTCAAACAT AAGTCAGCTA TATGCTGATT ATGAAGTGGT TCATGACACT TTTGAAGTTA ACTTTGACGG ATGGTGTAAC TTGGGAGTCG ACACATATTT AACGGCAGTT GAAAATGAAG GAAACAACGG TACAAGAGGT ATGATGGTAA TAAATCGCTC CAGTGCGAGT GACGGTGCGT ATTCGGAAAA AGGTTTCTAT CTCGACGGTG GTGTAGAATA CAAGTACAGT GTTTTTGTAA AACACAACGG GACCGGCACC GAAACTTTCA AACTTTCTGT GTCCTATTTG GATTCGGAAA CAGAAGAAGA AAATAAGGAA GTAATTGCAA CAAAGGATGT TGTGGCCGGA GAATGGACTG AGATTTCGGC AAAATACAAA GCACCCAAAA CTGCAGTGAA TATTACTTTG TCAATTACAA CCGACAGCAC TGTAGATTTC ATTTTTGACG ATGTAACCAT AACCCGTAAA GGAATGGCTG AGGCAAACAC AGTATATGCA GCAAACGCTG TGCTGAAAGA TATGTATGCA AACTATTTCA GAGTTGGTTC GGTACTTAAC TCCGGAACGG TAAACAATTC ATCAATAAAG GCCTTGATTT TAAGAGAGTT TAACAGTATT ACCTGTGAAA ATGAAATGAA GCCTGATGCC ACACTGGTTC AATCAGGATC AACCAATACA AATATCAGGG TTTCTCTTAA TCGTGCAGCA AGTATTTTAA ACTTCTGTGC ACAAAATAAT ATAGCCGTCA GAGGTCATAC ACTGGTTTGG CACAGCCAGA CACCTCAATG GTTTTTCAAA GACAATTTCC AGGACAACGG AAACTGGGTT TCCCAATCAG TTATGGACCA GCGTTTGGAA AGCTACATAA AAAATATGTT TGCTGAAATC CAAAGACAGT ATCCGTCTTT GAATCTTTAT GCCTATGACG TTGTAAATGA GGCAGTAAGT GATGATGCAA ACAGGACCAG ATATTATGGC GGGGCGAGGG AACCTGGATA CGGAAATGGT AGATCTCCAT GGGTTCAGAT CTACGGAGAC AACAAATTTA TTGAGAAAGC ATTTACATAT GCAAGAAAAT ATGCTCCGGC AAATTGTAAG CTTTACTACA ACGATTACAA CGAATATTGG GATCATAAGA GAGACTGTAT TGCCTCAATT TGTGCAAACT TGTACAACAA GGGCTTGCTT GACGGTGTGG GAATGCAGTC CCATATTAAT GCGGATATGA ATGGATTCTC AGGTATACAA AATTATAAAG CAGCTTTGCA GAAATATATA AATATCGGTT GTGATGTCCA AATTACCGAG CTTGATATTA GTACAGAAAA CGGCAAATTT AGCTTACAGC AGCAGGCTGA TAAATATAAA GCTGTTTTCC AGGCAGCTGT TGATATAAAC AGAACCTCCA GCAAAGGAAA GGTTACGGCT GTCTGTGTAT GGGGACCTAA TGACGCCAAT ACTTGGCTCG GTTCACAAAA TGCACCTCTT TTGTTTAACG CAAACAATCA ACCGAAACCG GCATACAATG CGGTTGCATC CATTATTCCT CAGTCCGAAT GGGGCGACGG TAACAATCCG GCCGGCGGCG GAGGAGGAGG CAAACCGGAA GAGCCGGATG CAAACGGATA TTATTATCAT GACACTTTTG AAGGAAGCGT AGGACAGTGG ACAGCCAGAG GACCTGCGGA AGTTCTGCTT AGCGGAAGAA CGGCTTACAA AGGTTCAGAA TCACTCTTGG TAAGGAACCG TACGGCAGCA TGGAACGGAG CACAACGGGC GCTGAATCCC AGAACGTTTG TTCCCGGAAA CACATATTGT TTCAGCGTAG TGGCATCGTT TATTGAAGGT GCGTCTTCCA CAACATTCTG CATGAAGCTG CAATACGTAG ACGGAAGCGG CACTCAACGG TATGATACCA TAGATATGAA AACTGTGGGT CCAAATCAGT GGGTTCACCT GTACAATCCG CAATACAGAA TTCCTTCCGA TGCAACAGAT ATGTATGTTT ATGTGGAAAC AGCGGATGAC ACCATTAACT TCTACATAGA TGAGGCAATC GGAGCGGTTG CCGGAACTGT AATCGAAGGA CCTGCTCCAC AGCCTACACA GCCTCCGGTA CTGCTTGGCG ATGTAAACGG TGATGGAACC ATTAACTCAA CTGACTTGAC AATGTTAAAG AGAAGCGTGT TGAGGGCAAT CACCCTTACC GACGATGCAA AGGCTAGAGC AGACGTTGAC AAGAATGGAT CGATAAACAG CACTGATGTT TTACTTCTTT CACGCTACCT TTTAAGAGTA ATCGACAAAT TTCCTGTAGC AGAAAATCCT TCTTCTTCTT TTAAATATGA GTCGGCCGTG CAATATCGGC CGGCTCCTGA TTCTTATTTA AACCCTTGTC CGCAGGCGGG AAGAATTGTC AAGGAAACAT ATACAGGAAT AAACGGAACT AAGAGTCTTA ATGTATATCT TCCATACGGT TATGATCCGA ACAAAAAATA TAACATTTTC TACCTTATGC ATGGCGGCGG TGAAAATGAG AATACGATTT TCAGCAACGA TGTTAAATTG CAAAATATCC TTGACCACGC GATTATGAAC GGTGAACTTG AGCCTTTGAT TGTAGTAACA CCCACTTTCA ACGGCGGAAA CTGCACGGCC CAAAACTTTT ATCAGGAATT CAGGCAAAAT GTCATTCCTT TTGTGGAAAG CAAGTACTCT ACTTATGCAG AATCAACAAC CCCACAGGGA ATAGCCGCTT CAAGAATGCA CAGAGGTTTC GGCGGATTCT CAATGGGAGG ATTGACAACA TGGTATGTAA TGGTTAACTG CCTTGATTAC GTTGCATATT TTATGCCTTT AAGCGGTGAC TACTGGTATG GAAACAGTCC GCAGGATAAG GCTAATTCAA TTGCTGAAGC AATTAACAGA TCCGGACTTT CAAAGAGGGA GTATTTCGTA TTTGCGGCCA CCGGTTCCGA GGATATTGCA TATGCTAATA TGAATCCTCA AATTGAAGCT ATGAAGGCTT TGCCGCATTT TGATTATACT TCGGATTTTT CCAAAGGTAA TTTTTACTTT CTTGTAGCTC CGGGCGCCAC TCACTGGTGG GGATACGTAA GACATTATAT TTATGATGCA CTTCCATATT TCTTCCATGA ATGA
|
Protein sequence | MKNKRVLAKI TALVVLLGVF FVLPSNISQL YADYEVVHDT FEVNFDGWCN LGVDTYLTAV ENEGNNGTRG MMVINRSSAS DGAYSEKGFY LDGGVEYKYS VFVKHNGTGT ETFKLSVSYL DSETEEENKE VIATKDVVAG EWTEISAKYK APKTAVNITL SITTDSTVDF IFDDVTITRK GMAEANTVYA ANAVLKDMYA NYFRVGSVLN SGTVNNSSIK ALILREFNSI TCENEMKPDA TLVQSGSTNT NIRVSLNRAA SILNFCAQNN IAVRGHTLVW HSQTPQWFFK DNFQDNGNWV SQSVMDQRLE SYIKNMFAEI QRQYPSLNLY AYDVVNEAVS DDANRTRYYG GAREPGYGNG RSPWVQIYGD NKFIEKAFTY ARKYAPANCK LYYNDYNEYW DHKRDCIASI CANLYNKGLL DGVGMQSHIN ADMNGFSGIQ NYKAALQKYI NIGCDVQITE LDISTENGKF SLQQQADKYK AVFQAAVDIN RTSSKGKVTA VCVWGPNDAN TWLGSQNAPL LFNANNQPKP AYNAVASIIP QSEWGDGNNP AGGGGGGKPE EPDANGYYYH DTFEGSVGQW TARGPAEVLL SGRTAYKGSE SLLVRNRTAA WNGAQRALNP RTFVPGNTYC FSVVASFIEG ASSTTFCMKL QYVDGSGTQR YDTIDMKTVG PNQWVHLYNP QYRIPSDATD MYVYVETADD TINFYIDEAI GAVAGTVIEG PAPQPTQPPV LLGDVNGDGT INSTDLTMLK RSVLRAITLT DDAKARADVD KNGSINSTDV LLLSRYLLRV IDKFPVAENP SSSFKYESAV QYRPAPDSYL NPCPQAGRIV KETYTGINGT KSLNVYLPYG YDPNKKYNIF YLMHGGGENE NTIFSNDVKL QNILDHAIMN GELEPLIVVT PTFNGGNCTA QNFYQEFRQN VIPFVESKYS TYAESTTPQG IAASRMHRGF GGFSMGGLTT WYVMVNCLDY VAYFMPLSGD YWYGNSPQDK ANSIAEAINR SGLSKREYFV FAATGSEDIA YANMNPQIEA MKALPHFDYT SDFSKGNFYF LVAPGATHWW GYVRHYIYDA LPYFFHE
|
| |