Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1838 |
Symbol | |
ID | 4809384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2181190 |
End bp | 2183049 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107252 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001038252 |
Protein GI | 125974342 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3693] Beta-1,4-xylanase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.561717 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAGA AAAAACTGTT GACCCTTTTG ACAGTCTTTG CTCTGCTGAC TGTCGGTATC TGCGGAAGTT TTTTGCCGTT ACCCAAAGCA TCCGCAGCAG CTCTGATTTA CGATGATTTT GAAACAGGTC TGAACGGATG GGGACCAAGA GGACCGGAAA CCGTCGAACT TACCACCGAG GAAGCTTACT CGGGAAGATA CAGTTTGAAG GTCAGCGGAC GTACCAGCAC ATGGAACGGG CCCATGGTTG ACAAAACCGA TGTGTTGACT TTGGGCGAAA GCTATAAGTT GGGCGTATAT GTAAAATTCG TGGGTGATTC CTATTCAAAT GAGCAAAGAT TCAGTTTGCA GCTTCAATAT AACGACGGAG CAGGAGATGT ATACCAAAAT ATAAAAACCG CCACGGTTTA CAAGGGAACA TGGACTTTGC TGGAAGGACA GCTTACAGTT CCCAGCCATG CAAAGGACGT AAAAATATAT GTGGAAACCG AATTTAAAAA TTCTCCGAGT CCGCAGGACT TGATGGATTT CTATATTGAC GATTTCACAG CAACACCTGC AAATTTGCCT GAAATTGAGA AAGATATTCC AAGCTTGAAA GATGTCTTTG CCGGTTATTT CAAAGTGGGT GGTGCCGCAA CTGTGGCGGA ACTGGCGCCG AAGCCTGCAA AAGAGCTTTT CCTCAAGCAT TATAACAGCT TGACTTTTGG TAATGAGTTA AAACCGGAAA GTGTACTTGA CTATGATGCT ACAATTGCTT ATATGGAGGC AAACGGAGGC GACCAGGTTA ATCCGCAGAT AACCTTGAGA GCGGCAAGAC CCCTGTTGGA GTTTGCGAAA GAACACAACA TACCTGTAAG AGGACATACC CTTGTATGGC ACAGCCAGAC ACCGGACTGG TTCTTCAGAG AAAATTACTC TCAGGACGAA AATGCTCCCT GGGCATCCAA GGAAGTAATG CTGCAAAGGT TGGAAAACTA CATAAAGAAT TTAATGGAAG CTTTGGCGAC CGAATATCCG ACGGTTAAGT TCTATGCATG GGACGTTGTG AATGAGGCTG TTGATCCTAA TACTTCAGAC GGTATGAGAA CTCCGGGTTC GAATAACAAA AATCCCGGAA GCTCCCTGTG GATGCAAACC GTTGGAAGAG ATTTTATTGT TAAAGCTTTT GAATATGCAA GAAAATATGC TCCTGCGGAT TGTAAACTCT TCTACAATGA CTATAATGAA TATGAAGACA GAAAATGTGA TTTTATTATT GAAATTCTTA CCGAACTTAA AGCCAAAGGC CTGGTTGACG GTATGGGTAT GCAATCCCAC TGGGTTATGG ATTATCCAAG CATAAGCATG TTTGAAAAAT CCATCAGAAG ATATGCAGCA TTGGGATTGG AAATTCAGCT TACCGAGCTG GATATAAGAA ATCCTGACAA CAGCCAGTGG GCTTTGGAAC GTCAGGCTAA TCGTTATAAG GAGCTTGTAA CAAAATTGGT CGATTTGAAA AAAGAAGGCA TAAACATTAC GGCATTGGTA TTCTGGGGAA TAACCGACGC GACAAGCTGG CTTGGAGGAT ATCCGCTCCT GTTTGACGCG GAATACAAGG CAAAACCTGC ATTTTATGCT ATAGTTAACA GCGTTCCGCC GCTTCCGACA GAACCGCCGG TTCAGGTTAT ACCCGGTGAT GTAAACGGTG ACGGTCGTGT AAATTCATCC GACTTGACTC TTATGAAAAG ATACCTTTTA AAATCCATAA GCGACTTCCC GACACCGGAA GGAAAAATTG CGGCGGATTT AAACGAAGAC GGCAAGGTAA ACTCGACAGA TTTGTTAGCG CTGAAAAAAC TCGTTCTGAG AGAACTTTGA
|
Protein sequence | MLKKKLLTLL TVFALLTVGI CGSFLPLPKA SAAALIYDDF ETGLNGWGPR GPETVELTTE EAYSGRYSLK VSGRTSTWNG PMVDKTDVLT LGESYKLGVY VKFVGDSYSN EQRFSLQLQY NDGAGDVYQN IKTATVYKGT WTLLEGQLTV PSHAKDVKIY VETEFKNSPS PQDLMDFYID DFTATPANLP EIEKDIPSLK DVFAGYFKVG GAATVAELAP KPAKELFLKH YNSLTFGNEL KPESVLDYDA TIAYMEANGG DQVNPQITLR AARPLLEFAK EHNIPVRGHT LVWHSQTPDW FFRENYSQDE NAPWASKEVM LQRLENYIKN LMEALATEYP TVKFYAWDVV NEAVDPNTSD GMRTPGSNNK NPGSSLWMQT VGRDFIVKAF EYARKYAPAD CKLFYNDYNE YEDRKCDFII EILTELKAKG LVDGMGMQSH WVMDYPSISM FEKSIRRYAA LGLEIQLTEL DIRNPDNSQW ALERQANRYK ELVTKLVDLK KEGINITALV FWGITDATSW LGGYPLLFDA EYKAKPAFYA IVNSVPPLPT EPPVQVIPGD VNGDGRVNSS DLTLMKRYLL KSISDFPTPE GKIAADLNED GKVNSTDLLA LKKLVLREL
|
| |