Gene Information |
Locus tag | Cthe_2138 |
Symbol | |
ID | 4811185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2537843 |
End bp | 2539585 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107542 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001038535 |
Protein GI | 125974625 |
COG category | [R] General function prediction only |
COG ID | [COG3940] Predicted beta-xylosidase |
TIGRFAM ID | |
|
|
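The length fields in the Gene Information table can be derived from the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state its convention, but the listed numbers agree with it) and that the CDS ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2537843, 2539585)
print(length)                  # 1743
print(protein_length(length))  # 580
```

Both results match the Gene Length (1743 bp) and Protein Length (580 aa) fields for Cthe_2138.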
Plasmid Coverage Information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATACTAT TAGCCACCTC AGTGTGTTAC TCAAGTAATC CTTCGATAAC GCAGGCTTCA ACGGGTGCCG ATGGTGCTAT AGCAAAGCTT CAGTCATATA ATTATTCGCA TATGTACATA AGAAATGCAA ACTTTGATGT AAGAATAGAT GATAATGTTA CACCGGAAAC AGATGCCCAA TGGGTGCTTG TTCCCGGCCT TGCCAACAGC GGCGAGGGAT ATGTGTCTAT TCAATCTGTT GATCATCTGG GATATTACTT AAGACACTGG AATTATGATT TCAGATTGGA AAAAAACGAC GGGACCAGAA TTTTTGCGGA GGATGCAACA TTTAAAATGG TTCCGGGGCT TGCGGATCCT TCATATACTT CTTTCCAGTC CTATAATTAT CCTACCAGAT ATATAAGGCA TTATAATTAC CTGTTGAGGT TGGATGAAAT TGTAACAGCC CTTGACAGAG AGGATGCCAC ATTCAGAGTG ATTGACAGTT CATCTGTGGA TCCCGACAAA GCGGATGATT CGGTTATTGT AACAAATCCG ATCGTCAGGC GGCGGGCAGA TCCCTGGGTT TACAGGCATA CCGACGGTTA TTATTATATG ACGGCATCCG TACCGGAGTA TGACAGAATA GAGCTAAGAC GGTCGAGAAC ATTGCAAGGA CTTTCAACAG CCACACCTAA GACGATATGG AGAAGGCACT CCAGCGGAAT TATGGGAGGA CATATCTGGG CTCCGGAGAT TCATTTCATT GACGGAAAAT GGTATATTTA TTTTTCAGCA GGAACATCCA CTAATTACTT CGATATACGT TTGTATGTCC TTGAATGTTC GGATTCAAAT CCTCTTACCG GAACCTGGGT GGAAAAAGGT CAATTAAAGA CCAACTGGGA GTCTTTCACC CTCGATGCAA CCACCTTTGA ACACAACGGT ACAAGGTATC TTGTATGGGC TCAGAAAGAC CCTAAAATAG CTAGTAACAG CAATATTTAT ATTGCCAAAA TGAATGGGCC TCTTGCCATA ACCGGAAACC AGGTTATGAT TTCGACACCC GAATATTCAT GGGAAAAGAT AGGTTATGCT GTCAATGAAG GCCCTGCCGT TTTAAAGAAA AACGGTAAAA TTTTCATAAC CTTTTCAGCA AGCGCTACAG ACGCAAACTA TTGCATGGGA TTGTTAACTG CCTCCGACAC CGCCAATTTA TTGGATCCGA AATCCTGGCA CAAATCGCCG AACCCCGTAT TTCAGAGCAA TCCATCCACA GGGCAGTACG GACCCGGGCA TAATTCCTTT ACAACTTCAC CCGACGGAAA AGTGGATATT ATGGTATATC ATGCGAGGAA CTACCGGGAT ATAACGGGAG ATCCTTTGTA TGACCCGAAC AGGCATACCC GTGCGCAAAT AGTCAACTGG AATGCTGACG GTACACCGGA CTTCGGAATA CCGGTTGCTG ACGGAACAAA TGTTATATAT ATCCCGCCCC AGACACCAAC ACCAATACCT ACAAACAGTC CTACACCCAG TGATTTAATC TACGGCGATA TAAATGGAGA TAATTCTGTC AATTCAACGG ATTTAACAAT TTTAAAAAGA TATTTGCTTG GAAGTACTGT CCCGACGGCA CCGAACTGGA GACTGGCCGC CGACCTTAAT TTGGACGGAA ATATAAACTC AACGGATTTT ACAATACTTA AGAGGTACAT TTTAGGCAGA ATTGAAGCAC CGCCCTGGGT AAATCAGACA TAG
|
Protein sequence | MILLATSVCY SSNPSITQAS TGADGAIAKL QSYNYSHMYI RNANFDVRID DNVTPETDAQ WVLVPGLANS GEGYVSIQSV DHLGYYLRHW NYDFRLEKND GTRIFAEDAT FKMVPGLADP SYTSFQSYNY PTRYIRHYNY LLRLDEIVTA LDREDATFRV IDSSSVDPDK ADDSVIVTNP IVRRRADPWV YRHTDGYYYM TASVPEYDRI ELRRSRTLQG LSTATPKTIW RRHSSGIMGG HIWAPEIHFI DGKWYIYFSA GTSTNYFDIR LYVLECSDSN PLTGTWVEKG QLKTNWESFT LDATTFEHNG TRYLVWAQKD PKIASNSNIY IAKMNGPLAI TGNQVMISTP EYSWEKIGYA VNEGPAVLKK NGKIFITFSA SATDANYCMG LLTASDTANL LDPKSWHKSP NPVFQSNPST GQYGPGHNSF TTSPDGKVDI MVYHARNYRD ITGDPLYDPN RHTRAQIVNW NADGTPDFGI PVADGTNVIY IPPQTPTPIP TNSPTPSDLI YGDINGDNSV NSTDLTILKR YLLGSTVPTA PNWRLAADLN LDGNINSTDF TILKRYILGR IEAPPWVNQT
|
| |
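The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a string might be joined into a contiguous sequence and its GC content computed is shown below. The helper names are hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two 10-base blocks of the gene sequence above, used only as a demo.
demo = join_blocks("ATGATACTAT TAGCCACCTC")
print(len(demo))         # 20
print(gc_percent(demo))  # 40.0
```

Applied to the full Gene sequence field, the length of the joined string and the GC percentage can be checked against the Gene Length and GC content fields of the record.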