Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2119 |
Symbol | |
ID | 4810979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2516532 |
End bp | 2518814 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107526 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001038519 |
Protein GI | 125974609 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3693] Beta-1,4-xylanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATTGTAG GAAAAGTTCT TGATATGGAT GAAAAGACGG CCATTATAAT GACTGATGAC TTTGCTTTTC TGAATGTGGT AAGGACCTCT GAAATGGCAG TTGGTAAAAA AGTGAAGGTT TTGGACTCGG ATATAATCAA GCCTAAAAAT TCTTTGCGCA GATATTTGCC GGTTGCGGCA GTTGCTGCAT GCTTTGTAAT TGTGTTGTCT TTTGTGCTGA TGTTTATTAA TGGAAATACG GCAAGAAAAA ATATATATGC TTATGTTGGC ATTGATATAA ATCCAAGTAT TGAGCTTTGG ATAAATTACA ACAACAAAAT AGCCGAAGCC AAAGCACTGA ACGGCGATGC CGAGACAGTG CTGGAAGGAC TTGAATTAAA AGAAAAAACA GTGGCGGAGG CTGTGAATGA GATTGTGCAA AAGAGCATGG AGCTTGGATT TATTTCCAGG GAGAAGGAGA ATATAATCCT TATATCCACA GCCTGTGATT TAAAAGCAGG GGAAGGTTCG GAGAATAAGG ACGTTCAAAA TAAAATCGGT CAGCTTTTTG ATGATGTGAA CAAGGCGGTT TCAGACCTTA AAAACAGCGG TATTACAACC AGGATTTTAA ATCTTACTTT GGAGGAAAGA GAATCATCCA AAGAAGAAAA TATTTCAATG GGCAGATATG CCGTGTATTT AAAAGCCAAA GAGCAGAATG TAAATTTGAC TATTGATGAG ATTAAAGATG CGGATTTGCT GGAGCTCATT GCCAAGGTCG GCATTGATAA TGAGAATGTT CCGGAGGATA TTGTAACAGA GGATAAAGAC AACCTGGATG CGATAAATAC CGGACCTGCG GAAAGTGCCG TGCCTGAAGT GACTGAAACT TTGCCTGCTA CCTCAACACC CGGCAGGACG GAAGGTAACA CCGCGACAGG TTCGGTTGAC AGTACACCTG CTTTGTCCAA GAACGAAACA CCGGGCAAAA CAGAAACACC GGGAAGGACA TTCAATACAC CGGCAAAATC GTCACTGGGG CAATCTTCGA CGCCCAAGCC GGTATCACCC GTTCAAACGG CAACAGCGAC AAAGGGTATT GGGACTTTGA CACCCAGGAA CTCACCGACG CCAGTAATAC CTTCCACCGG CATTCAATGG ATTGACCAGG CAAATGAGAG AATAAATGAG ATTCGAAAGA GAAATGTACA GATAAAAGTT GTGGATTCCA GCAACAAGCC GATAGAGAAT GCTTATGTGG AGGCGGTTCT CACCAACCAT GCCTTTGGAT TTGGTACAGC CATTACAAGA AGGGCAATGT ACGATTCGAA TTACACTAAG TTTATAAAAG ACCACTTTAA CTGGGCTGTG TTTGAAAATG AATCCAAATG GTATACAAAT GAACCCAGTA TGGGAATTAT AACCTATGAC GATGCTGACT ATTTGTATGA ATTTTGCCGG AGCAACGGAA TAAAAGTGAG GGGACACTGT ATTTTCTGGG AGGCTGAAGA GTGGCAGCCT GCATGGGTAA GGAGTTTGGA TCCGTTTACC CTGCGTTTTG CGGTAGACAA CCGCTTAAAC AGTGCTGTAG GTCATTTTAA AGGAAAATTC GAACATTGGG ATGTAAACAA CGAAATGATT CACGGCAACT TTTTCAAGAG CCGTCTGGGT GAGTCCATCT GGCCTTATAT GTTCAACAGG GCAAGAGAAA TTGATCCGAA TGCAAAGTAT TTTGTAAACA ACAATATAAC CACTTTGAAA GAAGCGGATG ATTGTGTGGC CCTGGTAAAC TGGCTCAGAT CCCAGGGAGT TCGTGTGGAC GGCGTAGGAG TGCACGGACA CTTTGGTGAC TCGGTGGACC GCAATCTTCT CAAAGGAATA CTTGACAAGC TGTCTGTCTT GAACCTTCCG ATATGGATTA CCGAATATGA TTCGGTTACA CCGGATGAAT ACAGGAGGGC GGACAATCTG GAGAACCTCT ATCGTACGGC TTTCAGCCAT CCTTCCGTTG AAGGAATCGT AATGTGGGGT TTCTGGGAAA GAGTGCACTG GAGAGGAAGA GATGCGTCAA TAGTAAATGA TAACTGGACG TTGAATGAAG CCGGCAGAAG GTTTGAGTCC TTGATGAATG AGTGGACTAC CAGGGCTTAT GGAAGCACGG ATGGTTCGGG CAGCTTTGGC TTCAGAGGAT TCTATGGAAC ATACAGGATA ACCGTGACAG TGCCGGGAAA AGGAAAGTAC AACTATACTT TGAATCTGAA CCGCGGCAGC GGAACATTGC AGACTACTTA CAGAATTCCC TGA
|
Protein sequence | MIVGKVLDMD EKTAIIMTDD FAFLNVVRTS EMAVGKKVKV LDSDIIKPKN SLRRYLPVAA VAACFVIVLS FVLMFINGNT ARKNIYAYVG IDINPSIELW INYNNKIAEA KALNGDAETV LEGLELKEKT VAEAVNEIVQ KSMELGFISR EKENIILIST ACDLKAGEGS ENKDVQNKIG QLFDDVNKAV SDLKNSGITT RILNLTLEER ESSKEENISM GRYAVYLKAK EQNVNLTIDE IKDADLLELI AKVGIDNENV PEDIVTEDKD NLDAINTGPA ESAVPEVTET LPATSTPGRT EGNTATGSVD STPALSKNET PGKTETPGRT FNTPAKSSLG QSSTPKPVSP VQTATATKGI GTLTPRNSPT PVIPSTGIQW IDQANERINE IRKRNVQIKV VDSSNKPIEN AYVEAVLTNH AFGFGTAITR RAMYDSNYTK FIKDHFNWAV FENESKWYTN EPSMGIITYD DADYLYEFCR SNGIKVRGHC IFWEAEEWQP AWVRSLDPFT LRFAVDNRLN SAVGHFKGKF EHWDVNNEMI HGNFFKSRLG ESIWPYMFNR AREIDPNAKY FVNNNITTLK EADDCVALVN WLRSQGVRVD GVGVHGHFGD SVDRNLLKGI LDKLSVLNLP IWITEYDSVT PDEYRRADNL ENLYRTAFSH PSVEGIVMWG FWERVHWRGR DASIVNDNWT LNEAGRRFES LMNEWTTRAY GSTDGSGSFG FRGFYGTYRI TVTVPGKGKY NYTLNLNRGS GTLQTTYRIP
|
| |