Gene Information |
Locus tag | Cthe_1963 |
Symbol | |
ID | 4810746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2340386 |
End bp | 2342899 |
Gene Length | 2514 bp |
Protein Length | 837 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107379 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001038374 |
Protein GI | 125974464 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3693] Beta-1,4-xylanase |
TIGRFAM ID | |
|
|
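The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (this is an assumption, though it matches the listed gene length), and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 2340386
END_BP = 2342899


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 2514
print(protein_length(length))  # 837
```

Both results agree with the Gene Length (2514 bp) and Protein Length (837 aa) fields of this record.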
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.658538 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAGAA AACTTTTCAG TGTATTACTT GTTGGCTTGA TGCTTATGAC ATCGTTGCTT GTCACAATAA GCAGTACATC AGCGGCATCC TTGCCAACCA TGCCGCCTTC GGGATATGAC CAGGTAAGGA ACGGCGTTCC GAGAGGGCAG GTCGTAAATA TTTCTTATTT CTCCACGGCC ACCAACAGTA CCAGGCCGGC AAGAGTTTAT TTGCCGCCGG GATATTCAAA GGACAAAAAA TACAGTGTTT TGTATCTCTT ACACGGCATA GGCGGTAGTG AAAACGACTG GTTCGAAGGG GGAGGCAGAG CCAATGTTAT TGCCGACAAT CTGATTGCCG AGGGAAAAAT CAAGCCCCTG ATAATTGTAA CACCGAATAC TAACGCCGCC GGTCCGGGAA TAGCGGACGG TTATGAAAAT TTCACAAAAG ATTTGCTCAA CAGTCTTATT CCCTATATCG AATCTAACTA TTCAGTCTAC ACCGACCGCG AACATCGGGC GATTGCAGGA CTTTCAATGG GTGGAGGACA ATCGTTTAAT ATTGGATTGA CCAATCTCGA TAAATTTGCC TATATTGGCC CGATTTCAGC GGCTCCAAAC ACTTATCCAA ATGAGAGGCT TTTTCCTGAC GGAGGAAAAG CTGCAAGGGA GAAATTGAAA CTGCTCTTTA TTGCCTGCGG AACCAATGAC AGTCTGATAG GTTTTGGACA GAGAGTACAT GAATATTGCG TTGCCAACAA CATTAACCAT GTCTATTGGC TTATTCAGGG CGGAGGACAC GATTTTAATG TGTGGAAGCC CGGATTGTGG AATTTCCTTC AAATGGCAGA TGAAGCCGGA TTGACGAGGG ATGGAAACAC TCCGGTTCCG ACACCCAGTC CAAAGCCGGC TAACACACGT ATTGAAGCGG AAGATTATGA CGGTATTAAT TCTTCAAGTA TTGAGATAAT AGGTGTTCCA CCTGAAGGAG GCAGAGGAAT AGGTTATATT ACCAGTGGTG ATTATCTGGT ATACAAGAGT ATAGACTTTG GAAACGGAGC AACGTCGTTT AAGGCCAAGG TTGCAAATGC AAATACTTCC AATATTGAAC TTAGATTAAA CGGTCCGAAT GGTACTCTCA TAGGCACACT CTCGGTAAAA TCCACAGGAG ATTGGAATAC ATATGAGGAG CAAACTTGCA GCATTAGCAA AGTCACCGGA ATAAATGATT TGTACTTGGT ATTCAAAGGC CCTGTAAACA TAGACTGGTT CACTTTTGGC GTTGAAAGCA GTTCCACAGG TCTGGGGGAT TTAAATGGTG ACGGAAATAT TAACTCGTCG GACCTTCAGG CGTTAAAGAG GCATTTGCTC GGTATATCAC CGCTTACGGG AGAGGCTCTT TTAAGAGCGG ATGTAAATAG GAGCGGCAAA GTGGATTCTA CTGACTATTC AGTGCTGAAA AGATATATAC TCCGCATTAT TACAGAGTTC CCCGGACAAG GTGATGTACA GACACCCAAT CCGTCTGTTA CTCCGACACA AACTCCTATC CCCACGATTT CGGGAAATGC TCTTAGGGAT TATGCGGAGG CAAGGGGAAT AAAAATCGGA ACATGTGTCA ACTATCCGTT TTACAACAAT TCAGATCCAA CCTACAACAG CATTTTGCAA AGAGAATTTT CAATGGTTGT ATGTGAAAAT GAAATGAAGT TTGATGCTTT GCAGCCGAGA CAAAACGTTT TTGATTTTTC GAAAGGAGAC CAGTTGCTTG CTTTTGCAGA AAGAAACGGT ATGCAGATGA GGGGACATAC GTTGATTTGG 
CACAATCAAA ACCCGTCATG GCTTACAAAC GGTAACTGGA ACCGGGATTC GCTGCTTGCG GTAATGAAAA ATCACATTAC CACTGTTATG ACCCATTACA AAGGTAAAAT TGTTGAGTGG GATGTGGCAA ACGAATGTAT GGATGATTCC GGCAACGGCT TAAGAAGCAG CATATGGAGA AATGTAATCG GTCAGGACTA CCTTGACTAT GCTTTCAGGT ATGCAAGAGA AGCAGATCCC GATGCACTTC TTTTCTACAA TGATTATAAT ATTGAAGACT TGGGTCCAAA GTCCAATGCG GTATTTAACA TGATTAAAAG TATGAAGGAA AGAGGTGTGC CGATTGACGG AGTAGGATTC CAATGCCACT TTATCAATGG AATGAGCCCC GAGTACCTTG CCAGCATTGA TCAAAATATT AAGAGATATG CGGAAATAGG CGTTATAGTA TCCTTTACCG AAATAGATAT ACGCATACCT CAGTCGGAAA ACCCGGCAAC TGCATTCCAG GTACAGGCAA ACAACTATAA GGAACTTATG AAAATTTGTC TGGCAAACCC CAATTGCAAT ACCTTTGTAA TGTGGGGATT CACAGATAAA TACACATGGA TTCCGGGAAC TTTCCCAGGA TATGGCAATC CATTGATTTA TGACAGCAAT TACAATCCGA AACCGGCATA CAATGCAATA AAGGAAGCTC TTATGGGCTA TTGA
|
Protein sequence | MSRKLFSVLL VGLMLMTSLL VTISSTSAAS LPTMPPSGYD QVRNGVPRGQ VVNISYFSTA TNSTRPARVY LPPGYSKDKK YSVLYLLHGI GGSENDWFEG GGRANVIADN LIAEGKIKPL IIVTPNTNAA GPGIADGYEN FTKDLLNSLI PYIESNYSVY TDREHRAIAG LSMGGGQSFN IGLTNLDKFA YIGPISAAPN TYPNERLFPD GGKAAREKLK LLFIACGTND SLIGFGQRVH EYCVANNINH VYWLIQGGGH DFNVWKPGLW NFLQMADEAG LTRDGNTPVP TPSPKPANTR IEAEDYDGIN SSSIEIIGVP PEGGRGIGYI TSGDYLVYKS IDFGNGATSF KAKVANANTS NIELRLNGPN GTLIGTLSVK STGDWNTYEE QTCSISKVTG INDLYLVFKG PVNIDWFTFG VESSSTGLGD LNGDGNINSS DLQALKRHLL GISPLTGEAL LRADVNRSGK VDSTDYSVLK RYILRIITEF PGQGDVQTPN PSVTPTQTPI PTISGNALRD YAEARGIKIG TCVNYPFYNN SDPTYNSILQ REFSMVVCEN EMKFDALQPR QNVFDFSKGD QLLAFAERNG MQMRGHTLIW HNQNPSWLTN GNWNRDSLLA VMKNHITTVM THYKGKIVEW DVANECMDDS GNGLRSSIWR NVIGQDYLDY AFRYAREADP DALLFYNDYN IEDLGPKSNA VFNMIKSMKE RGVPIDGVGF QCHFINGMSP EYLASIDQNI KRYAEIGVIV SFTEIDIRIP QSENPATAFQ VQANNYKELM KICLANPNCN TFVMWGFTDK YTWIPGTFPG YGNPLIYDSN YNPKPAYNAI KEALMGY
|
| |
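The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted string could be normalized and its GC fraction computed is shown here; it assumes the GC content field is simply the fraction of G and C bases in the gene sequence, and the helper names are hypothetical. The example uses only the first 20 bases of the gene sequence, so it does not reproduce the 43% figure for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence shown in the record.
example = clean_sequence("ATGTCAAGAA AACTTTTCAG")
print(len(example))          # 20
print(gc_fraction(example))  # 0.3
```

Applying the same two functions to the complete gene sequence would give the value to compare against the GC content field.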