Gene Cthe_1963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1963 
Symbol 
ID4810746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2340386 
End bp2342899 
Gene Length2514 bp 
Protein Length837 aa 
Translation table11 
GC content43% 
IMG OID640107379 
Productglycoside hydrolase family protein 
Protein accessionYP_001038374 
Protein GI125974464 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3693] Beta-1,4-xylanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.658538 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAGAA AACTTTTCAG TGTATTACTT GTTGGCTTGA TGCTTATGAC ATCGTTGCTT 
GTCACAATAA GCAGTACATC AGCGGCATCC TTGCCAACCA TGCCGCCTTC GGGATATGAC
CAGGTAAGGA ACGGCGTTCC GAGAGGGCAG GTCGTAAATA TTTCTTATTT CTCCACGGCC
ACCAACAGTA CCAGGCCGGC AAGAGTTTAT TTGCCGCCGG GATATTCAAA GGACAAAAAA
TACAGTGTTT TGTATCTCTT ACACGGCATA GGCGGTAGTG AAAACGACTG GTTCGAAGGG
GGAGGCAGAG CCAATGTTAT TGCCGACAAT CTGATTGCCG AGGGAAAAAT CAAGCCCCTG
ATAATTGTAA CACCGAATAC TAACGCCGCC GGTCCGGGAA TAGCGGACGG TTATGAAAAT
TTCACAAAAG ATTTGCTCAA CAGTCTTATT CCCTATATCG AATCTAACTA TTCAGTCTAC
ACCGACCGCG AACATCGGGC GATTGCAGGA CTTTCAATGG GTGGAGGACA ATCGTTTAAT
ATTGGATTGA CCAATCTCGA TAAATTTGCC TATATTGGCC CGATTTCAGC GGCTCCAAAC
ACTTATCCAA ATGAGAGGCT TTTTCCTGAC GGAGGAAAAG CTGCAAGGGA GAAATTGAAA
CTGCTCTTTA TTGCCTGCGG AACCAATGAC AGTCTGATAG GTTTTGGACA GAGAGTACAT
GAATATTGCG TTGCCAACAA CATTAACCAT GTCTATTGGC TTATTCAGGG CGGAGGACAC
GATTTTAATG TGTGGAAGCC CGGATTGTGG AATTTCCTTC AAATGGCAGA TGAAGCCGGA
TTGACGAGGG ATGGAAACAC TCCGGTTCCG ACACCCAGTC CAAAGCCGGC TAACACACGT
ATTGAAGCGG AAGATTATGA CGGTATTAAT TCTTCAAGTA TTGAGATAAT AGGTGTTCCA
CCTGAAGGAG GCAGAGGAAT AGGTTATATT ACCAGTGGTG ATTATCTGGT ATACAAGAGT
ATAGACTTTG GAAACGGAGC AACGTCGTTT AAGGCCAAGG TTGCAAATGC AAATACTTCC
AATATTGAAC TTAGATTAAA CGGTCCGAAT GGTACTCTCA TAGGCACACT CTCGGTAAAA
TCCACAGGAG ATTGGAATAC ATATGAGGAG CAAACTTGCA GCATTAGCAA AGTCACCGGA
ATAAATGATT TGTACTTGGT ATTCAAAGGC CCTGTAAACA TAGACTGGTT CACTTTTGGC
GTTGAAAGCA GTTCCACAGG TCTGGGGGAT TTAAATGGTG ACGGAAATAT TAACTCGTCG
GACCTTCAGG CGTTAAAGAG GCATTTGCTC GGTATATCAC CGCTTACGGG AGAGGCTCTT
TTAAGAGCGG ATGTAAATAG GAGCGGCAAA GTGGATTCTA CTGACTATTC AGTGCTGAAA
AGATATATAC TCCGCATTAT TACAGAGTTC CCCGGACAAG GTGATGTACA GACACCCAAT
CCGTCTGTTA CTCCGACACA AACTCCTATC CCCACGATTT CGGGAAATGC TCTTAGGGAT
TATGCGGAGG CAAGGGGAAT AAAAATCGGA ACATGTGTCA ACTATCCGTT TTACAACAAT
TCAGATCCAA CCTACAACAG CATTTTGCAA AGAGAATTTT CAATGGTTGT ATGTGAAAAT
GAAATGAAGT TTGATGCTTT GCAGCCGAGA CAAAACGTTT TTGATTTTTC GAAAGGAGAC
CAGTTGCTTG CTTTTGCAGA AAGAAACGGT ATGCAGATGA GGGGACATAC GTTGATTTGG
CACAATCAAA ACCCGTCATG GCTTACAAAC GGTAACTGGA ACCGGGATTC GCTGCTTGCG
GTAATGAAAA ATCACATTAC CACTGTTATG ACCCATTACA AAGGTAAAAT TGTTGAGTGG
GATGTGGCAA ACGAATGTAT GGATGATTCC GGCAACGGCT TAAGAAGCAG CATATGGAGA
AATGTAATCG GTCAGGACTA CCTTGACTAT GCTTTCAGGT ATGCAAGAGA AGCAGATCCC
GATGCACTTC TTTTCTACAA TGATTATAAT ATTGAAGACT TGGGTCCAAA GTCCAATGCG
GTATTTAACA TGATTAAAAG TATGAAGGAA AGAGGTGTGC CGATTGACGG AGTAGGATTC
CAATGCCACT TTATCAATGG AATGAGCCCC GAGTACCTTG CCAGCATTGA TCAAAATATT
AAGAGATATG CGGAAATAGG CGTTATAGTA TCCTTTACCG AAATAGATAT ACGCATACCT
CAGTCGGAAA ACCCGGCAAC TGCATTCCAG GTACAGGCAA ACAACTATAA GGAACTTATG
AAAATTTGTC TGGCAAACCC CAATTGCAAT ACCTTTGTAA TGTGGGGATT CACAGATAAA
TACACATGGA TTCCGGGAAC TTTCCCAGGA TATGGCAATC CATTGATTTA TGACAGCAAT
TACAATCCGA AACCGGCATA CAATGCAATA AAGGAAGCTC TTATGGGCTA TTGA
 
Protein sequence
MSRKLFSVLL VGLMLMTSLL VTISSTSAAS LPTMPPSGYD QVRNGVPRGQ VVNISYFSTA 
TNSTRPARVY LPPGYSKDKK YSVLYLLHGI GGSENDWFEG GGRANVIADN LIAEGKIKPL
IIVTPNTNAA GPGIADGYEN FTKDLLNSLI PYIESNYSVY TDREHRAIAG LSMGGGQSFN
IGLTNLDKFA YIGPISAAPN TYPNERLFPD GGKAAREKLK LLFIACGTND SLIGFGQRVH
EYCVANNINH VYWLIQGGGH DFNVWKPGLW NFLQMADEAG LTRDGNTPVP TPSPKPANTR
IEAEDYDGIN SSSIEIIGVP PEGGRGIGYI TSGDYLVYKS IDFGNGATSF KAKVANANTS
NIELRLNGPN GTLIGTLSVK STGDWNTYEE QTCSISKVTG INDLYLVFKG PVNIDWFTFG
VESSSTGLGD LNGDGNINSS DLQALKRHLL GISPLTGEAL LRADVNRSGK VDSTDYSVLK
RYILRIITEF PGQGDVQTPN PSVTPTQTPI PTISGNALRD YAEARGIKIG TCVNYPFYNN
SDPTYNSILQ REFSMVVCEN EMKFDALQPR QNVFDFSKGD QLLAFAERNG MQMRGHTLIW
HNQNPSWLTN GNWNRDSLLA VMKNHITTVM THYKGKIVEW DVANECMDDS GNGLRSSIWR
NVIGQDYLDY AFRYAREADP DALLFYNDYN IEDLGPKSNA VFNMIKSMKE RGVPIDGVGF
QCHFINGMSP EYLASIDQNI KRYAEIGVIV SFTEIDIRIP QSENPATAFQ VQANNYKELM
KICLANPNCN TFVMWGFTDK YTWIPGTFPG YGNPLIYDSN YNPKPAYNAI KEALMGY