Gene Cthe_2138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2138 
Symbol 
ID4811185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2537843 
End bp2539585 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content43% 
IMG OID640107542 
Productglycoside hydrolase family protein 
Protein accessionYP_001038535 
Protein GI125974625 
COG category[R] General function prediction only 
COG ID[COG3940] Predicted beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACTAT TAGCCACCTC AGTGTGTTAC TCAAGTAATC CTTCGATAAC GCAGGCTTCA 
ACGGGTGCCG ATGGTGCTAT AGCAAAGCTT CAGTCATATA ATTATTCGCA TATGTACATA
AGAAATGCAA ACTTTGATGT AAGAATAGAT GATAATGTTA CACCGGAAAC AGATGCCCAA
TGGGTGCTTG TTCCCGGCCT TGCCAACAGC GGCGAGGGAT ATGTGTCTAT TCAATCTGTT
GATCATCTGG GATATTACTT AAGACACTGG AATTATGATT TCAGATTGGA AAAAAACGAC
GGGACCAGAA TTTTTGCGGA GGATGCAACA TTTAAAATGG TTCCGGGGCT TGCGGATCCT
TCATATACTT CTTTCCAGTC CTATAATTAT CCTACCAGAT ATATAAGGCA TTATAATTAC
CTGTTGAGGT TGGATGAAAT TGTAACAGCC CTTGACAGAG AGGATGCCAC ATTCAGAGTG
ATTGACAGTT CATCTGTGGA TCCCGACAAA GCGGATGATT CGGTTATTGT AACAAATCCG
ATCGTCAGGC GGCGGGCAGA TCCCTGGGTT TACAGGCATA CCGACGGTTA TTATTATATG
ACGGCATCCG TACCGGAGTA TGACAGAATA GAGCTAAGAC GGTCGAGAAC ATTGCAAGGA
CTTTCAACAG CCACACCTAA GACGATATGG AGAAGGCACT CCAGCGGAAT TATGGGAGGA
CATATCTGGG CTCCGGAGAT TCATTTCATT GACGGAAAAT GGTATATTTA TTTTTCAGCA
GGAACATCCA CTAATTACTT CGATATACGT TTGTATGTCC TTGAATGTTC GGATTCAAAT
CCTCTTACCG GAACCTGGGT GGAAAAAGGT CAATTAAAGA CCAACTGGGA GTCTTTCACC
CTCGATGCAA CCACCTTTGA ACACAACGGT ACAAGGTATC TTGTATGGGC TCAGAAAGAC
CCTAAAATAG CTAGTAACAG CAATATTTAT ATTGCCAAAA TGAATGGGCC TCTTGCCATA
ACCGGAAACC AGGTTATGAT TTCGACACCC GAATATTCAT GGGAAAAGAT AGGTTATGCT
GTCAATGAAG GCCCTGCCGT TTTAAAGAAA AACGGTAAAA TTTTCATAAC CTTTTCAGCA
AGCGCTACAG ACGCAAACTA TTGCATGGGA TTGTTAACTG CCTCCGACAC CGCCAATTTA
TTGGATCCGA AATCCTGGCA CAAATCGCCG AACCCCGTAT TTCAGAGCAA TCCATCCACA
GGGCAGTACG GACCCGGGCA TAATTCCTTT ACAACTTCAC CCGACGGAAA AGTGGATATT
ATGGTATATC ATGCGAGGAA CTACCGGGAT ATAACGGGAG ATCCTTTGTA TGACCCGAAC
AGGCATACCC GTGCGCAAAT AGTCAACTGG AATGCTGACG GTACACCGGA CTTCGGAATA
CCGGTTGCTG ACGGAACAAA TGTTATATAT ATCCCGCCCC AGACACCAAC ACCAATACCT
ACAAACAGTC CTACACCCAG TGATTTAATC TACGGCGATA TAAATGGAGA TAATTCTGTC
AATTCAACGG ATTTAACAAT TTTAAAAAGA TATTTGCTTG GAAGTACTGT CCCGACGGCA
CCGAACTGGA GACTGGCCGC CGACCTTAAT TTGGACGGAA ATATAAACTC AACGGATTTT
ACAATACTTA AGAGGTACAT TTTAGGCAGA ATTGAAGCAC CGCCCTGGGT AAATCAGACA
TAG
 
Protein sequence
MILLATSVCY SSNPSITQAS TGADGAIAKL QSYNYSHMYI RNANFDVRID DNVTPETDAQ 
WVLVPGLANS GEGYVSIQSV DHLGYYLRHW NYDFRLEKND GTRIFAEDAT FKMVPGLADP
SYTSFQSYNY PTRYIRHYNY LLRLDEIVTA LDREDATFRV IDSSSVDPDK ADDSVIVTNP
IVRRRADPWV YRHTDGYYYM TASVPEYDRI ELRRSRTLQG LSTATPKTIW RRHSSGIMGG
HIWAPEIHFI DGKWYIYFSA GTSTNYFDIR LYVLECSDSN PLTGTWVEKG QLKTNWESFT
LDATTFEHNG TRYLVWAQKD PKIASNSNIY IAKMNGPLAI TGNQVMISTP EYSWEKIGYA
VNEGPAVLKK NGKIFITFSA SATDANYCMG LLTASDTANL LDPKSWHKSP NPVFQSNPST
GQYGPGHNSF TTSPDGKVDI MVYHARNYRD ITGDPLYDPN RHTRAQIVNW NADGTPDFGI
PVADGTNVIY IPPQTPTPIP TNSPTPSDLI YGDINGDNSV NSTDLTILKR YLLGSTVPTA
PNWRLAADLN LDGNINSTDF TILKRYILGR IEAPPWVNQT