Gene Cthe_1838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1838 
Symbol 
ID4809384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2181190 
End bp2183049 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content44% 
IMG OID640107252 
Productglycoside hydrolase family protein 
Protein accessionYP_001038252 
Protein GI125974342 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3693] Beta-1,4-xylanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.561717 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAAGA AAAAACTGTT GACCCTTTTG ACAGTCTTTG CTCTGCTGAC TGTCGGTATC 
TGCGGAAGTT TTTTGCCGTT ACCCAAAGCA TCCGCAGCAG CTCTGATTTA CGATGATTTT
GAAACAGGTC TGAACGGATG GGGACCAAGA GGACCGGAAA CCGTCGAACT TACCACCGAG
GAAGCTTACT CGGGAAGATA CAGTTTGAAG GTCAGCGGAC GTACCAGCAC ATGGAACGGG
CCCATGGTTG ACAAAACCGA TGTGTTGACT TTGGGCGAAA GCTATAAGTT GGGCGTATAT
GTAAAATTCG TGGGTGATTC CTATTCAAAT GAGCAAAGAT TCAGTTTGCA GCTTCAATAT
AACGACGGAG CAGGAGATGT ATACCAAAAT ATAAAAACCG CCACGGTTTA CAAGGGAACA
TGGACTTTGC TGGAAGGACA GCTTACAGTT CCCAGCCATG CAAAGGACGT AAAAATATAT
GTGGAAACCG AATTTAAAAA TTCTCCGAGT CCGCAGGACT TGATGGATTT CTATATTGAC
GATTTCACAG CAACACCTGC AAATTTGCCT GAAATTGAGA AAGATATTCC AAGCTTGAAA
GATGTCTTTG CCGGTTATTT CAAAGTGGGT GGTGCCGCAA CTGTGGCGGA ACTGGCGCCG
AAGCCTGCAA AAGAGCTTTT CCTCAAGCAT TATAACAGCT TGACTTTTGG TAATGAGTTA
AAACCGGAAA GTGTACTTGA CTATGATGCT ACAATTGCTT ATATGGAGGC AAACGGAGGC
GACCAGGTTA ATCCGCAGAT AACCTTGAGA GCGGCAAGAC CCCTGTTGGA GTTTGCGAAA
GAACACAACA TACCTGTAAG AGGACATACC CTTGTATGGC ACAGCCAGAC ACCGGACTGG
TTCTTCAGAG AAAATTACTC TCAGGACGAA AATGCTCCCT GGGCATCCAA GGAAGTAATG
CTGCAAAGGT TGGAAAACTA CATAAAGAAT TTAATGGAAG CTTTGGCGAC CGAATATCCG
ACGGTTAAGT TCTATGCATG GGACGTTGTG AATGAGGCTG TTGATCCTAA TACTTCAGAC
GGTATGAGAA CTCCGGGTTC GAATAACAAA AATCCCGGAA GCTCCCTGTG GATGCAAACC
GTTGGAAGAG ATTTTATTGT TAAAGCTTTT GAATATGCAA GAAAATATGC TCCTGCGGAT
TGTAAACTCT TCTACAATGA CTATAATGAA TATGAAGACA GAAAATGTGA TTTTATTATT
GAAATTCTTA CCGAACTTAA AGCCAAAGGC CTGGTTGACG GTATGGGTAT GCAATCCCAC
TGGGTTATGG ATTATCCAAG CATAAGCATG TTTGAAAAAT CCATCAGAAG ATATGCAGCA
TTGGGATTGG AAATTCAGCT TACCGAGCTG GATATAAGAA ATCCTGACAA CAGCCAGTGG
GCTTTGGAAC GTCAGGCTAA TCGTTATAAG GAGCTTGTAA CAAAATTGGT CGATTTGAAA
AAAGAAGGCA TAAACATTAC GGCATTGGTA TTCTGGGGAA TAACCGACGC GACAAGCTGG
CTTGGAGGAT ATCCGCTCCT GTTTGACGCG GAATACAAGG CAAAACCTGC ATTTTATGCT
ATAGTTAACA GCGTTCCGCC GCTTCCGACA GAACCGCCGG TTCAGGTTAT ACCCGGTGAT
GTAAACGGTG ACGGTCGTGT AAATTCATCC GACTTGACTC TTATGAAAAG ATACCTTTTA
AAATCCATAA GCGACTTCCC GACACCGGAA GGAAAAATTG CGGCGGATTT AAACGAAGAC
GGCAAGGTAA ACTCGACAGA TTTGTTAGCG CTGAAAAAAC TCGTTCTGAG AGAACTTTGA
 
Protein sequence
MLKKKLLTLL TVFALLTVGI CGSFLPLPKA SAAALIYDDF ETGLNGWGPR GPETVELTTE 
EAYSGRYSLK VSGRTSTWNG PMVDKTDVLT LGESYKLGVY VKFVGDSYSN EQRFSLQLQY
NDGAGDVYQN IKTATVYKGT WTLLEGQLTV PSHAKDVKIY VETEFKNSPS PQDLMDFYID
DFTATPANLP EIEKDIPSLK DVFAGYFKVG GAATVAELAP KPAKELFLKH YNSLTFGNEL
KPESVLDYDA TIAYMEANGG DQVNPQITLR AARPLLEFAK EHNIPVRGHT LVWHSQTPDW
FFRENYSQDE NAPWASKEVM LQRLENYIKN LMEALATEYP TVKFYAWDVV NEAVDPNTSD
GMRTPGSNNK NPGSSLWMQT VGRDFIVKAF EYARKYAPAD CKLFYNDYNE YEDRKCDFII
EILTELKAKG LVDGMGMQSH WVMDYPSISM FEKSIRRYAA LGLEIQLTEL DIRNPDNSQW
ALERQANRYK ELVTKLVDLK KEGINITALV FWGITDATSW LGGYPLLFDA EYKAKPAFYA
IVNSVPPLPT EPPVQVIPGD VNGDGRVNSS DLTLMKRYLL KSISDFPTPE GKIAADLNED
GKVNSTDLLA LKKLVLREL