Gene Cthe_2590 details

Gene Information

Locus tag: Cthe_2590
Symbol:
ID: 4809012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3061974
End bp: 3063893
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 45%
IMG OID: 640108004
Product: glycoside hydrolase family protein
Protein accession: YP_001038983
Protein GI: 125975073
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3693] Beta-1,4-xylanase
TIGRFAM ID:
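
The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the record lists the gene as unspliced). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(3061974, 3063893)
print(length, "bp")
print(expected_protein_length(length), "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1920 bp) and Protein Length (639 aa).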


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.549614
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA CAGCATCATT TATTCTTGTT CTTTCATTGC TGCTGGCTTT TATTATTCCG 
GCTGAAGGGG GGTTGGCAGC AGAAGGCAAT TTACTTTTCA ACCCGGGCTT TGAACTGGGA
AGCACCGAAG GATGGTATCC TTACGGAGAG TGTACCATTG AGGCGGTCGG TACGGAAGCG
CACAGCGGAA ATTATAGCGT TTTTGTTACG GACAGGACTC AGGATTGGAA CGGTGTGGCC
CAGGACATGC TTGACAAGCT GACCGTAGGC ATGACCTATC AGGTTTCGGC ATGGGTCAAA
GTTGCAGGGA CAGGAAGTCA TCAAGTCAAA ATATCCATGA AGAAGGTCGA AACCGGCAAG
GAGCCGGTGT ATGACAACAT TGCGTCAATT ACCGTTGAGG GATCCGAATG GTACAGACTG
TCGGGTCCGT ACAGTTATAC CGGCACGAAT GTTACAAACC TTGAACTTTA CATAGAGGGA
CCCCAGCCGG GTGTCAGCTA CTATGTGGAT GATGTTACGG TAACTGAAGT GGGTTCCGCC
GCAACATGGA AGGAAGAGGC GAATGCAAGA ATTGAGCAAA TCAGAAAAAG GGATGCGAAA
ATAAGGATTG TGGATTCAAA CAATAAACCT GTTTCGGGAG TAAGCATTGA TGTTCGCCAG
GTAAAGCATG AATTTGGTTT TGGATCGGCT ATTACCATGA ATGGAATACA TGATCCCCGT
TATACCGAGT TCTTCAAAAA TAATTATGAG TGGGCGGTAT TTGAAAATGA AGCAAAATGG
TATTCCAACG AAAGCAGCCA GGGCAACGTT TCTTACGCCA ATGCGGACTA CTTGTACAAT
TGGTGTGCCG AAAACGGCAT AAAGGTAAGA GGCCATTGTA TATTCTGGGA GCCCGAAGAA
TGGCAGCCTT CATGGCTTAA AGGACTTACC GGAGATGCTC TTATGAAAGC GATAGATGCA
AGGCTTGAAA GTGTTGTTCC CCACTTTAGA GGCAAATTCC TTCACTGGGA TGTAAACAAT
GAGATGCTCC ATGGAGATTT CTTCAAAAGC CGCTTGGGAG AGTCCATATG GCCTTACATG
TTCAAAAGGG CCAGGGAGCT TGATCCGGAT GCAAAACTCT TTGTAAATGA TTATAACATT
ATCACTTATG TTGAGGGAGA TGCATACATA AGGCAGATTG AATGGCTCCT GCAAAACGGT
GCCGAGATAG ATGGCATAGG GGTGCAGGGA CATTTTGATG AAGATGTTGA ACCCCTTGTT
GTAAAAGCCA GACTGGACAA TTTGGCGACT TTGGGAATTC CCATATGGGT AACCGAATAT
GACTCTAAAA CGCCGGATGT AAACAAGAGA GCGGAGAATC TTGAAAACCT TTACCGTATC
GCATTCAGTC ATCCGGCGGT GGAAGGTATT ATAATGTGGG GATTCTGGGC AGGTAACCAC
TGGAGAGGTC AGGATGCCGC AATAGTAGAT CATGACTGGA CTGTAAATGA GGCAGGAAAG
AGATACCAGG CTTTGTTGAA AGAGTGGACC ACGATTACCT CAGGTACTAC CGACAGCACA
GGTGCATTTG ATTTCAGAGG TTTCCACGGC ACGTATGAAA TTACTGTGAG TGTTCCGGGG
AAAGAGCCTT TTGTAAAGAC CATTGAGCTT ACCAAAGGGA ACGGAACGGC TGTATATACA
TTTACTGTGG ACGGAACTGA TGCCGGAAAT GTTTTGTATG GGGATTTGAA CCAGGACGGC
CAGGTAAGTT CAACGGACTT GGTTGCCATG AAAAGATACC TGTTGAAAAA TTTTGAACTG
TCTGGCGTAG GGCTTGAGGC TGCAGATTTG AACAGCGATG GCAAAGTTAA TTCCACGGAT
TTGGTTGCTC TTAAAAGATT TTTGTTAAAA GAAATAGATG AATTGCCTTT AAAACGTTAA
 
Protein sequence
MKKTASFILV LSLLLAFIIP AEGGLAAEGN LLFNPGFELG STEGWYPYGE CTIEAVGTEA 
HSGNYSVFVT DRTQDWNGVA QDMLDKLTVG MTYQVSAWVK VAGTGSHQVK ISMKKVETGK
EPVYDNIASI TVEGSEWYRL SGPYSYTGTN VTNLELYIEG PQPGVSYYVD DVTVTEVGSA
ATWKEEANAR IEQIRKRDAK IRIVDSNNKP VSGVSIDVRQ VKHEFGFGSA ITMNGIHDPR
YTEFFKNNYE WAVFENEAKW YSNESSQGNV SYANADYLYN WCAENGIKVR GHCIFWEPEE
WQPSWLKGLT GDALMKAIDA RLESVVPHFR GKFLHWDVNN EMLHGDFFKS RLGESIWPYM
FKRARELDPD AKLFVNDYNI ITYVEGDAYI RQIEWLLQNG AEIDGIGVQG HFDEDVEPLV
VKARLDNLAT LGIPIWVTEY DSKTPDVNKR AENLENLYRI AFSHPAVEGI IMWGFWAGNH
WRGQDAAIVD HDWTVNEAGK RYQALLKEWT TITSGTTDST GAFDFRGFHG TYEITVSVPG
KEPFVKTIEL TKGNGTAVYT FTVDGTDAGN VLYGDLNQDG QVSSTDLVAM KRYLLKNFEL
SGVGLEAADL NSDGKVNSTD LVALKRFLLK EIDELPLKR
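
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used by the database: it builds the codon table in the standard NCBI ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs only in permitted start codons), strips the 10-base spacing used in the listing, and stops at the first stop codon. It also includes a simple GC percentage helper. All function names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAAAA CAGCATCATT TATTCTTGTT"))  # MKKTASFILV
```

Passing the full gene sequence to translate and comparing the result with the protein sequence (after the same whitespace removal) is one way to confirm the two listings agree.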