Gene Cthe_1400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1400 
Symbol 
ID4809061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1710973 
End bp1712220 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content41% 
IMG OID640106823 
Productglycosyl hydrolase 53 
Protein accessionYP_001037824 
Protein GI125973914 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3867] Arabinogalactan endo-1,4-beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.195161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTATCGA AAATTACAAA ACTTACTGTT ATCTTTATCC TCATCCTGTC TGTCACATTT 
CTTTCAATGC CCAAAACTTA CACACAAGCG GCTCCAACTT TTGCAAAGGG GGCTGATGTA
AGCTGGCTGC CGGAGATGGA GGCAAACGGC TACAAATTTT ATAATGACGA CGGGATTGAG
CAGGATTGTC TTCAAATTTT AAAAGACCAT GGGATAGACT CAATCAGACT CAGGGTATGG
GTCAATCCTC CAAACGGTTA CTGCAACAAA GAAGAGACAA TTAAGATGGC ATTAAGAGCT
AAAAAAATGG GATTCAGAAT AATGATAAAC TTTCATTACA GTGACTCATG GGCCGACCCG
GGGAAACAAA CAAAACCGGC AGCATGGGCC AAATATGATT TTAACGGCTT GATGAAAGCT
GTATATGACT ATACATATGA TGTCATGAGC GCTTTGAAAG CCAACGGCAT AACTCCCGAG
TGGGTTCAGG TTGGAAATGA AACCAATAAC GGCATGCTGT GGGAAGACGG CAAGGCAACG
AACAGCATGA GGAATTTTGC ATGGCTCATC AACTGCGGTT ATGATGCCGT AAAAGCGGTA
AGTCCTGAAA CTAAAGTTAT AGTTCATATT GCAAATGGAT ATGACAATGC ATTGTACAGA
TGGATATTTG ATGGAATTAC CGCAAACGGT GCAAGATTTG ACGTAATAGG TATGTCGCTG
TATCCTACGG CGTCTGACTG GCCACAATTG ACAAACCAGT GTCTAAACAA TATGAAAGAC
ATGATATCAA GATACGGAAA AGAAATAATG ATATGTGAGA TTGGAATGGA TTACTGGGAG
GCGCAGGCTT GCAAAGACTT TATTACCGAT ATAATTCAAA AGACAAAATC CCTGCCCGAT
AATAAAGGCC TTGGAGTATT CTACTGGGAG CCTCAATGTT ATAACTGGCA GTATTATAAT
AAAGGAGCCT TTGATTTATC CGGAAAACCC ACAATTGCGC TGGATGCGTT TTTGACCTCC
GATACACCGG ATAATCTTAT TTATGGAGAT TTGAACGGTG ACGGACGCGT GAATTCCACG
GACTACACTT TGCTGAAGAG ATATTTGCTT GGCGCTATAC AAACTTTCCC TTATGAAAGG
GGAATTAAGG CTGCGGACCT GAATTTGGAC GGTCGTATCA ATTCGACTGA TTATACTGTG
CTAAAAAGAT ATTTACTCAA TGCCATACCA TCACTTCCTG TAAAATAG
 
Protein sequence
MLSKITKLTV IFILILSVTF LSMPKTYTQA APTFAKGADV SWLPEMEANG YKFYNDDGIE 
QDCLQILKDH GIDSIRLRVW VNPPNGYCNK EETIKMALRA KKMGFRIMIN FHYSDSWADP
GKQTKPAAWA KYDFNGLMKA VYDYTYDVMS ALKANGITPE WVQVGNETNN GMLWEDGKAT
NSMRNFAWLI NCGYDAVKAV SPETKVIVHI ANGYDNALYR WIFDGITANG ARFDVIGMSL
YPTASDWPQL TNQCLNNMKD MISRYGKEIM ICEIGMDYWE AQACKDFITD IIQKTKSLPD
NKGLGVFYWE PQCYNWQYYN KGAFDLSGKP TIALDAFLTS DTPDNLIYGD LNGDGRVNST
DYTLLKRYLL GAIQTFPYER GIKAADLNLD GRINSTDYTV LKRYLLNAIP SLPVK