Gene Cthe_2972 details


Gene Information

Locus tag: Cthe_2972
Symbol:
ID: 4810860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3490091
End bp: 3492142
Gene Length: 2052 bp
Protein Length: 683 aa
Translation table: 11
GC content: 45%
IMG OID: 640108394
Product: glycoside hydrolase family protein
Protein accession: YP_001039362
Protein GI: 125975452
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
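
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record for Cthe_2972.
length = gene_length_bp(3490091, 3492142)
print(length)                     # 2052
print(protein_length_aa(length))  # 683
```

Both results match the Gene Length and Protein Length fields in the record.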


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAAA AATTACTGGT AACTTTCCTG ATTTTAATTA CTTTTACCGT TTCACTGACT 
TTGTTTCCGG TAAATGTACG CGCTGATGTA GTAATTACGT CAAACCAGAC GGGTACTCAT
GGCGGGTACA ACTTTGAGTA CTGGAAAGAC ACCGGAAACG GAACCATGGT CCTCAAAGAC
GGTGGTGCGT TCAGCTGCGA ATGGAGCAAT ATCAACAATA TTCTTTTCCG TAAAGGTTTC
AAATACGATG AAACAAAGAC ACATGATCAA CTTGGATACA TAACGGTAAC TTATTCCTGC
AACTATCAGC CAAACGGAAA CTCTTATCTG GGAGTCTACG GATGGACCAG CAATCCGCTT
GTAGAGTATT ACATCATCGA GAGCTGGGGA ACCTGGAGAC CACCGGGAGC AACACCAAAG
GGCACTATTA CCGTTGACGG TGGTACATAC GAGATATACG AGACCACCAG AGTTAACCAG
CCTTCCATCA AAGGTACAGC TACTTTCCAG CAATACTGGA GTGTACGTAC ATCAAAACGT
ACAAGCGGAA CCATATCCGT AACCGAACAC TTTAAAGCCT GGGAACGTCT GGGTATGAAA
ATGGGAAAAA TGTATGAGGT TGCTTTGGTT GTAGAAGGAT ACCAGAGCAG CGGAAAAGCC
GACGTAACCA GCATGACAAT TACTGTTGGC AACGCACCGT CAACATCATC ACCACCAGGT
CCGACACCTG AACCGACTCC AAGAAGTGCT TTTTCAAAAA TCGAAGCTGA GGAGTACAAC
TCCCTCAAGT CATCAACCAT TCAGACCATA GGCACTTCCG ACGGAGGAAG CGGTATAGGT
TATATTGAAA GCGGTGACTA TCTGGTATTT AACAAAATAA ACTTTGGAAA CGGCGCAAAC
TCTTTCAAGG CAAGGGTTGC ATCCGGTGCG GACACACCCA CCAATATCCA GTTAAGACTC
GGAAGCCCGA CCGGTACTCT TATAGGAACT CTTACGGTGG CTTCCACAGG TGGTTGGAAC
AATTACGAGG AAAAATCCTG CAGCATAACC AACACTACAG GACAGCACGA CTTATATCTG
GTATTCTCAG GTCCTGTTAA CATTGACTAC TTCATATTCG ACTCGAATGG CGTAAATCCT
ACACCCACCT CTCAGCCTCA ACAAGGCCAG GTTTTGGGTG ACTTGAACGG AGACAAACAA
GTAAATTCAA CAGACTACAC AGCACTGAAG AGACATTTGC TCAATATAAC CAGACTTTCA
GGAACTGCTC TTGCCAACGC CGATTTAAAC GGTGACGGCA AAGTTGATTC CACTGACCTT
ATGATTCTAC ACAGATATCT TCTCGGTATA ATTTCATCTT TTCCACGCAG CAATCCACAA
CCAAGCAGTA ACCCTCAACC AAGCAGCAAT CCGCAGCCAA CGATTAATCC AAATGCGAAA
CTGGTGGCTC TTACCTTTGA CGACGGTCCG GACAACGTAC TTACGGCACG GGTTCTCGAC
AAGCTTGATA AATATAACGT TAAGGCTACA TTCATGGTAG TAGGTCAGAG AGTCAATGAT
TCGACGGCTG CCATCATCAG AAGGATGGTT AATTCAGGCC ATGAAATAGG AAACCACTCA
TGGAGTTATT CAGGCATGGC CAATATGAGT CCGGATCAGA TAAGGAAATC CATTGCCGAT
ACAAATGCAG TTATTCAAAA ATATGCTGGA ACAACACCCA AGTTCTTCCG TCCGCCGAAC
CTCGAAACAA GCCCAACATT ATTCAACAAT GTTGACTTGG TGTTTGTCGG CGGCTTAACG
GCAAATGACT GGATTCCATC CACAACCGCC GAACAGAGGG CTGCCGCAGT TATAAACGGT
GTCAGAGACG GTACAATAAT TCTTTTGCAT GATGTTCAAC CTGAGCCACA CCCGACACCG
GAAGCTCTGG ATATAATCAT CCCTACACTT AAGAGCCGGG GCTATGAATT TGTGACCTTG
ACTGAGTTGT TCACGTTAAA GGGTGTGCCA ATTGACCCAT CAGTCAAAAG AATGTATAAC
TCTGTACCGT AA
 
Protein sequence
MKQKLLVTFL ILITFTVSLT LFPVNVRADV VITSNQTGTH GGYNFEYWKD TGNGTMVLKD 
GGAFSCEWSN INNILFRKGF KYDETKTHDQ LGYITVTYSC NYQPNGNSYL GVYGWTSNPL
VEYYIIESWG TWRPPGATPK GTITVDGGTY EIYETTRVNQ PSIKGTATFQ QYWSVRTSKR
TSGTISVTEH FKAWERLGMK MGKMYEVALV VEGYQSSGKA DVTSMTITVG NAPSTSSPPG
PTPEPTPRSA FSKIEAEEYN SLKSSTIQTI GTSDGGSGIG YIESGDYLVF NKINFGNGAN
SFKARVASGA DTPTNIQLRL GSPTGTLIGT LTVASTGGWN NYEEKSCSIT NTTGQHDLYL
VFSGPVNIDY FIFDSNGVNP TPTSQPQQGQ VLGDLNGDKQ VNSTDYTALK RHLLNITRLS
GTALANADLN GDGKVDSTDL MILHRYLLGI ISSFPRSNPQ PSSNPQPSSN PQPTINPNAK
LVALTFDDGP DNVLTARVLD KLDKYNVKAT FMVVGQRVND STAAIIRRMV NSGHEIGNHS
WSYSGMANMS PDQIRKSIAD TNAVIQKYAG TTPKFFRPPN LETSPTLFNN VDLVFVGGLT
ANDWIPSTTA EQRAAAVING VRDGTIILLH DVQPEPHPTP EALDIIIPTL KSRGYEFVTL
TELFTLKGVP IDPSVKRMYN SVP
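
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon assignments of translation table 11, which are the same as those of the standard code (table 11 differs in its set of permitted start codons; this gene starts with ATG, so that difference does not matter here). The helper names are hypothetical, and the whitespace stripping reflects the 10-base grouping used in this record.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGAAACAAA AATTACTGGT AACTTTCCTG"))  # MKQKLLVTFL
# Last 12 bases, ending in the TAA stop codon.
print(translate("TCTGTACCGT AA"))  # SVP
```

The two fragments reproduce the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_fraction` is one way to compare against the GC content field.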