Gene Cthe_2360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2360 
Symbol 
ID4808998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2820240 
End bp2823026 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content44% 
IMG OID640107771 
Productglycoside hydrolase family protein 
Protein accessionYP_001038755 
Protein GI125974845 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAACTT CAATTATTGC CGTACCTTCT TATGTATCCG CCGAACCGGA ATATAATTTT 
GCCAAAGCGC TGCAGCTGTC CCTTTATTTC TACGATGCCA ACAAGTGCGG TGAGGGGATA
ACCGGAGGAA GGCTTGAATG GAGAGGAGAC TGTCACTTAG AAGACAAGGA AGTTCCTCTT
ATTCCGATGA CAGAGGATTA TTTCGGTACC AATATGTCAC AGGAATACAT TGACAAGTAC
AGACATATTT TCGACCCTGA CGGAGACGGT ACCGTAGACT TAAGCGGCGG AATGCACGAC
GCCGGAGACC ATGTCAAATT CGGGCTTCCG GGTACATACG CAGCATCAAC CCTGGGGTGG
GGTTTCTATG AATTCAGAGA AGCTTATGAA AAAGTTGGAG TAGACGACCA TGCCAAAGAA
ATTTTGAGAT GGTTCAATGA CTTCTACCTT AAGGCTACAT TTTTGGATGA AGACGGAAAC
GTAATAGCTT ATTGCTATCA GGTTGGAGAA GGTAACATTG ACCATAATTT CTGGAATCCG
CCTGAACTTC AAAGCTCCAA AGTACTTGAT TCGTTCGCCA GACCTGCTTA TTTTGCCACT
CCGGAAACTC CTGCAAGCGA CCAATGTGCA GGTGCTGCGG CATCACTTGC AATAAACTAT
CTTAATTTTA AAGATGAAGA TCCCGAATAT GCTCAAAGAT GCCTGGATAC TGCCATTGCC
CTCTATGAGT TTGCAAAGAA ATACAGAGGA CTTGGTGAAT CCGGAGGATT CTACGGTTCG
TCTTATGATC ATGACGAACT GGCGTGGGCC GCTGTCTGGC TCAACATTGC AACAGGAGAC
ATGCAGTATA TAGATGACAT TGTTGCAACC GACAGCGAGG GAAATTATAT AGGATATATG
CAAAGAATCA TTAAAGACAC AACCAACACA TGGCAAAATA TATGGGTTCA CTGTTGGGAT
ACCGTTTGGG GCGGAGTATT TGCAAAACTT GCTCCAATAA CAAATACGGA AAGAGACTGG
TATATCTTCC GCTGGAATCT GGAATACTGG TCCGGAATTC CTCATGAAAA TCCAAATGAC
ACGACTTTCC TTGCTAAAAG TCCCGGTGGC TTCAGCGTCG TAAACGCATA CGGATCTGCA
AGGTATAATA CAGCAGCCCA GCTTTGTGCC CTTGTATATA GAAAATATAC CGGCAGAAGC
GATTTTGCTG ACTGGGCAAA AAGTCAGATG GAGTATATTA TGGGTAACAA CCCCATGAAC
AGATGCTATA TCGTGGGATA TTCCGAAAAC AGTGCAAAAC ATCCTCACCA TCGTGCAGCC
CATGGTTCCA AGACATTCAG CATGCTGGAT CCTGAAGAAC ACCGTCATAC ACTGTGGGGG
GCATTGGTGG GAGGTCCCGA CCTTGACGAT TTCCATGTGG ATGAAACCAC AGACTATGTA
TACAACGAAG TTGCCGTAGA CTACAATGCC GCTTTCGTCG GCGCCTGTGC AGGATTGTAT
TATTACTATG GTGAAGCATT GGGACACAAA CCTGTTCCAA ACTTCCCGCC AAAAGAAGAA
GCTGTAGAGG AGTATTATGT GGAAGGAAAA ATTGAACAGG AAAACAAGGA AAGAACCCAG
GTAACCATCA AAATTTTCAA TGATACCTGC CATCCTCCCC GTTTTGAGAC CGGCTTGATG
GCACGTTACT TCTTCAATAT TTCAGAACTT CTGGATGCAG GTCAAAGCAT CGATGACGTG
AAAATAGAGG TTTATTACGA CGAAAACAAG GCAAGCTACG ACGGTCCTGC CGAGGTAAGA
GGTCCTATCA AGTACGATGA CGCAGGCACC TATTATGTGG AAGTTGACTG GAGTGGCCGT
ATTATTTACG GTAAACGCGA AATCCAGCTT GCACTGATAT CAAGCCTGGA TTCCAACTAC
AAAAGCAACT GGAATCCGGA AAATGACTAC AGCAGGGAAG GATTGGGCAA GGAATTTGTC
AGAACCGAAA AAATTCCTCT TTACCTGAAC GGCGTAAAAG TATTTGGAAA TGAGCCTCCT
CAAATTGAAC CGTCGCCGAC GCCTTCGGAC GGACCGGGCT CAACACCGCC GCCGTCTCAA
AAGCCTTCAT TGGAGGTTCT GTACAAATAC GGAGATACAA CGGCAGCTAC AAAAGATATC
AGAGGCTCCA TTAAAATTAA AAATACAGGA ACAAAACCGG TAAATCTTTC CGACGTAAAA
GTACGCTACT GGTTTACCAA AGACGGGGCT TCAAGTCAGG AATTTGTATG CGATTATGCG
CATTTGTCTG AAAGCATGAT CACAGCAAAA TTCGTAGATC TTGAAAACCC GGTTGAAAAT
GCGGATAATT ATCTTGAGAT TGGCTTTGAC AGCAATGCAG GTATTTTGGG ACCGGGCTCC
GATACCGGCG AAATTCAATT CAGAATCGTA AAAGGAGATT ATGAGTCTTA TGATCAATCC
AATGATTATT CCTGCATGGC TACTGCAAAA GACTTTACCG CAAATCCGAA CATTACGGCG
TATGTAAATT CTGTTTTAGT GTACGGCAAC CCGCCGGTAG ATGAAGAGGA GGAAATTGAA
ATAGTGTACG GAGATTTGAA CGGAGACGGA AGAGTAAATT CTACAGATTT GCTTCTGATG
AAAAAACGTA TTATCAGGGA AATTGACAAG TTCAACGTTC CCGACGAAAA TGCAGACTTA
AATCTGGACG GAAAAATAAA CTCTTCCGAT TATACGATAC TTAAAAGATA TGTTTTAAAG
TCAATAGAAA AACTGCCCGT AAAGTAA
 
Protein sequence
MITSIIAVPS YVSAEPEYNF AKALQLSLYF YDANKCGEGI TGGRLEWRGD CHLEDKEVPL 
IPMTEDYFGT NMSQEYIDKY RHIFDPDGDG TVDLSGGMHD AGDHVKFGLP GTYAASTLGW
GFYEFREAYE KVGVDDHAKE ILRWFNDFYL KATFLDEDGN VIAYCYQVGE GNIDHNFWNP
PELQSSKVLD SFARPAYFAT PETPASDQCA GAAASLAINY LNFKDEDPEY AQRCLDTAIA
LYEFAKKYRG LGESGGFYGS SYDHDELAWA AVWLNIATGD MQYIDDIVAT DSEGNYIGYM
QRIIKDTTNT WQNIWVHCWD TVWGGVFAKL APITNTERDW YIFRWNLEYW SGIPHENPND
TTFLAKSPGG FSVVNAYGSA RYNTAAQLCA LVYRKYTGRS DFADWAKSQM EYIMGNNPMN
RCYIVGYSEN SAKHPHHRAA HGSKTFSMLD PEEHRHTLWG ALVGGPDLDD FHVDETTDYV
YNEVAVDYNA AFVGACAGLY YYYGEALGHK PVPNFPPKEE AVEEYYVEGK IEQENKERTQ
VTIKIFNDTC HPPRFETGLM ARYFFNISEL LDAGQSIDDV KIEVYYDENK ASYDGPAEVR
GPIKYDDAGT YYVEVDWSGR IIYGKREIQL ALISSLDSNY KSNWNPENDY SREGLGKEFV
RTEKIPLYLN GVKVFGNEPP QIEPSPTPSD GPGSTPPPSQ KPSLEVLYKY GDTTAATKDI
RGSIKIKNTG TKPVNLSDVK VRYWFTKDGA SSQEFVCDYA HLSESMITAK FVDLENPVEN
ADNYLEIGFD SNAGILGPGS DTGEIQFRIV KGDYESYDQS NDYSCMATAK DFTANPNITA
YVNSVLVYGN PPVDEEEEIE IVYGDLNGDG RVNSTDLLLM KKRIIREIDK FNVPDENADL
NLDGKINSSD YTILKRYVLK SIEKLPVK