Gene Cthe_2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2089 
Symbol 
ID4810949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2482535 
End bp2484760 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content45% 
IMG OID640107496 
Productglycoside hydrolase family protein 
Protein accessionYP_001038489 
Protein GI125974579 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000609917 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAAAA GCAGAAAGAT TTCTATTCTG TTGGCAGTTG CAATGCTGGT ATCCATAATG 
ATACCCACAA CTGCATTCGC AGGTCCTACA AAGGCACCTA CAAAAGATGG GACATCTTAT
AAGGATCTTT TCCTTGAACT CTACGGAAAA ATTAAAGATC CTAAGAACGG ATATTTCAGC
CCAGACGAGG GAATTCCTTA TCACTCAATT GAAACATTGA TCGTTGAAGC GCCGGACTAC
GGTCACGTTA CTACCAGTGA GGCTTTCAGC TATTATGTAT GGCTTGAAGC AATGTATGGA
AATCTCACAG GCAACTGGTC CGGAGTAGAA ACAGCATGGA AAGTTATGGA GGATTGGATA
ATTCCTGACA GCACAGAGCA GCCGGGTATG TCTTCTTACA ATCCAAACAG CCCTGCCACA
TATGCTGACG AATATGAGGA TCCTTCATAC TATCCTTCAG AGTTGAAGTT TGATACCGTA
AGAGTTGGAT CCGACCCTGT ACACAACGAC CTTGTATCCG CATACGGTCC TAACATGTAC
CTCATGCACT GGTTGATGGA CGTTGACAAC TGGTACGGTT TTGGTACAGG AACACGGGCA
ACATTCATAA ACACCTTCCA AAGAGGTGAA CAGGAATCCA CATGGGAAAC CATTCCTCAT
CCGTCAATAG AAGAGTTCAA ATACGGCGGA CCGAACGGAT TCCTTGATTT GTTTACAAAG
GACAGATCAT ATGCAAAACA GTGGCGTTAT ACAAACGCTC CTGACGCAGA AGGCCGTGCT
ATACAGGCTG TTTACTGGGC AAACAAATGG GCAAAGGAGC AGGGTAAAGG TTCTGCCGTT
GCTTCCGTTG TATCCAAGGC TGCAAAGATG GGTGACTTCT TGAGAAACGA CATGTTCGAC
AAATACTTCA TGAAGATCGG TGCACAGGAC AAGACTCCTG CTACCGGTTA TGACAGTGCA
CACTACCTTA TGGCCTGGTA TACTGCATGG GGTGGTGGAA TTGGTGCATC CTGGGCATGG
AAGATCGGAT GCAGCCACGC ACACTTCGGA TATCAGAACC CATTCCAGGG ATGGGTAAGT
GCAACACAGA GCGACTTTGC TCCTAAATCA TCCAACGGTA AGAGAGACTG GACAACAAGC
TACAAGAGAC AGCTTGAATT CTATCAGTGG TTGCAGTCGG CTGAAGGTGG TATTGCCGGT
GGAGCAACCA ACTCCTGGAA CGGTAGATAT GAGAAATATC CTGCTGGTAC GTCAACGTTC
TATGGTATGG CATATGTTCC GCATCCTGTA TACGCTGACC CGGGTAGTAA CCAGTGGTTC
GGATTCCAGG CATGGTCAAT GCAGCGTGTA ATGGAGTACT ACCTCGAAAC AGGAGATTCA
TCAGTTAAGA ATTTGATTAA GAAGTGGGTC GACTGGGTAA TGAGCGAAAT TAAGCTCTAT
GACGATGGAA CATTTGCAAT TCCTAGCGAC CTCGAGTGGT CAGGTCAGCC TGATACATGG
ACCGGAACAT ACACAGGCAA CCCGAACCTC CATGTAAGAG TAACTTCTTA CGGTACTGAC
CTTGGTGTTG CAGGTTCACT TGCAAATGCT CTTGCAACTT ATGCCGCAGC TACAGAAAGA
TGGGAAGGAA AACTTGATAC AAAAGCAAGA GACATGGCTG CTGAACTGGT TAACCGTGCA
TGGTACAACT TCTACTGCTC TGAAGGAAAA GGTGTTGTTA CTGAGGAAGC ACGTGCTGAC
TACAAACGTT TCTTTGAGCA GGAAGTATAC GTTCCGGCAG GTTGGAGCGG TACTATGCCG
AACGGTGACA AGATTCAGCC TGGTATTAAG TTCATAGACA TCCGTACAAA ATATAGACAA
GATCCTTACT ACGATATAGT ATATCAGGCA TACTTGAGAG GCGAAGCTCC TGTATTGAAT
TATCACCGCT TCTGGCATGA AGTTGACCTT GCAGTTGCAA TGGGTGTATT GGCTACATAC
TTCCCGGATA TGACATATAA AGTACCTGGT ACTCCTTCTA CTAAATTATA CGGCGACGTC
AATGATGACG GAAAAGTTAA CTCAACTGAC GCTGTAGCAT TGAAGAGATA TGTTTTGAGA
TCAGGTATAA GCATCAACAC TGACAATGCC GATTTGAATG AAGACGGCAG AGTTAATTCA
ACTGACTTAG GAATTTTGAA GAGATATATT CTCAAAGAAA TAGATACATT GCCGTACAAG
AACTAA
 
Protein sequence
MVKSRKISIL LAVAMLVSIM IPTTAFAGPT KAPTKDGTSY KDLFLELYGK IKDPKNGYFS 
PDEGIPYHSI ETLIVEAPDY GHVTTSEAFS YYVWLEAMYG NLTGNWSGVE TAWKVMEDWI
IPDSTEQPGM SSYNPNSPAT YADEYEDPSY YPSELKFDTV RVGSDPVHND LVSAYGPNMY
LMHWLMDVDN WYGFGTGTRA TFINTFQRGE QESTWETIPH PSIEEFKYGG PNGFLDLFTK
DRSYAKQWRY TNAPDAEGRA IQAVYWANKW AKEQGKGSAV ASVVSKAAKM GDFLRNDMFD
KYFMKIGAQD KTPATGYDSA HYLMAWYTAW GGGIGASWAW KIGCSHAHFG YQNPFQGWVS
ATQSDFAPKS SNGKRDWTTS YKRQLEFYQW LQSAEGGIAG GATNSWNGRY EKYPAGTSTF
YGMAYVPHPV YADPGSNQWF GFQAWSMQRV MEYYLETGDS SVKNLIKKWV DWVMSEIKLY
DDGTFAIPSD LEWSGQPDTW TGTYTGNPNL HVRVTSYGTD LGVAGSLANA LATYAAATER
WEGKLDTKAR DMAAELVNRA WYNFYCSEGK GVVTEEARAD YKRFFEQEVY VPAGWSGTMP
NGDKIQPGIK FIDIRTKYRQ DPYYDIVYQA YLRGEAPVLN YHRFWHEVDL AVAMGVLATY
FPDMTYKVPG TPSTKLYGDV NDDGKVNSTD AVALKRYVLR SGISINTDNA DLNEDGRVNS
TDLGILKRYI LKEIDTLPYK N