Gene Cthe_0043 details


Gene Information

Locus tag: Cthe_0043
Symbol:
ID: 4808808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 54373
End bp: 56601
Gene Length: 2229 bp
Protein Length: 742 aa
Translation table: 11
GC content: 47%
IMG OID: 640105452
Product: glycoside hydrolase family protein
Protein accession: YP_001036477
Protein GI: 125972567
COG category:
COG ID:
TIGRFAM ID:
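
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions consistent with the numbers in this record, not statements from the source. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS that ends in one stop codon.

    Assumes the length is a whole number of codons and that the final
    codon is a stop codon, which is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length = gene_length(54373, 56601)
print(length)                           # 2229
print(expected_protein_length(length))  # 742
```

Both values agree with the Gene Length and Protein Length fields of this record.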


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAA TGGGGTTGGG CAGATTTAGC AATAAATTTG TCCTGACTGT TTTGCTGTTC 
TTTTTGACCG CCGTGCTGCT GCCGTCGTCT TCTTTTGAAT CAAAAGTACA GGCTGCTTCT
TCGCCTCGTT ACGGCGGTGC ATATTATAAT TACGGAGAAG CTTTGCAAAA AGCGATTCTT
TTTTACAAGG CCAATCGTCT TGGAGATTTG CCCGATGATT ACATCCTGCC TTACAGGGCT
GACGCCGCAA TGACCGACGG TCAGGATGTG GGTCTGGATC TTACCGGAGG ATGGGCTGAT
GCCGGAGACG GAATCAAATT TACCCATCCC ATGTCCTATG CCGCGGGCCA ATTGGGCTGG
GCTGTGTATG AATATCGTCA GGCCTTTGAA AAGGCAGGGC TGTTGGATGA TATTCTGGAC
GAAATAAAAT GGGCTACGGA CTTTTTCATT AAAGCTCATC CGGAGCCAAA TGTATTGTAT
TATATGTGCG GCTACAATGA TTCCGACCAC TCGGTATGGG TTCCCCACGA ACTTTTGGAT
TATGTGACGG ACAGAAAGTC CTTTGTGTTA AACCCGTCAA CCCCGGGCTC GGATGTGGCA
GGACAAACTG CGGCGTGTCT GGCTATTGCA TCAATAATAT TTGAGCCGAC AGATCCCGAA
TATGCCGAAA CCTGTTTGAC CCATGCAAAG CAGATTTTTG AGTTTGGTGA TAAATACAGA
GGCAAAAATC CTCTGGATGT TTTATACCCG TCCGGAGGTT ATCTGGATGA TTTGGCCTGG
GGTGCCGTGT GGCTCTACAT AAAAACCGGA GATTCCACTT ATCTTGAGAA AGCAAAATCC
TTCCTGCCTG TTACTTCATT GGGCGGAGGG CATACCCATT GCTGGGATGA TGTCAGCTAC
GGTGCTGCGC TTAAAATAGC CCAGCTTACT CATGACGAAG GATATGCCGC CATGGTCGAA
AAGAACCTTG ATTTCTGGTT GCCCGGAGGA GGAATCACTT ACACTCCGGG AGGCCTGGCA
TGGCTTTCAC CGTGGGGTTC TTTGCGTTAT GCATCCACAG CTGCGTTTTT GGCCTTTGTG
TGGGCTGACG ACCCAACGGT TGGAACACCG TCAAAGAAAG AAACTTACCG TGCTTTTGCG
GAAAAACAAA TAAACTATAT TCTCGGAGAT AACCCTCGAA AAGGAAGTTA TGTGGTGGGA
TTTGGCGAGA ATTCTCCAAA ACATCCGCAT CACAGAACGG CTCATGGTTC GTGGGTAAGT
ATGCTTGAAG TGCCGAGTTT CCATCGCCAC ATTCTCTACG GCGCATTGGT TGGTGGACCG
AGTTCCGATG ACAGCTGGGA AGATGATATT TCCGATTATA CCCGGAATGA AGTGGCAACC
GATTACAATG CAGGTTTTGT AGGGGCTTTG GCAAAAATGT ATGACATGTA TGGAGGAGAA
CCCCTTGAAA ATTGGCCTCA ACCTGAAGAT TTCAGAGCAC CGGAAGACGA CATTGTTGAA
TACTTCTGCC GTGGCTGGAT AATTTACGAA GGCTATGGAA CCTTAAACTT GCTGCTTCAG
GTTAACAACC GTTCCGGATG GCCTCCGACA ATGAAGGATA AGTTGTCTGT CCGTTACTTC
ATGGATTTAA CTGAAGTGTT TGAGTCCGGA GGAACTGTGG ATGACGTTCA GATAAGTCTT
GGACAGAACG AGGGAGCAAA ACTTATAGGT TTGAAGCATT ACAGGGACAA CATTTACTAC
TTTACGGTTG ACTTTACGGG AACCATGATT ATGCCTGCCG AGTGGGAGAT GTGTGAAAAA
GATGCCCATG TAACAATCAA ATACAGAGAC GGCATAACAG GTTCTAATGA AAACGACTGG
TCATATCAGA ACTTGAGAAA GGATCCGGAT TATGATGCTA CATCCTTTGC AGGACTGACT
CCTTATATAC CTGTATATGA CAACGGTGTG CTTTTGTGGG GTGAAGAACC GCCTGCCGGC
GGTGATGATC CCGGGTCTTC GCCGCCGCCG ACTCCAACTG AGCCGGTAAT TGTGTATGGT
GATTTGAACG GCGACGGAAA TATAAATTCC ACGGATTTTA CGATGTTAAA GAGAGCAATA
TTGGGTAATC CGGCTCCCGG TACAAATTTG GCAGCCGGTG ATTTGAACAG GGACGGTAAT
ACAAATTCAA CAGACTTGAT GATTTTAAGA AGGTATTTGC TAAAGTTAAT TGGTTCGTTA
CCTATATAA
 
Protein sequence
MKKMGLGRFS NKFVLTVLLF FLTAVLLPSS SFESKVQAAS SPRYGGAYYN YGEALQKAIL 
FYKANRLGDL PDDYILPYRA DAAMTDGQDV GLDLTGGWAD AGDGIKFTHP MSYAAGQLGW
AVYEYRQAFE KAGLLDDILD EIKWATDFFI KAHPEPNVLY YMCGYNDSDH SVWVPHELLD
YVTDRKSFVL NPSTPGSDVA GQTAACLAIA SIIFEPTDPE YAETCLTHAK QIFEFGDKYR
GKNPLDVLYP SGGYLDDLAW GAVWLYIKTG DSTYLEKAKS FLPVTSLGGG HTHCWDDVSY
GAALKIAQLT HDEGYAAMVE KNLDFWLPGG GITYTPGGLA WLSPWGSLRY ASTAAFLAFV
WADDPTVGTP SKKETYRAFA EKQINYILGD NPRKGSYVVG FGENSPKHPH HRTAHGSWVS
MLEVPSFHRH ILYGALVGGP SSDDSWEDDI SDYTRNEVAT DYNAGFVGAL AKMYDMYGGE
PLENWPQPED FRAPEDDIVE YFCRGWIIYE GYGTLNLLLQ VNNRSGWPPT MKDKLSVRYF
MDLTEVFESG GTVDDVQISL GQNEGAKLIG LKHYRDNIYY FTVDFTGTMI MPAEWEMCEK
DAHVTIKYRD GITGSNENDW SYQNLRKDPD YDATSFAGLT PYIPVYDNGV LLWGEEPPAG
GDDPGSSPPP TPTEPVIVYG DLNGDGNINS TDFTMLKRAI LGNPAPGTNL AAGDLNRDGN
TNSTDLMILR RYLLKLIGSL PI
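
As a hedged illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first and last blocks of the gene sequence codon by codon. It assumes the sequence is given 5' to 3' in the coding frame. The codon table is written out in the conventional TCAG order. Translation table 11 uses the same sense-codon assignments as the standard code, so the same table serves here, and alternative start codons are not handled. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence shown in this record.
first_block = "ATGAAGAAAA TGGGGTTGGG CAGATTTAGC AATAAATTTG TCCTGACTGT TTTGCTGTTC"
last_block = "CCTATATAA"

print(translate(first_block))  # MKKMGLGRFSNKFVLTVLLF
print(translate(last_block))   # PI
```

The two results match the first 20 residues and the final two residues of the protein sequence above.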