Gene Cthe_0536 details


Gene Information

Locus tag: Cthe_0536
Symbol:
ID: 4808285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 654353
End bp: 656044
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 42%
IMG OID: 640105950
Product: glycoside hydrolase family protein
Protein accession: YP_001036965
Protein GI: 125973055
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
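
The start and end coordinates, gene length, and protein length in this record can be cross-checked against one another. The following is a minimal sketch, assuming 1-based inclusive coordinates (the record does not state the convention) and a coding sequence that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(654353, 656044)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1692 bp) and Protein Length (563 aa).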


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000215938
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAT TTCTTGTTTT ATTGATTGCA TTAATAATGA TTGCAACCTT ATTAGTGGTT 
CCCGGTGTGC AAACATCGGC AGAAGGGTCA TATGCTGATT TGGCAGAACC GGATGACGAC
TGGCTGCATG TGGAAGGTAC GAATATTGTT GACAAGTACG GCAATAAAGT TTGGATAACA
GGAGCCAACT GGTTTGGATT CAATTGTAGA GAGAGAATGC TTTTGGATTC ATACCATAGC
GATATTATAG CAGATATTGA ATTGGTTGCG GATAAAGGTA TAAATGTGGT TAGAATGCCG
ATTGCGACAG ATTTGCTCTA TGCATGGAGC CAGGGGATAT ATCCGCCTTC AACCGATACA
AGCTACAACA ATCCGGCTTT GGCTGGATTA AACAGCTATG AGTTGTTTAA TTTCATGCTG
GAAAATTTCA AAAGAGTCGG TATCAAAGTT ATACTTGATG TGCACAGCCC GGAAACTGAC
AACCAGGGGC ATAACTATCC TCTTTGGTAC AATACCACTA TAACGGAGGA GATATTCAAA
AAGGCCTGGG TATGGGTGGC TGAGCGCTAT AAAAATGATG ACACAATAAT CGGATTTGAC
CTAAAAAATG AGCCCCATAC CAATACCGGC ACCATGAAAA TAAAAGCTCA AAGTGCCATA
TGGGATGACT CCAACCATCC GAACAACTGG AAAAGAGTGG CTGAGGAAAC TGCCTTGGCA
ATATTGGAAG TACATCCAAA TGTATTGATA TTTGTTGAAG GTGTGGAGAT GTATCCCAAA
GACGGCATAT GGGATGACGA AACTTTTGAC ACAAGCCCGT GGACAGGAAA CAATGACTAT
TACGGAAACT GGTGGGGCGG TAACTTAAGA GGCGTGAAGG ATTATCCGAT TAATCTTGGA
AAATATCAGT CGCAGCTTGT TTATTCACCT CATGATTATG GCCCGATAGT TTATGAGCAG
GATTGGTTTA AAGGCGATTT TATCACTGCC AATGATGAAC AGGCAAAAAG GATTCTGTAT
GAGCAATGCT GGAGAGACAA TTGGGCATAT ATCATGGAAG AAGGAATATC ACCGTTGCTC
CTTGGCGAAT GGGGAGGTAT GACCGAAGGC GGCCACCCGC TTCTTGACCT GAACTTGAAG
TATTTAAGAT GCATGAGAGA TTTTATATTG GAAAACAAAT ATAAATTGCA TCATACTTTC
TGGTGCATAA ACATTGACTC GGCAGATACC GGCGGATTGT TTACCCGTGA TGAGGGAACA
CCGTTCCCGG GGGGAAGAGA TCTTAAGTGG AATGACAACA AGTACGACAA TTACTTGTAT
CCTGTTCTTT GGAAAACCGA GGACGGAAAG TTTATAGGTC TTGACCACAA GATTCCTCTC
GGCAGAAACG GTATATCAAT AAGTCAGCTT TCAAACTATA CACCGTCGGT TACTCCGTCT
CCCAGCGCAA CTCCTTCTCC GACAACAATA ACTGCACCGC CGACGGATAC CGTTACATAC
GGAGATGTGA ACGGAGACGG AAGGGTAAAC TCCAGCGATG TGGCATTGTT GAAAAGATAT
TTGTTGGGTT TGGTTGAAAA TATCAATAAA GAAGCAGCGG ACGTAAATGT CAGTGGAACT
GTAAATTCAA CGGATTTGGC AATTATGAAA AGGTATGTTT TGCGTAGCAT AAGTGAGTTG
CCGTATAAAT AA
 
Protein sequence
MKKFLVLLIA LIMIATLLVV PGVQTSAEGS YADLAEPDDD WLHVEGTNIV DKYGNKVWIT 
GANWFGFNCR ERMLLDSYHS DIIADIELVA DKGINVVRMP IATDLLYAWS QGIYPPSTDT
SYNNPALAGL NSYELFNFML ENFKRVGIKV ILDVHSPETD NQGHNYPLWY NTTITEEIFK
KAWVWVAERY KNDDTIIGFD LKNEPHTNTG TMKIKAQSAI WDDSNHPNNW KRVAEETALA
ILEVHPNVLI FVEGVEMYPK DGIWDDETFD TSPWTGNNDY YGNWWGGNLR GVKDYPINLG
KYQSQLVYSP HDYGPIVYEQ DWFKGDFITA NDEQAKRILY EQCWRDNWAY IMEEGISPLL
LGEWGGMTEG GHPLLDLNLK YLRCMRDFIL ENKYKLHHTF WCINIDSADT GGLFTRDEGT
PFPGGRDLKW NDNKYDNYLY PVLWKTEDGK FIGLDHKIPL GRNGISISQL SNYTPSVTPS
PSATPSPTTI TAPPTDTVTY GDVNGDGRVN SSDVALLKRY LLGLVENINK EAADVNVSGT
VNSTDLAIMK RYVLRSISEL PYK
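
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted), and it uses only the first 60 and last 12 bases of the gene sequence above to keep the example short. The helper names `translate` and `gc_percent` are hypothetical; this is not the pipeline that produced the record.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 60 and last 12 bases of the gene sequence above.
FIRST_60 = ("ATGAAAAAAT TTCTTGTTTT ATTGATTGCA "
            "TTAATAATGA TTGCAACCTT ATTAGTGGTT")
LAST_12 = "CCGTATAAAT AA"

print(translate(FIRST_60))  # first 20 residues of the protein
print(translate(LAST_12))   # last 3 residues; translation stops at TAA
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the listed protein sequence and GC content to be checked against the nucleotide data.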