Gene Cthe_0607 details


Gene Information

Locus tag: Cthe_0607
Symbol:
ID: 4808209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 744284
End bp: 745333
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 44%
IMG OID: 640106021
Product: peptidase M42
Protein accession: YP_001037035
Protein GI: 125973125
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
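
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(744284, 745333)
print(length_bp, protein_length(length_bp))  # 1050 349
```

Both results agree with the Gene Length and Protein Length fields listed above.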


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.201581
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTATA TAAATATATT AAAGGATTTA AGTACATATC CCGGGGTATC CGGGCAGGAA 
GACAAGCTTT CCGGGTACAT TGCAAAGCTG TTTGAAAAAT ACTGTGACAG TGTGGAAATA
GATGAATTCT ACAATGTTAT CGGGATAAAA AAGGGCATAG GCGGTTCCGG AGGCAGAAGG
ATTATGGTTA CAGCCCATCT TGACGAAATA GGTTTGATGG TAAAAAGTAT TGACGAAAAG
GGGTTTATCA CTGTCTCAAA CATTGGGGGT GTGGACAGCA AGGTCCTTCT GGCCCAGGAA
GTTGTAATTC ATGGAAAGAA AGAGATATAT GGCATTATAG GCGCAAAGCC TCCGCACCTT
TTGACTCCGG AAGAGATAAA AAAGGCGGTT AAGATGGAGG ACTTGGTTAT AGATACGGGG
CTTTCTGCAG AAGAAGTGAG AAAATATGTA TCTGTGGGAG ATATTGTGAC TTTTAAGGTC
GAGCCGTTAG TCCTTCAGAA CAACAGATTT AGTTCAAAGT CTCTGGACAA CCGGGCGGGA
GTTGTTGCTT TGCTGGACAT AATGGAAAAT TTGACTTTGC TCAATCACAA AGATGATGTA
TGGTTTGTGG CTACGGTTCA GGAAGAAGTG GGGCTTAGGG GAGCCAATAT TGCCGCCTAT
AATATAAACC CGGATTTGGC AATAGTGATT GATGTCTGCC ACGGCCAGAT ACCCGGCACA
CCGAAGGAAT CGGTGTTTCC TGTAGGTAAA GGTCCGGCTG TCGCCGTCGG TCCGAATCTT
CATAGAAAAT ACACAAAAAA GATGATTGAG CTTGCCAAAG AGGAAAATAT ACCTTACCAG
ATAGATGTGG AGCCCGGGGA CACCGGTACC GAGGCTTGGG CCGTACAGGT TTCAAGAGAG
GGAATTCCGA CGCTTTTGGT TTCAATTCCT CTAAAGTACA TGCATACGGT AATAGAAACT
TTAAGCATAG ATGATATAAA AAATACCGGA AGACTGATTG CAAGATTTAT TTCAATGACA
GGAAACGAAA TGGAGGAAGG ACTGTGCTGA
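
The GC content reported for this gene (44%) can be recomputed from the sequence rows above. This is a minimal sketch that assumes GC content means the percentage of G and C among the A, C, G and T bases; the record does not state how the database rounds the value. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq.

    Spaces and line breaks, as used in the listing above, are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Usage: paste all rows of the gene sequence into one string, then
# compare the rounded result with the GC content field of the record.
example = "GGCC AATT"
print(gc_content(example))  # 50.0
```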
 
Protein sequence
MDYINILKDL STYPGVSGQE DKLSGYIAKL FEKYCDSVEI DEFYNVIGIK KGIGGSGGRR 
IMVTAHLDEI GLMVKSIDEK GFITVSNIGG VDSKVLLAQE VVIHGKKEIY GIIGAKPPHL
LTPEEIKKAV KMEDLVIDTG LSAEEVRKYV SVGDIVTFKV EPLVLQNNRF SSKSLDNRAG
VVALLDIMEN LTLLNHKDDV WFVATVQEEV GLRGANIAAY NINPDLAIVI DVCHGQIPGT
PKESVFPVGK GPAVAVGPNL HRKYTKKMIE LAKEENIPYQ IDVEPGDTGT EAWAVQVSRE
GIPTLLVSIP LKYMHTVIET LSIDDIKNTG RLIARFISMT GNEMEEGLC
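
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below uses only the Python standard library. It assumes that the amino acid assignments of NCBI translation table 11 are the same as those of the standard code (the tables differ in which codons may act as start codons), which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace in the input (as in the listing above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGACTATA TAAATATATT AAAGGATTTA"))  # MDYINILKDL
```

The output matches the first ten residues of the protein sequence, and the last 30 bases of the gene translate to GNEMEEGLC followed by the TGA stop codon.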