Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0607 |
Symbol | |
ID | 4808209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 744284 |
End bp | 745333 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106021 |
Product | peptidase M42 |
Protein accession | YP_001037035 |
Protein GI | 125973125 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.201581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTATA TAAATATATT AAAGGATTTA AGTACATATC CCGGGGTATC CGGGCAGGAA GACAAGCTTT CCGGGTACAT TGCAAAGCTG TTTGAAAAAT ACTGTGACAG TGTGGAAATA GATGAATTCT ACAATGTTAT CGGGATAAAA AAGGGCATAG GCGGTTCCGG AGGCAGAAGG ATTATGGTTA CAGCCCATCT TGACGAAATA GGTTTGATGG TAAAAAGTAT TGACGAAAAG GGGTTTATCA CTGTCTCAAA CATTGGGGGT GTGGACAGCA AGGTCCTTCT GGCCCAGGAA GTTGTAATTC ATGGAAAGAA AGAGATATAT GGCATTATAG GCGCAAAGCC TCCGCACCTT TTGACTCCGG AAGAGATAAA AAAGGCGGTT AAGATGGAGG ACTTGGTTAT AGATACGGGG CTTTCTGCAG AAGAAGTGAG AAAATATGTA TCTGTGGGAG ATATTGTGAC TTTTAAGGTC GAGCCGTTAG TCCTTCAGAA CAACAGATTT AGTTCAAAGT CTCTGGACAA CCGGGCGGGA GTTGTTGCTT TGCTGGACAT AATGGAAAAT TTGACTTTGC TCAATCACAA AGATGATGTA TGGTTTGTGG CTACGGTTCA GGAAGAAGTG GGGCTTAGGG GAGCCAATAT TGCCGCCTAT AATATAAACC CGGATTTGGC AATAGTGATT GATGTCTGCC ACGGCCAGAT ACCCGGCACA CCGAAGGAAT CGGTGTTTCC TGTAGGTAAA GGTCCGGCTG TCGCCGTCGG TCCGAATCTT CATAGAAAAT ACACAAAAAA GATGATTGAG CTTGCCAAAG AGGAAAATAT ACCTTACCAG ATAGATGTGG AGCCCGGGGA CACCGGTACC GAGGCTTGGG CCGTACAGGT TTCAAGAGAG GGAATTCCGA CGCTTTTGGT TTCAATTCCT CTAAAGTACA TGCATACGGT AATAGAAACT TTAAGCATAG ATGATATAAA AAATACCGGA AGACTGATTG CAAGATTTAT TTCAATGACA GGAAACGAAA TGGAGGAAGG ACTGTGCTGA
|
Protein sequence | MDYINILKDL STYPGVSGQE DKLSGYIAKL FEKYCDSVEI DEFYNVIGIK KGIGGSGGRR IMVTAHLDEI GLMVKSIDEK GFITVSNIGG VDSKVLLAQE VVIHGKKEIY GIIGAKPPHL LTPEEIKKAV KMEDLVIDTG LSAEEVRKYV SVGDIVTFKV EPLVLQNNRF SSKSLDNRAG VVALLDIMEN LTLLNHKDDV WFVATVQEEV GLRGANIAAY NINPDLAIVI DVCHGQIPGT PKESVFPVGK GPAVAVGPNL HRKYTKKMIE LAKEENIPYQ IDVEPGDTGT EAWAVQVSRE GIPTLLVSIP LKYMHTVIET LSIDDIKNTG RLIARFISMT GNEMEEGLC
|
| |