Gene Cthe_0985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0985 
Symbol 
ID4811279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1177642 
End bp1178925 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content38% 
IMG OID640106403 
Productpeptidase M16-like protein 
Protein accessionYP_001037410 
Protein GI125973500 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATGA AAGTTGTTGA ATATAAAAAC ATCGATGAAA CGGTATATGT TCATGAACAT 
TCAAGCGGTC TTAAATCCTT TGTGGTGCCC AAAAAAGGTT ATTCAAAAAA ATATGCGAAT
TTTGCAACCC ATTACGGTTC CATCAATAAT GAGTTTGTCG TGCCCGGGGA AAAAGATTCC
ATCAGAGTTC CCGACGGAAT AGCCCACTTT TTGGAGCATA AGCTTTTTGA ACAAAAAGAC
GGAAGCGTTA TGGATAAGTT TTCACAGCTT GGTTCAAATC CCAATGCATA TACAAGCTTT
GCCCAGACGG TTTATCTTTT TTCCTGCACT GACAGGTTCG AGGACAATTT CCGACTCCTT
TTGGATTTTG TTCAAAATCC TTTTATAACA GAGGAAAGCG TGGAAAAAGA AAAGGACATA
ATAGCTCAGG AAATCAGGAT GTATGAAGAT GATCCGAACT GGAGAGTTTT CTTCAACTTG
CTGGATGCAT TTTATGTAAA TAATCCTGTA AAAATTGACA TAGCAGGAAC GGTTGAAAGT
ATAAGCAAAA TCAACCGAGA CATTTTGTAC AAGTGCTATA ATACTTTCTA CCATCCTTCC
AATATGATGA TTCTGGTAGT TGGAGATGTT GAACCGAAAG AAGTGTTCGG ACAGATTGAG
GAAAGTATAG ATGCAAAGAG CAGCAAGCCT GAAATAAAGA GGATTTTTCC CGAGGAACCC
AAAACAATCA ACCGGGACTA TGTTGAACAG AAGCTTGCGG TTGCCATGCC CATGTTTCAA
ATGGGATTTA AAGACAATGA TTTTAATTCA AAGGGAATTG AGTGCTTAAA GAGGGAAGTT
GCGGTAAAGC TCATACTTGA AATGATAATG GGCAGAAGTT CAAGCCTTTA TAACGAGTTG
TACAACGAGG GTCTTATCAA CAACACCTTT GATTTTGATT ACACCATTGA AGAGAATTAT
GCATATTCGG CTTTCGGTGG CGAATCCAAA GATCCCTTGA TGGTAAAAGA AAGAGTCGTG
GATGAAATCA GGAAAATACA GGCGAACGGG CTTGACAAAA ACAGCTACGA ACGGATTAAA
AGAGCCATGA AAGGAAGATT CATAAAGCAG CTCAACTCGG TGGAGAGAAT TTCGCACATG
TTTATATCGG TGTATTTTAA AGATGTAAGC ATGTTTGACT ATCCGGATGT TTATGACAAT
ATGACTTTTG ATTATGTAAA AGAGGTTTTT GAGAATCATT TCAATTTGGA TAATCTGGCT
GTATCGGTGG TAAACCCGGT ATAG
 
Protein sequence
MNMKVVEYKN IDETVYVHEH SSGLKSFVVP KKGYSKKYAN FATHYGSINN EFVVPGEKDS 
IRVPDGIAHF LEHKLFEQKD GSVMDKFSQL GSNPNAYTSF AQTVYLFSCT DRFEDNFRLL
LDFVQNPFIT EESVEKEKDI IAQEIRMYED DPNWRVFFNL LDAFYVNNPV KIDIAGTVES
ISKINRDILY KCYNTFYHPS NMMILVVGDV EPKEVFGQIE ESIDAKSSKP EIKRIFPEEP
KTINRDYVEQ KLAVAMPMFQ MGFKDNDFNS KGIECLKREV AVKLILEMIM GRSSSLYNEL
YNEGLINNTF DFDYTIEENY AYSAFGGESK DPLMVKERVV DEIRKIQANG LDKNSYERIK
RAMKGRFIKQ LNSVERISHM FISVYFKDVS MFDYPDVYDN MTFDYVKEVF ENHFNLDNLA
VSVVNPV