Gene Cthe_1773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1773 
Symbol 
ID4810018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2094165 
End bp2096609 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content41% 
IMG OID640107187 
Productpeptidase S16, lon-like protein 
Protein accessionYP_001038187 
Protein GI125974277 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCAAA TTAGTGGGTT GCCTGCCGGT ATGTTAAGGA AGGAATGTGA TCCTAATTCT 
TTCAAGTTTA ATGACACTTC AGAGTTGGAA CCCCTTGAAG GAATTATAGG TCAGGAACGT
GCTGTGCGTG CCATGACATT CGGACTTAAA ATCAATACCC GCGGTTACAA TATTTTTATG
AGTGGTATGA CCGGAACCGG CAAAACTAGT TACGCTGTAA ATTATATTAA GAAAATAGCT
AAAAATTGCA AGACTCCGGA TGACTGGTGC TATGTATATA ATTTTGAGAA TCCGAATCAG
CCTAAAGCGA TAAATCTGCC TGCAGGACTT GGCAAAGTGT TTAAAAAGGA CATGGAGGAA
TTTATAAAAG TACTTCAGCA GGAAATCAGC AGGGCTTTTG AAAGTGAGGA CTATGAAAGA
GAAAGGGCGG CCATTGCAAA TGAATATCAG GGAAAAAAGG CCGAACTTAT GGAAATATTA
AACAGGGATG CTGAAAAACA AGGCTTCAAA GTCAGGACAA CAAACGCAGG AATATACTTT
CTTCCGGTAA TTGAAGGCAA GACAATAACG GAGGAGGAAT ACGGGCAACT TGATGAAAAG
ATTAAGCAGG AAATAACGGA AAGATCAAAT ATAGTTCAGC TTGAGACTTT GGAAATAATC
AGAAAGATAA AAAATATTGA AAGGGAAGCG GAAGAAAGGG TTGCTGAATG GGAGAATAAA
ATTGCCTTGT TTGCCGTAGG CATGCAGATA AATGACCTCA AAGAAAAGTA CAAGGATTAC
AAAGAAGTGG TTAAATATTT GGAACAGGTT CAGGAAGATA TTCTTCAAAA TCTTGATGAT
TTCAGGGAGG AAGAGTATTC TGAAGAACAA CAGCTCATTA TGCCCTGGCT TAAAGGTAAT
GAAGGCTCGC CTGTAGACAA ATATAAAGTA AATCTTTTGG TGGACAATTC CGGTCTTGAA
GGAGCTCCTG TCATAGTCGA TTTCAATCCT ACATATTACA ATCTTATTGG AAGAGTGGAA
TATGAAAACG AATTTGGAAC AATGATAACT GATTTTACAA TGATAAAACC GGGATTGTTC
CATCAGGCAA ACGGAGGTTA TCTGATACTC CAGGCAAAGG ATGTACTTAG CAATGTCCAA
TCCTGGGAAG CTCTAAAAAG GGCACTGAAA ACCCGCCAGA TAACCATTGA GAATATGAAG
GAGCAAATGG GACTTGTGGC AGTGTCGACA TTAAAGCCCG AGCCCATACC TTTGCAGGTC
AAAGTGATTT TGGTGGGAAA CGAGTTTTTG CACCAGCTGC TTTATGAATA TGATGAGGAT
TTCAAAAAGC TCTTTAAAAT AAAAGTGGAT TTTGACGAAG AGATGGACAG AAACGAAGAC
AATACCTTGA AACTGGCGCA GTTTATAAGC TCATTCTGCA GAAGGGAGAA CGCCCCGCAT
TTTGACAGGA CCGGGGTGGC AAAGGTGGTT GAGTACAGTT CGCGCCTGGT CGGCGATCAG
AACAAGCTTT CCACCAGGTT TAATGATATT GTTGAGATAC TTTGTGAATC TGCGGCATGG
GCTCAAATCG ACGGAAGCAG TCTGGTCAAA GCGGAGCATG TAAATAAAGC GATTCAGGAG
AAGATATACA GGTCAAACAA GTATGATAAA AAGCTTTTGG AGCTTTTGAA GGACGGTATT
ATAATTTTGG ATACCGAAGG CGAGGCAGTG GGACAGATAA ACGGCCTTAC CGTACTTGAT
ATTGGAGACT ATTGCTTCGG AAAGCCCACG AGGATAACCG CAAACACCTT TATGGGTGAA
AAAGGAATAG TAAATATTGA AAGAGAAGTT GAAATGAGCG GGACATCCCA TACAAAAGGG
GTTCTGATAT TGAGCGGGTA CATTGGTCAA AAATATGCCC AGGATATACC GCTGTCTCTG
ACTGCAAGCC TGTGCTTCGA ACAGCTGTAC AGCGGAGTTG ACGGCGACAG TGCATCAAGC
GCGGAGCTCT ATGCGATTCT GTCAAGCCTG GCGGAGGTTC CCATAAAACA GAGCATTGCG
GTAACGGGTT CGGTTAACCA GAAAGGAGAA ATTCAACCTA TTGGCGGGGT TAATGAGAAA
ATAGAAGGAT TCTTCGAGCT TTGCAAAGCC CGTGGACTTA ACGGCAAGCA TGGAGTAATT
ATTCCTTACC AGAATGTAAG AAATCTTGCT TTGAACGATG AGGTTATTGA AGCGGTGAAA
GAAGGCAAGT TCCATATATA TGCCGTAAAA ACCATAGATG AGGGAATTGA AATACTTACA
GGAATGAAAG CAGGGGAAAA GAGAGAAGAC GGAACTTATC CTGAGGGAAC AATAAACTAT
CTTGTATATG AGAAACTTAA AAAATATGCA AGAACGGTTG CCGGATTTGG CAAGGATGAA
AAGGAAGCAA AGGATGCAAA GGATGCAAAG AAGAATTCTG ATTAA
 
Protein sequence
MPQISGLPAG MLRKECDPNS FKFNDTSELE PLEGIIGQER AVRAMTFGLK INTRGYNIFM 
SGMTGTGKTS YAVNYIKKIA KNCKTPDDWC YVYNFENPNQ PKAINLPAGL GKVFKKDMEE
FIKVLQQEIS RAFESEDYER ERAAIANEYQ GKKAELMEIL NRDAEKQGFK VRTTNAGIYF
LPVIEGKTIT EEEYGQLDEK IKQEITERSN IVQLETLEII RKIKNIEREA EERVAEWENK
IALFAVGMQI NDLKEKYKDY KEVVKYLEQV QEDILQNLDD FREEEYSEEQ QLIMPWLKGN
EGSPVDKYKV NLLVDNSGLE GAPVIVDFNP TYYNLIGRVE YENEFGTMIT DFTMIKPGLF
HQANGGYLIL QAKDVLSNVQ SWEALKRALK TRQITIENMK EQMGLVAVST LKPEPIPLQV
KVILVGNEFL HQLLYEYDED FKKLFKIKVD FDEEMDRNED NTLKLAQFIS SFCRRENAPH
FDRTGVAKVV EYSSRLVGDQ NKLSTRFNDI VEILCESAAW AQIDGSSLVK AEHVNKAIQE
KIYRSNKYDK KLLELLKDGI IILDTEGEAV GQINGLTVLD IGDYCFGKPT RITANTFMGE
KGIVNIEREV EMSGTSHTKG VLILSGYIGQ KYAQDIPLSL TASLCFEQLY SGVDGDSASS
AELYAILSSL AEVPIKQSIA VTGSVNQKGE IQPIGGVNEK IEGFFELCKA RGLNGKHGVI
IPYQNVRNLA LNDEVIEAVK EGKFHIYAVK TIDEGIEILT GMKAGEKRED GTYPEGTINY
LVYEKLKKYA RTVAGFGKDE KEAKDAKDAK KNSD