Gene Cthe_2253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2253 
Symbol 
ID4809991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2680508 
End bp2682307 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content43% 
IMG OID640107659 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001038648 
Protein GI125974738 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000844037 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAATATT TTAAAAATAT AAGCTTTTAT ATAGTGCTGT TTGTGATGTT GCTGGCTTTT 
CTGGTCATAG TGCAAAGCCG GCCTGTTGAA AAGGAACAAA AGTATTCCCA GCTTATAAGC
GATATTCACA ACGGTAAGGT TCAGGAAATT ATACTGGAAG ACAATAAAGC CACGGTAAAA
TATAAAGAAG AGGGACAGAG GGACCAGTTT GTTTATATAC CCGATGTTGA AGTGTTCATG
AATGAGATAA ACGACCTTAT CAGAGAAGGA GAGCTTGAGT TTCGAAGCAA GGTTCCTTAT
TCGCCGCCGT GGTGGATTTC TATATTGCCT ACTTTGGTAA TTATAGTTGT GTTTGTGCTG
TTCTGGGTGT TCTTCCTCCA GCAGTCTCAG GGCGGCGGAA GCAGAGTAAT GTCTTTTGGA
AAAAGCAGAG CCAAAATGAC AATAGACGAT AAGAGAAAAG TGACTTTCAA TGACGTCGCA
GGAGCGGATG AGGAGAAAGA GGAGCTAAGG GAAATAGTCG AATTTCTGAA AAATTCCAAA
AAGTTTTTGG AGTTGGGAGC GAGGATACCA AAAGGAGTGC TTTTGGTGGG ACCTCCGGGT
ACAGGTAAAA CCTTGCTGGC AAAAGCCGTG TCCGGAGAAG CCGGAGTTCC GTTCTTCAGC
ATAAGCGGAT CGGACTTTGT GGAAATGTTT GTCGGTGTAG GTGCTTCGAG GGTTAGAGAC
CTTTTTGAAC AGGCAAAGAA GAATGCCCCC TGTATTGTTT TCATAGATGA AATTGATGCT
GTCGGAAGAC ATAGAGGAGC AGGTCTCGGC GGCGGCCATG ACGAGAGAGA ACAGACGCTT
AATCAGCTGT TGGTTGAAAT GGACGGTTTC GGAGCAAATG AAGGGGTAAT AATTCTGGCT
GCAACAAACA GACCGGATAT TTTGGACCCT GCGCTTTTAA GACCGGGAAG ATTTGATAGA
AGAGTAGTTG TGGGGCTTCC GGACATAAAA GGAAGAGAGG AAATACTGAA AGTTCATGCC
AAAGGCAAGC CCCTGGCGGA AGATGTCAAA TTGGATGAAC TTGCAAAAAG TACCCCCGGA
TTTACCGGAG CGGACCTTGA AAACCTTCTT AATGAAGCAG CTTTGCTTGC TGCCAGAGCC
AACAAGAAAG TGATAACAAT GGCTGAAATA AAAGAAGCGA CATTCAAGGT GGTCATGGGA
CCGGAGAAAA AGAGCAGGGT AATGAGCGAG AAGGAAAAAA GGCTTACTGC CTATCATGAA
GCCGGTCATG CCATTGCAAT AAAAGAAGTT TCCACTACCG ACAGGGTTGA CAGGATATCC
ATAATTCCGT CGGGTATGGC AGGCGGTTTT ACCGCACACA AACCAGATGA GGATAAAAAT
TATGAGACCA AGTCCCATCT TATTGAAAAG ATTATCGTTG CATTGGGAGG AAGAGCGGCA
GAGGAGATTG TGCTGGGTGA AGTGAGCACC GGAGCATACT CCGATCTGAA GCAGGCAAAC
GGAATTGCCA GAAGCATGAT TACCAAATAT GGTATGAGTG ATACCCTTGG CAATCTCGTA
TTTGCAAATG AAAGTGACGA GGTATTTATC GGTAGGGATT TTGTCCAGAC AAAAAACTAC
AGTGAGGAGA TAGCTGCTCA AATTGACAGA GAAGTAAAGA AGATAATAGA TTCATGTTAT
GAAAGAATAA AAAATATTTT GAAAGAGAAT ATAAACAAAC TTCATGCCGT AGCCAACGCC
CTTATGGAAA AGGAAAAACT TGAAGGTTAC GAGTTTGAAG AGTTGTATGC AAATGCATGA
 
Protein sequence
MKYFKNISFY IVLFVMLLAF LVIVQSRPVE KEQKYSQLIS DIHNGKVQEI ILEDNKATVK 
YKEEGQRDQF VYIPDVEVFM NEINDLIREG ELEFRSKVPY SPPWWISILP TLVIIVVFVL
FWVFFLQQSQ GGGSRVMSFG KSRAKMTIDD KRKVTFNDVA GADEEKEELR EIVEFLKNSK
KFLELGARIP KGVLLVGPPG TGKTLLAKAV SGEAGVPFFS ISGSDFVEMF VGVGASRVRD
LFEQAKKNAP CIVFIDEIDA VGRHRGAGLG GGHDEREQTL NQLLVEMDGF GANEGVIILA
ATNRPDILDP ALLRPGRFDR RVVVGLPDIK GREEILKVHA KGKPLAEDVK LDELAKSTPG
FTGADLENLL NEAALLAARA NKKVITMAEI KEATFKVVMG PEKKSRVMSE KEKRLTAYHE
AGHAIAIKEV STTDRVDRIS IIPSGMAGGF TAHKPDEDKN YETKSHLIEK IIVALGGRAA
EEIVLGEVST GAYSDLKQAN GIARSMITKY GMSDTLGNLV FANESDEVFI GRDFVQTKNY
SEEIAAQIDR EVKKIIDSCY ERIKNILKEN INKLHAVANA LMEKEKLEGY EFEELYANA