Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2253 |
Symbol | |
ID | 4809991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2680508 |
End bp | 2682307 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107659 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_001038648 |
Protein GI | 125974738 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000844037 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAATATT TTAAAAATAT AAGCTTTTAT ATAGTGCTGT TTGTGATGTT GCTGGCTTTT CTGGTCATAG TGCAAAGCCG GCCTGTTGAA AAGGAACAAA AGTATTCCCA GCTTATAAGC GATATTCACA ACGGTAAGGT TCAGGAAATT ATACTGGAAG ACAATAAAGC CACGGTAAAA TATAAAGAAG AGGGACAGAG GGACCAGTTT GTTTATATAC CCGATGTTGA AGTGTTCATG AATGAGATAA ACGACCTTAT CAGAGAAGGA GAGCTTGAGT TTCGAAGCAA GGTTCCTTAT TCGCCGCCGT GGTGGATTTC TATATTGCCT ACTTTGGTAA TTATAGTTGT GTTTGTGCTG TTCTGGGTGT TCTTCCTCCA GCAGTCTCAG GGCGGCGGAA GCAGAGTAAT GTCTTTTGGA AAAAGCAGAG CCAAAATGAC AATAGACGAT AAGAGAAAAG TGACTTTCAA TGACGTCGCA GGAGCGGATG AGGAGAAAGA GGAGCTAAGG GAAATAGTCG AATTTCTGAA AAATTCCAAA AAGTTTTTGG AGTTGGGAGC GAGGATACCA AAAGGAGTGC TTTTGGTGGG ACCTCCGGGT ACAGGTAAAA CCTTGCTGGC AAAAGCCGTG TCCGGAGAAG CCGGAGTTCC GTTCTTCAGC ATAAGCGGAT CGGACTTTGT GGAAATGTTT GTCGGTGTAG GTGCTTCGAG GGTTAGAGAC CTTTTTGAAC AGGCAAAGAA GAATGCCCCC TGTATTGTTT TCATAGATGA AATTGATGCT GTCGGAAGAC ATAGAGGAGC AGGTCTCGGC GGCGGCCATG ACGAGAGAGA ACAGACGCTT AATCAGCTGT TGGTTGAAAT GGACGGTTTC GGAGCAAATG AAGGGGTAAT AATTCTGGCT GCAACAAACA GACCGGATAT TTTGGACCCT GCGCTTTTAA GACCGGGAAG ATTTGATAGA AGAGTAGTTG TGGGGCTTCC GGACATAAAA GGAAGAGAGG AAATACTGAA AGTTCATGCC AAAGGCAAGC CCCTGGCGGA AGATGTCAAA TTGGATGAAC TTGCAAAAAG TACCCCCGGA TTTACCGGAG CGGACCTTGA AAACCTTCTT AATGAAGCAG CTTTGCTTGC TGCCAGAGCC AACAAGAAAG TGATAACAAT GGCTGAAATA AAAGAAGCGA CATTCAAGGT GGTCATGGGA CCGGAGAAAA AGAGCAGGGT AATGAGCGAG AAGGAAAAAA GGCTTACTGC CTATCATGAA GCCGGTCATG CCATTGCAAT AAAAGAAGTT TCCACTACCG ACAGGGTTGA CAGGATATCC ATAATTCCGT CGGGTATGGC AGGCGGTTTT ACCGCACACA AACCAGATGA GGATAAAAAT TATGAGACCA AGTCCCATCT TATTGAAAAG ATTATCGTTG CATTGGGAGG AAGAGCGGCA GAGGAGATTG TGCTGGGTGA AGTGAGCACC GGAGCATACT CCGATCTGAA GCAGGCAAAC GGAATTGCCA GAAGCATGAT TACCAAATAT GGTATGAGTG ATACCCTTGG CAATCTCGTA TTTGCAAATG AAAGTGACGA GGTATTTATC GGTAGGGATT TTGTCCAGAC AAAAAACTAC AGTGAGGAGA TAGCTGCTCA AATTGACAGA GAAGTAAAGA AGATAATAGA TTCATGTTAT GAAAGAATAA AAAATATTTT GAAAGAGAAT ATAAACAAAC TTCATGCCGT AGCCAACGCC CTTATGGAAA AGGAAAAACT TGAAGGTTAC GAGTTTGAAG AGTTGTATGC AAATGCATGA
|
Protein sequence | MKYFKNISFY IVLFVMLLAF LVIVQSRPVE KEQKYSQLIS DIHNGKVQEI ILEDNKATVK YKEEGQRDQF VYIPDVEVFM NEINDLIREG ELEFRSKVPY SPPWWISILP TLVIIVVFVL FWVFFLQQSQ GGGSRVMSFG KSRAKMTIDD KRKVTFNDVA GADEEKEELR EIVEFLKNSK KFLELGARIP KGVLLVGPPG TGKTLLAKAV SGEAGVPFFS ISGSDFVEMF VGVGASRVRD LFEQAKKNAP CIVFIDEIDA VGRHRGAGLG GGHDEREQTL NQLLVEMDGF GANEGVIILA ATNRPDILDP ALLRPGRFDR RVVVGLPDIK GREEILKVHA KGKPLAEDVK LDELAKSTPG FTGADLENLL NEAALLAARA NKKVITMAEI KEATFKVVMG PEKKSRVMSE KEKRLTAYHE AGHAIAIKEV STTDRVDRIS IIPSGMAGGF TAHKPDEDKN YETKSHLIEK IIVALGGRAA EEIVLGEVST GAYSDLKQAN GIARSMITKY GMSDTLGNLV FANESDEVFI GRDFVQTKNY SEEIAAQIDR EVKKIIDSCY ERIKNILKEN INKLHAVANA LMEKEKLEGY EFEELYANA
|
| |