Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0998 |
Symbol | |
ID | 4811292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1195522 |
End bp | 1196796 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106416 |
Product | peptidase RseP |
Protein accession | YP_001037423 |
Protein GI | 125973513 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0750] Predicted membrane-associated Zn-dependent proteases 1 |
TIGRFAM ID | [TIGR00054] RIP metalloprotease RseP |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000230134 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTTT TATTGGTTAT TCTGGCTTTT GATTTTATCA TTATTATTCA CGAACTGGGA CACTTTATTG TTGCAAAACT GTCAGGCATA AAAGTTGAAG AGTTTTCCCT CTTTGTAGGT CCGAAGCTGT TTTCCGTCAC AATAGGCGAG ACGGCTTATA CGTTGCGTTT GTTTCCGATA CTTGCCTACG TCAAAATGGA AGGTGAGGAA GAAGAGTCGG ACAGCGAGAG AGCCTTTAAC AACAAGCCTG TCTGGGTGCG GGCCGCGGTG GTGGCGGCAG GACCTCTGGC CAATTTGATT TCCGCGTTTC TGATAATATC GGTAGTGTAT TATACGACGG GTTATACCAC AAGAACCGTA GGTTTGGTTC AGAAAGATTC TCCGGCTTAT AATGTAGGTA TCCGGGAAGG AGATGTTATT GTCGGCTACG ACGGAAAAAG AATATATGAC CCGTTGGAAG TTATCCAGTT TTTATATGTA TCAAAAGGGA AAGAAACCAC AATAGAGTTT GTAAGAAACG GAAAAGAGAT AAAAAAAGAT ATTAAACCTA AGGTGGAAAG GACTTACCAG TTGGGATATT ATTCCTCCGC ATCGGGGGAG AACTCCAATG TTATTGGGGA GTTGATTTAT GGCGGTGCTT TGGAAAAAGC GGGCGCCAAA CCCGGAGATA AAATTGTGAA GTTAAATGAT GTTGAGGTTG AAAGCATTGA TGAGATAAAG AATTTTTTAC AGGAAAATAA GAACCAACCG GTTAAGGTGA CTGTTTTGAG AGACGGGAAT GAAATAGTCT TTAATGTTGT ACCGCAATTT GTGGAGAATT ATTCTCTTGG AATATCCTTC TCCCGGGCAA AGGGTGGCAA TATTCTGAAT GTTTTAAAGA ACGGTGCGAT GTTTACCTAC TCCAACATAC GCATGGTGCC TTACAGCCTT TACTGGCTTG TGACGGGCCA GGTATCCATA AATCAGATGA CGGGTCCGGT GGGAATTGTG AGCACCATGA ATGATGTGGC GCAGCAAAGT GATACCTTTA AGGATGCGGT GCTGAACATT CTTCTGTGGA CGGCTTTAAT AAGTGCCGCA ATTGGTGCGA CAAACCTTGT ACCATTCCCG GCGCTTGACG GAAGCAAGCT TCTTATTCTT GCCATTGAGG CGATAAGCAG AAGGAAGATT CCTGTGGAAA AGGAAGCAAT TATTACTTCA ATAGGATTTA TTATTTTAAT AGGTCTTTCA ATATTTGTAA TGGCAAATGA CATAATTAGA TTTATAATCA AATAA
|
Protein sequence | MRFLLVILAF DFIIIIHELG HFIVAKLSGI KVEEFSLFVG PKLFSVTIGE TAYTLRLFPI LAYVKMEGEE EESDSERAFN NKPVWVRAAV VAAGPLANLI SAFLIISVVY YTTGYTTRTV GLVQKDSPAY NVGIREGDVI VGYDGKRIYD PLEVIQFLYV SKGKETTIEF VRNGKEIKKD IKPKVERTYQ LGYYSSASGE NSNVIGELIY GGALEKAGAK PGDKIVKLND VEVESIDEIK NFLQENKNQP VKVTVLRDGN EIVFNVVPQF VENYSLGISF SRAKGGNILN VLKNGAMFTY SNIRMVPYSL YWLVTGQVSI NQMTGPVGIV STMNDVAQQS DTFKDAVLNI LLWTALISAA IGATNLVPFP ALDGSKLLIL AIEAISRRKI PVEKEAIITS IGFIILIGLS IFVMANDIIR FIIK
|
| |