Gene Cthe_0998 details

Gene Information

Locus tag: Cthe_0998
Symbol:
ID: 4811292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1195522
End bp: 1196796
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 42%
IMG OID: 640106416
Product: peptidase RseP
Protein accession: YP_001037423
Protein GI: 125973513
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0750] Predicted membrane-associated Zn-dependent proteases 1
TIGRFAM ID: [TIGR00054] RIP metalloprotease RseP
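
The coordinate and length fields in this record can be cross-checked with a short calculation. The following is a minimal Python sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1195522, 1196796)
print(length_bp)                  # 1275
print(protein_length(length_bp))  # 424
```

Under these assumptions the results agree with the Gene Length and Protein Length fields of the record.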


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000230134
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTTT TATTGGTTAT TCTGGCTTTT GATTTTATCA TTATTATTCA CGAACTGGGA 
CACTTTATTG TTGCAAAACT GTCAGGCATA AAAGTTGAAG AGTTTTCCCT CTTTGTAGGT
CCGAAGCTGT TTTCCGTCAC AATAGGCGAG ACGGCTTATA CGTTGCGTTT GTTTCCGATA
CTTGCCTACG TCAAAATGGA AGGTGAGGAA GAAGAGTCGG ACAGCGAGAG AGCCTTTAAC
AACAAGCCTG TCTGGGTGCG GGCCGCGGTG GTGGCGGCAG GACCTCTGGC CAATTTGATT
TCCGCGTTTC TGATAATATC GGTAGTGTAT TATACGACGG GTTATACCAC AAGAACCGTA
GGTTTGGTTC AGAAAGATTC TCCGGCTTAT AATGTAGGTA TCCGGGAAGG AGATGTTATT
GTCGGCTACG ACGGAAAAAG AATATATGAC CCGTTGGAAG TTATCCAGTT TTTATATGTA
TCAAAAGGGA AAGAAACCAC AATAGAGTTT GTAAGAAACG GAAAAGAGAT AAAAAAAGAT
ATTAAACCTA AGGTGGAAAG GACTTACCAG TTGGGATATT ATTCCTCCGC ATCGGGGGAG
AACTCCAATG TTATTGGGGA GTTGATTTAT GGCGGTGCTT TGGAAAAAGC GGGCGCCAAA
CCCGGAGATA AAATTGTGAA GTTAAATGAT GTTGAGGTTG AAAGCATTGA TGAGATAAAG
AATTTTTTAC AGGAAAATAA GAACCAACCG GTTAAGGTGA CTGTTTTGAG AGACGGGAAT
GAAATAGTCT TTAATGTTGT ACCGCAATTT GTGGAGAATT ATTCTCTTGG AATATCCTTC
TCCCGGGCAA AGGGTGGCAA TATTCTGAAT GTTTTAAAGA ACGGTGCGAT GTTTACCTAC
TCCAACATAC GCATGGTGCC TTACAGCCTT TACTGGCTTG TGACGGGCCA GGTATCCATA
AATCAGATGA CGGGTCCGGT GGGAATTGTG AGCACCATGA ATGATGTGGC GCAGCAAAGT
GATACCTTTA AGGATGCGGT GCTGAACATT CTTCTGTGGA CGGCTTTAAT AAGTGCCGCA
ATTGGTGCGA CAAACCTTGT ACCATTCCCG GCGCTTGACG GAAGCAAGCT TCTTATTCTT
GCCATTGAGG CGATAAGCAG AAGGAAGATT CCTGTGGAAA AGGAAGCAAT TATTACTTCA
ATAGGATTTA TTATTTTAAT AGGTCTTTCA ATATTTGTAA TGGCAAATGA CATAATTAGA
TTTATAATCA AATAA
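
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before computing anything from it. The sketch below, in Python, shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and only the first line of the sequence is used as a sample. Running it on the full sequence would give a value to compare against the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a short sample.
sample = clean_sequence(
    "ATGAGATTTT TATTGGTTAT TCTGGCTTTT GATTTTATCA TTATTATTCA CGAACTGGGA"
)
print(len(sample))  # 60
print(round(gc_percent(sample), 1))
```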
 
Protein sequence
MRFLLVILAF DFIIIIHELG HFIVAKLSGI KVEEFSLFVG PKLFSVTIGE TAYTLRLFPI 
LAYVKMEGEE EESDSERAFN NKPVWVRAAV VAAGPLANLI SAFLIISVVY YTTGYTTRTV
GLVQKDSPAY NVGIREGDVI VGYDGKRIYD PLEVIQFLYV SKGKETTIEF VRNGKEIKKD
IKPKVERTYQ LGYYSSASGE NSNVIGELIY GGALEKAGAK PGDKIVKLND VEVESIDEIK
NFLQENKNQP VKVTVLRDGN EIVFNVVPQF VENYSLGISF SRAKGGNILN VLKNGAMFTY
SNIRMVPYSL YWLVTGQVSI NQMTGPVGIV STMNDVAQQS DTFKDAVLNI LLWTALISAA
IGATNLVPFP ALDGSKLLIL AIEAISRRKI PVEKEAIITS IGFIILIGLS IFVMANDIIR
FIIK
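
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following Python sketch builds the codon table from the conventional TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are allowed. Because this gene starts with ATG, no special handling of alternative start codons is included, which is a simplification. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above (both are in frame).
first_line = "ATGAGATTTT TATTGGTTAT TCTGGCTTTT GATTTTATCA TTATTATTCA CGAACTGGGA"
print(translate(first_line))          # MRFLLVILAFDFIIIIHELG
print(translate("TTTATAATCA AATAA"))  # FIIK
```

The two outputs match the first twenty and last four residues of the protein sequence above.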