Gene Cthe_0351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0351 
Symbol 
ID4808500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp440776 
End bp442167 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content41% 
IMG OID640105765 
ProductPpiC-type peptidyl-prolyl cis-trans isomerase 
Protein accessionYP_001036782 
Protein GI125972872 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0760] Parvulin-like peptidyl-prolyl isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACATA AAAAGAAAAC AGCTGTTATA ACCGGTGTGG CGGTTTTGGT TGGTATAGTG 
ATTATTGCTT TGGCGGTGGG ATATAATTAT TATTCCAAAA GAAACAAGCC CCAGGAGTCC
AAAGAGTTTG GTTTTCAGGC TCTTAAAATT AACGGCACTT ATGTGAGTAC AGATATTATG
AAAGAAGAAA GGAATAAGTT TTTTGAAAAG TATAAAAGAA ATGCGGATGT TTTGAGAATG
AATGATCACG AACGCAATGA CATGCTTTTG GACCAGGTGA TTGAAAGACT TTTACTGGAA
GACTATGTAA ACAACAAATC CGGTGTAACT GCAACGGACA GTGAAGTGGA GGATTACATA
AACAGGTTTA TTAAACCAAG ATACGGAGAT TCCCTCGGTA CATTTATGAG TTCCCAGGGG
TACACAAACG AAGAGGAGAT GAAAGCCGGT ATCAAGGAGT ACATATTAAA GCACAAAGCC
TTGTACAAAG CTGCAAAAGA GAAAAATGTG ACTTTGACCG AGCAGGAACT GGATGAAGGC
TACGAAAAAC ACAAGATTCA AAACAAGAAA GTTGACATAA GGCATATATT TATCTCCAGC
CAGGAAAGGG GCAAGGAGGA GGCTAAAAAG CTGGCCGACG AGATTTATAA CAGACTTAAG
AACAATGAAG ATTTTGAAAC TCTGGCAAAG CAATATTCCG ATGACGAGAA AACTAAAGAA
TCGGGAGGAG TGATCACGGA ACTTCGAGCA GGTTTCAATG AAGCGGTCTT TGACAATGCG
GTTTTTACGG CAGAGGCCGG GCAGTTGCTG GAGCCGATAG AGGTTGCCAG GGGATATGAG
ATTGTATATG TGGACAAGGT TACGGATTTT TACAGAACCA GAGACGAGTA TGCGGAACTC
CTTACAGTGG ATAAATTTAT GCAGTCCGAT GCATACAAGG AATGGTTCGA GGAATACAAG
AAAAACTATG ATATAGAGAT AACGGATCCG GCAATGAAGG CGTTCAGGCT TTTCAGAGAA
AAGCAGTATA ATGAAGCCGG AGCACTCTAT GAAGAACTTT ACAAGTCGGA AAAAGATGCA
TACTATATTG AAATGGCATG TGAGGCATAC AAACTTGCCG AGAATTGGGC CGGACTTATT
GAAGCTGGCA AATTGGCAGT TAAAGAAAAT CCTGATTTCG TTAATTATTA TTTATACCAG
GCAGAGGGAG AATTTAAGGT CGGAGACGCC AATAAAGCTA AAGAGTTGTT GAAAGAGGCG
GAGAAAAAGG CGGGAGACAA TACATATTTC CTGGATTTGA TTCGAAAAAT GTATGAGAGC
CAGGGCCTTG CCGAAGATGT CGAAAGAATA GACCGGAAAT TTCAGGAAAT ATCTGAGAAA
TTGAAAGGAT AA
 
Protein sequence
MVHKKKTAVI TGVAVLVGIV IIALAVGYNY YSKRNKPQES KEFGFQALKI NGTYVSTDIM 
KEERNKFFEK YKRNADVLRM NDHERNDMLL DQVIERLLLE DYVNNKSGVT ATDSEVEDYI
NRFIKPRYGD SLGTFMSSQG YTNEEEMKAG IKEYILKHKA LYKAAKEKNV TLTEQELDEG
YEKHKIQNKK VDIRHIFISS QERGKEEAKK LADEIYNRLK NNEDFETLAK QYSDDEKTKE
SGGVITELRA GFNEAVFDNA VFTAEAGQLL EPIEVARGYE IVYVDKVTDF YRTRDEYAEL
LTVDKFMQSD AYKEWFEEYK KNYDIEITDP AMKAFRLFRE KQYNEAGALY EELYKSEKDA
YYIEMACEAY KLAENWAGLI EAGKLAVKEN PDFVNYYLYQ AEGEFKVGDA NKAKELLKEA
EKKAGDNTYF LDLIRKMYES QGLAEDVERI DRKFQEISEK LKG