Gene Cthe_0555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0555 
Symbol 
ID4808230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp680567 
End bp681808 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content38% 
IMG OID640105969 
ProductPpiC-type peptidyl-prolyl cis-trans isomerase 
Protein accessionYP_001036984 
Protein GI125973074 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000887936 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT CAAAAAGTAT TATATTGGTT ATTAGTATAG TTGTAGTGCT GATTGCGGGT 
TTGTCAGTGG CAACTTATTT TATACTGAAA CCCTTGTTTG GAGAAAAGGA TGAAAGTAAT
ATTTCTCCGA TTACCAAACA GTTGACGGAA GAAGAGGCAA GTAAAGTAAT TGCTGAAGTA
AACGGGGAGC AGATACTATA CAAAGATTTC TATTTTATAT ACAGCCAGCA GGCGGCATAT
TATGGTCTTA CTACTGAGGA TGAAGATTCA CTGAGTGACG ATACGAAAGA AATATTAAAT
ACTATAAAAA AAGAGTTGTT GACTCAGTTG ATTCAGCAAA AACTTGCAAA ACAAAAGGCA
AAAGAAGCAG GATATGAAGT AACGAAGGAA AGACTTGATG AGGCTTCGGA AGCAATTGAA
GAGATGATTC GTAATATGGC GGAGCAAATG AAACTTAGCA GTCCGTCCGA AGCTGAAAGC
AGAGATTTTC TCAAAGAAGC AAGGGACTTC ATTAATAGTG AGCTTAAAGC TATGAGAATA
ACGATGGACG AGTATATAAG AGATACTGCC GAATATATGA TTGTAACGGA TTTTATGGAA
GACCTTACAA AGGATATTGT TGTAACCGAC GAAGAAATCA AGAAATATTA TGACGAACAG
TTGAAGATCC AGCAGGAAAA TCCGGAAGAA GCTGCGTATG CCGAAGTACA ATTGATTCAG
CCGGCAAGCT CAAGGGTAAA ACATATATTG ATAGCTTTAC CTGAGGAAGA ACAGCAGGAG
TACCAAAACC TGAAAAGTGA GGGAAAGGAT GAGGAAGCAG AGGCATATTT GAAGGAAAAG
CTCGAAGCAA TAAAGCCAAA GGCTGAAGAA GTACTGAACA AGGCAAAAAA CGGAGAAGAC
TTTGAGGCTC TTATAAAAGA ATACGGTGAA GATCCCGGAA TGGAAAGCGA ACAGTACAAG
GACGGATACA CCGTTACTAA AAACAGCGGA TTTATAAAGA GTTTTGAAGA TGCTTCCCTG
GCTCTTGGAG TAGGCGAGAT ATCGGATCTT GTTGAAGGTC CTTACGGATA TCATATAATA
AAAGTGTATG AGAAGACGGA AGCAAAACCG TATACTCAGG AAGAGAAAAA ATCTGAGATT
GAAAGTCTTT TAAAGAGTCA AAAGAAAACG AATTTCATGA ATGAAAAAAT GAAAGAGTGG
GAAAGTGCTT CTACAATAGT AAGGCATGAT GATTTGCTGT AA
 
Protein sequence
MKKSKSIILV ISIVVVLIAG LSVATYFILK PLFGEKDESN ISPITKQLTE EEASKVIAEV 
NGEQILYKDF YFIYSQQAAY YGLTTEDEDS LSDDTKEILN TIKKELLTQL IQQKLAKQKA
KEAGYEVTKE RLDEASEAIE EMIRNMAEQM KLSSPSEAES RDFLKEARDF INSELKAMRI
TMDEYIRDTA EYMIVTDFME DLTKDIVVTD EEIKKYYDEQ LKIQQENPEE AAYAEVQLIQ
PASSRVKHIL IALPEEEQQE YQNLKSEGKD EEAEAYLKEK LEAIKPKAEE VLNKAKNGED
FEALIKEYGE DPGMESEQYK DGYTVTKNSG FIKSFEDASL ALGVGEISDL VEGPYGYHII
KVYEKTEAKP YTQEEKKSEI ESLLKSQKKT NFMNEKMKEW ESASTIVRHD DLL