Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0555 |
Symbol | |
ID | 4808230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 680567 |
End bp | 681808 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640105969 |
Product | PpiC-type peptidyl-prolyl cis-trans isomerase |
Protein accession | YP_001036984 |
Protein GI | 125973074 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000887936 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT CAAAAAGTAT TATATTGGTT ATTAGTATAG TTGTAGTGCT GATTGCGGGT TTGTCAGTGG CAACTTATTT TATACTGAAA CCCTTGTTTG GAGAAAAGGA TGAAAGTAAT ATTTCTCCGA TTACCAAACA GTTGACGGAA GAAGAGGCAA GTAAAGTAAT TGCTGAAGTA AACGGGGAGC AGATACTATA CAAAGATTTC TATTTTATAT ACAGCCAGCA GGCGGCATAT TATGGTCTTA CTACTGAGGA TGAAGATTCA CTGAGTGACG ATACGAAAGA AATATTAAAT ACTATAAAAA AAGAGTTGTT GACTCAGTTG ATTCAGCAAA AACTTGCAAA ACAAAAGGCA AAAGAAGCAG GATATGAAGT AACGAAGGAA AGACTTGATG AGGCTTCGGA AGCAATTGAA GAGATGATTC GTAATATGGC GGAGCAAATG AAACTTAGCA GTCCGTCCGA AGCTGAAAGC AGAGATTTTC TCAAAGAAGC AAGGGACTTC ATTAATAGTG AGCTTAAAGC TATGAGAATA ACGATGGACG AGTATATAAG AGATACTGCC GAATATATGA TTGTAACGGA TTTTATGGAA GACCTTACAA AGGATATTGT TGTAACCGAC GAAGAAATCA AGAAATATTA TGACGAACAG TTGAAGATCC AGCAGGAAAA TCCGGAAGAA GCTGCGTATG CCGAAGTACA ATTGATTCAG CCGGCAAGCT CAAGGGTAAA ACATATATTG ATAGCTTTAC CTGAGGAAGA ACAGCAGGAG TACCAAAACC TGAAAAGTGA GGGAAAGGAT GAGGAAGCAG AGGCATATTT GAAGGAAAAG CTCGAAGCAA TAAAGCCAAA GGCTGAAGAA GTACTGAACA AGGCAAAAAA CGGAGAAGAC TTTGAGGCTC TTATAAAAGA ATACGGTGAA GATCCCGGAA TGGAAAGCGA ACAGTACAAG GACGGATACA CCGTTACTAA AAACAGCGGA TTTATAAAGA GTTTTGAAGA TGCTTCCCTG GCTCTTGGAG TAGGCGAGAT ATCGGATCTT GTTGAAGGTC CTTACGGATA TCATATAATA AAAGTGTATG AGAAGACGGA AGCAAAACCG TATACTCAGG AAGAGAAAAA ATCTGAGATT GAAAGTCTTT TAAAGAGTCA AAAGAAAACG AATTTCATGA ATGAAAAAAT GAAAGAGTGG GAAAGTGCTT CTACAATAGT AAGGCATGAT GATTTGCTGT AA
|
Protein sequence | MKKSKSIILV ISIVVVLIAG LSVATYFILK PLFGEKDESN ISPITKQLTE EEASKVIAEV NGEQILYKDF YFIYSQQAAY YGLTTEDEDS LSDDTKEILN TIKKELLTQL IQQKLAKQKA KEAGYEVTKE RLDEASEAIE EMIRNMAEQM KLSSPSEAES RDFLKEARDF INSELKAMRI TMDEYIRDTA EYMIVTDFME DLTKDIVVTD EEIKKYYDEQ LKIQQENPEE AAYAEVQLIQ PASSRVKHIL IALPEEEQQE YQNLKSEGKD EEAEAYLKEK LEAIKPKAEE VLNKAKNGED FEALIKEYGE DPGMESEQYK DGYTVTKNSG FIKSFEDASL ALGVGEISDL VEGPYGYHII KVYEKTEAKP YTQEEKKSEI ESLLKSQKKT NFMNEKMKEW ESASTIVRHD DLL
|
| |