Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0436 |
Symbol | |
ID | 4808364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 548775 |
End bp | 551945 |
Gene Length | 3171 bp |
Protein Length | 1056 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640105850 |
Product | tetratricopeptide TPR_2 |
Protein accession | YP_001036867 |
Protein GI | 125972957 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGAATTT TCAATATATA TACACTGTTC GGAGACAGAA AAAGCAAGGC TGAAAAGTTG TCCAAAAAAG GAGACGAATT GTTTCTGAAC AATAAGTATG CCGAATCTGT GAACTATTAC AAAAAGGCAA TAAAAACATA TGCCAAATAT TTTGAAGCTT ATGTAAACTT AGGTTATACA TTGATAATAT TGGGAAAATA TGAAGAGTGT ATAAGGTATT GCAACAGGGC ATTGGCTCTT AATCCTCAGG ATGCTTCGGA GTTGTATTTT ATAAAGGCTG AATGCTTCAA AAAAATGAAG AGATATCGTG AAGCTCTGGA GAATTATATA AAGGCGGTTG AAATAAGGAA GAGGGTTTTC TATTTGATTC CACTGGCAAT TCTTCTTTAT GATATGGAGG AATATGACAA GGCTCTGGAG ATATTTGACA CTCTTGAAGC ATTAGACCTT AAATATAATG ATGATTTGGA AAGTATATTC CTTTACAAAG GTAAGATAAT GGAAAAGAAA GGCCGTTTCA AAGAAGCTAT AGATTACTTT GACAAGGCTC TTGAGGTTAA TCCTGCAAAT GCGGAGATAT ATGATAAAAA AGCTTCTTCT TTGTATTATC TGGGCAGAGA TACGGATGAC ATGGATCTTA TAAAGGAATC AATAATATAT TATCGAAAAG CGCTGGAGAT AGACGGTGAA TATTTGCACT CATTAAACGG AATTGCAGTT TCTCTTGAGG TGTTGGGAAA TGCCGATGAA GCTTTGATTT ATTATGATAA AGCACTCGAA GTTTATCCTG ATTTTGTACT TGTCCATTAC AACAAGGCAA ATTTGTTGAT GAATTTAAGC AGAAATGAAG AGGCTTTATA TCATTATGAC AAGGCAATAC AGATAGACCG GTATTGTGTT GATGCCTACA TCGAAAAAGC GGAATTGCTT TGCAAGATGG AAAAGTACGC TGACGCTTTG AAAGTGTTGG ATAATATTTT GAATATTGTA GAAGCTTCCG ATATCAGAGA CAGAAATGAG AAAATATGCA CATTGTTAAA GTGCAAGGGT GAAGCGTTTC ATATCATGGG TAAATTTAAC GAAGCTATTG AGTGCTATGA CAAAGCTCTT GCAGTTGATA AAGACAGAGC GGATGTTCTT GTGAAAAAGG GGGAAGCTTA TAATCGTTTG GGAATGCCTC AAGAGGCAAT TCTTATGTAC GAAAAAGCAC TCGGGGTGAG AAATGACTAT TATATAGCCT ATTTTTTAAT GGGAGTTACA TACAAGCATT TAGATGAGTA CCAATTGGCA CTTGAAGCTT TTGATTGTTA TATAAATGCT GTGCCTAAAG TACCTGAAGC TTATGTGGAG AGGGCTGAAG TACTGCAATT TATGCAAAGG TATGAGGAGG CAAAGGAAGA TTGCGACCAA GCCCTTGTGT TGAGGCCACA GTTTGGAAGT GCATGTTACA GGAAGAGCCT TATTTTATGT GAACTTGGCA AATATGATGA GGCAATAGAA ATTCTCGAAA AACTGCTCGA TGATGAAGAG TTTTGTGATA TTGCAGGATA TTTCAAGGGT GTTGCGCTGA AAAATCTGGG AAGGTATGAA GAAGCTTTGG AATATGTGGA TGGATATATA ACAAAATATC CCGGATACAG AGAACCCTAT CTTGAAAAAG CTGATATTTT GATTGCTCTT GAAGAATACG AAAAAGCCAT GGAGGCCTGT AACGTTCTGC TTGACAGGGA TGCTGAAGAT ATCGGTGCTT TGGTAAAAAA GAGCGGTGTG TTTTTCAGAC AGGATAAATT CGAAGAGGCT CTTAAATGTA TTGAAGATGC CATGGCTTTA TCTTTGGACC ATCATGCTTT GTACTACTAC AAAGCAGAAA TACTGAGGAA TATGGGGAAA CCTGAGGAGG CTATAGAGTT TTTTGACAAA TATATTGAGA AAGTTCCCAA CCACCCCAAT CCTTATATTG GCCGTGCGAA GTCGTTATAT GTAATGCAGG AATATGAGAA AGCCCTGGAA TGCTGTGAAA AGGCAATAAG TCTTGATGAC AAATATATTG AAGGTTATTA TTCAAAAGCG CACATATTGC TGCAGATGGA CAAATATGAG GATGTCCTGG AACTGTTGGA TAAAATAAAG GAAATTGATC CGGAGTTTCC TATGTTTTAT TATGACCGGG CTGAAGTTTT CAAAAGAATG GGAAATCACG AAAAAGCGCT TCAGGAAATC GATATTTATC TTGAGAAATT TCCGGACGAC GGCTATGCCC ATGAAAAAAG GGCCAATATC CTGTTTACTT TGGGAAGACT TGACGAGGCC ATCGAGGAAT GCGACAAGGC CATCGAGTTT GAACCCGAGC TGTTAGATGC TTACTACGGG AAGGGATACA TACTTTATTA TACAGGACGG TTTAAAGAGT CCTTAAGCTA TTTTGACAAG GTAATTGAGT TAAATTCCAA AAGTGCTTAT GCTTATTACA GCAAGGGAAA TGCCCTTAAA TATTTGGGAG ACTTTGAAGG CGCTTTGGAA AATTACAACT ATGCCATAAA TTTGTGGCAT GAATTTGCTG AGTGTTATTC GGCCATAGGT CATCTTTATT TCCTGGTGGG TAATTATACA AACAGTATGA TTTTCTACGA CAGGGCTGAG AGTCTAAAAC CGGATTATAT TTATCCATAT ATAGGAAAAT CCCAGCTGTA TATGACGCTG GGCGACATGG AAAGTGCCAT AAGGTATAGT GACAAAGCTT TGGAAATATC TCCTGATGAT GCGGAGGTAC ACAATAACAA GGGTAAGATT CTGGGGTATT TTGGAATGTT TGATGAAGCA GTCAGCTCTT TTCTGACTGC AATTGAACTA AATGACAGTC AGGCGGAATA TTATTATAAT CTGGGAAATG CCTATCTTAT GATAAATGAG TTTGAAAATG CGATAGAAAG CTATAACAAG GCTATAAATT TGTATCCGGA GTATGAAGCC GCCTACGTTG GAATCGGCAA GGCGCAGATG TGCCTTGAAA ATATTGAAGA AGCACTGAAG AATTTTAACA AAGCCATAGA ATTGAATCCG CGTTCTGCCG AAGCATATTA TTCAAAATCC GAGGCTTTAA GAATACTGGA CGAGGAAGAA GAAGCGCAGG AGTGCTATGA AAAGGCTTTG GAGCTTGGGT ATAATGCATA G
|
Protein sequence | MGIFNIYTLF GDRKSKAEKL SKKGDELFLN NKYAESVNYY KKAIKTYAKY FEAYVNLGYT LIILGKYEEC IRYCNRALAL NPQDASELYF IKAECFKKMK RYREALENYI KAVEIRKRVF YLIPLAILLY DMEEYDKALE IFDTLEALDL KYNDDLESIF LYKGKIMEKK GRFKEAIDYF DKALEVNPAN AEIYDKKASS LYYLGRDTDD MDLIKESIIY YRKALEIDGE YLHSLNGIAV SLEVLGNADE ALIYYDKALE VYPDFVLVHY NKANLLMNLS RNEEALYHYD KAIQIDRYCV DAYIEKAELL CKMEKYADAL KVLDNILNIV EASDIRDRNE KICTLLKCKG EAFHIMGKFN EAIECYDKAL AVDKDRADVL VKKGEAYNRL GMPQEAILMY EKALGVRNDY YIAYFLMGVT YKHLDEYQLA LEAFDCYINA VPKVPEAYVE RAEVLQFMQR YEEAKEDCDQ ALVLRPQFGS ACYRKSLILC ELGKYDEAIE ILEKLLDDEE FCDIAGYFKG VALKNLGRYE EALEYVDGYI TKYPGYREPY LEKADILIAL EEYEKAMEAC NVLLDRDAED IGALVKKSGV FFRQDKFEEA LKCIEDAMAL SLDHHALYYY KAEILRNMGK PEEAIEFFDK YIEKVPNHPN PYIGRAKSLY VMQEYEKALE CCEKAISLDD KYIEGYYSKA HILLQMDKYE DVLELLDKIK EIDPEFPMFY YDRAEVFKRM GNHEKALQEI DIYLEKFPDD GYAHEKRANI LFTLGRLDEA IEECDKAIEF EPELLDAYYG KGYILYYTGR FKESLSYFDK VIELNSKSAY AYYSKGNALK YLGDFEGALE NYNYAINLWH EFAECYSAIG HLYFLVGNYT NSMIFYDRAE SLKPDYIYPY IGKSQLYMTL GDMESAIRYS DKALEISPDD AEVHNNKGKI LGYFGMFDEA VSSFLTAIEL NDSQAEYYYN LGNAYLMINE FENAIESYNK AINLYPEYEA AYVGIGKAQM CLENIEEALK NFNKAIELNP RSAEAYYSKS EALRILDEEE EAQECYEKAL ELGYNA
|
| |