Gene Cthe_0436 details


Gene Information

Locus tag: Cthe_0436
Symbol:
ID: 4808364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 548775
End bp: 551945
Gene Length: 3171 bp
Protein Length: 1056 aa
Translation table: 11
GC content: 38%
IMG OID: 640105850
Product: tetratricopeptide TPR_2
Protein accession: YP_001036867
Protein GI: 125972957
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
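
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 548775
end_bp = 551945

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon,
# which is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.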


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGAATTT TCAATATATA TACACTGTTC GGAGACAGAA AAAGCAAGGC TGAAAAGTTG 
TCCAAAAAAG GAGACGAATT GTTTCTGAAC AATAAGTATG CCGAATCTGT GAACTATTAC
AAAAAGGCAA TAAAAACATA TGCCAAATAT TTTGAAGCTT ATGTAAACTT AGGTTATACA
TTGATAATAT TGGGAAAATA TGAAGAGTGT ATAAGGTATT GCAACAGGGC ATTGGCTCTT
AATCCTCAGG ATGCTTCGGA GTTGTATTTT ATAAAGGCTG AATGCTTCAA AAAAATGAAG
AGATATCGTG AAGCTCTGGA GAATTATATA AAGGCGGTTG AAATAAGGAA GAGGGTTTTC
TATTTGATTC CACTGGCAAT TCTTCTTTAT GATATGGAGG AATATGACAA GGCTCTGGAG
ATATTTGACA CTCTTGAAGC ATTAGACCTT AAATATAATG ATGATTTGGA AAGTATATTC
CTTTACAAAG GTAAGATAAT GGAAAAGAAA GGCCGTTTCA AAGAAGCTAT AGATTACTTT
GACAAGGCTC TTGAGGTTAA TCCTGCAAAT GCGGAGATAT ATGATAAAAA AGCTTCTTCT
TTGTATTATC TGGGCAGAGA TACGGATGAC ATGGATCTTA TAAAGGAATC AATAATATAT
TATCGAAAAG CGCTGGAGAT AGACGGTGAA TATTTGCACT CATTAAACGG AATTGCAGTT
TCTCTTGAGG TGTTGGGAAA TGCCGATGAA GCTTTGATTT ATTATGATAA AGCACTCGAA
GTTTATCCTG ATTTTGTACT TGTCCATTAC AACAAGGCAA ATTTGTTGAT GAATTTAAGC
AGAAATGAAG AGGCTTTATA TCATTATGAC AAGGCAATAC AGATAGACCG GTATTGTGTT
GATGCCTACA TCGAAAAAGC GGAATTGCTT TGCAAGATGG AAAAGTACGC TGACGCTTTG
AAAGTGTTGG ATAATATTTT GAATATTGTA GAAGCTTCCG ATATCAGAGA CAGAAATGAG
AAAATATGCA CATTGTTAAA GTGCAAGGGT GAAGCGTTTC ATATCATGGG TAAATTTAAC
GAAGCTATTG AGTGCTATGA CAAAGCTCTT GCAGTTGATA AAGACAGAGC GGATGTTCTT
GTGAAAAAGG GGGAAGCTTA TAATCGTTTG GGAATGCCTC AAGAGGCAAT TCTTATGTAC
GAAAAAGCAC TCGGGGTGAG AAATGACTAT TATATAGCCT ATTTTTTAAT GGGAGTTACA
TACAAGCATT TAGATGAGTA CCAATTGGCA CTTGAAGCTT TTGATTGTTA TATAAATGCT
GTGCCTAAAG TACCTGAAGC TTATGTGGAG AGGGCTGAAG TACTGCAATT TATGCAAAGG
TATGAGGAGG CAAAGGAAGA TTGCGACCAA GCCCTTGTGT TGAGGCCACA GTTTGGAAGT
GCATGTTACA GGAAGAGCCT TATTTTATGT GAACTTGGCA AATATGATGA GGCAATAGAA
ATTCTCGAAA AACTGCTCGA TGATGAAGAG TTTTGTGATA TTGCAGGATA TTTCAAGGGT
GTTGCGCTGA AAAATCTGGG AAGGTATGAA GAAGCTTTGG AATATGTGGA TGGATATATA
ACAAAATATC CCGGATACAG AGAACCCTAT CTTGAAAAAG CTGATATTTT GATTGCTCTT
GAAGAATACG AAAAAGCCAT GGAGGCCTGT AACGTTCTGC TTGACAGGGA TGCTGAAGAT
ATCGGTGCTT TGGTAAAAAA GAGCGGTGTG TTTTTCAGAC AGGATAAATT CGAAGAGGCT
CTTAAATGTA TTGAAGATGC CATGGCTTTA TCTTTGGACC ATCATGCTTT GTACTACTAC
AAAGCAGAAA TACTGAGGAA TATGGGGAAA CCTGAGGAGG CTATAGAGTT TTTTGACAAA
TATATTGAGA AAGTTCCCAA CCACCCCAAT CCTTATATTG GCCGTGCGAA GTCGTTATAT
GTAATGCAGG AATATGAGAA AGCCCTGGAA TGCTGTGAAA AGGCAATAAG TCTTGATGAC
AAATATATTG AAGGTTATTA TTCAAAAGCG CACATATTGC TGCAGATGGA CAAATATGAG
GATGTCCTGG AACTGTTGGA TAAAATAAAG GAAATTGATC CGGAGTTTCC TATGTTTTAT
TATGACCGGG CTGAAGTTTT CAAAAGAATG GGAAATCACG AAAAAGCGCT TCAGGAAATC
GATATTTATC TTGAGAAATT TCCGGACGAC GGCTATGCCC ATGAAAAAAG GGCCAATATC
CTGTTTACTT TGGGAAGACT TGACGAGGCC ATCGAGGAAT GCGACAAGGC CATCGAGTTT
GAACCCGAGC TGTTAGATGC TTACTACGGG AAGGGATACA TACTTTATTA TACAGGACGG
TTTAAAGAGT CCTTAAGCTA TTTTGACAAG GTAATTGAGT TAAATTCCAA AAGTGCTTAT
GCTTATTACA GCAAGGGAAA TGCCCTTAAA TATTTGGGAG ACTTTGAAGG CGCTTTGGAA
AATTACAACT ATGCCATAAA TTTGTGGCAT GAATTTGCTG AGTGTTATTC GGCCATAGGT
CATCTTTATT TCCTGGTGGG TAATTATACA AACAGTATGA TTTTCTACGA CAGGGCTGAG
AGTCTAAAAC CGGATTATAT TTATCCATAT ATAGGAAAAT CCCAGCTGTA TATGACGCTG
GGCGACATGG AAAGTGCCAT AAGGTATAGT GACAAAGCTT TGGAAATATC TCCTGATGAT
GCGGAGGTAC ACAATAACAA GGGTAAGATT CTGGGGTATT TTGGAATGTT TGATGAAGCA
GTCAGCTCTT TTCTGACTGC AATTGAACTA AATGACAGTC AGGCGGAATA TTATTATAAT
CTGGGAAATG CCTATCTTAT GATAAATGAG TTTGAAAATG CGATAGAAAG CTATAACAAG
GCTATAAATT TGTATCCGGA GTATGAAGCC GCCTACGTTG GAATCGGCAA GGCGCAGATG
TGCCTTGAAA ATATTGAAGA AGCACTGAAG AATTTTAACA AAGCCATAGA ATTGAATCCG
CGTTCTGCCG AAGCATATTA TTCAAAATCC GAGGCTTTAA GAATACTGGA CGAGGAAGAA
GAAGCGCAGG AGTGCTATGA AAAGGCTTTG GAGCTTGGGT ATAATGCATA G
 
Protein sequence
MGIFNIYTLF GDRKSKAEKL SKKGDELFLN NKYAESVNYY KKAIKTYAKY FEAYVNLGYT 
LIILGKYEEC IRYCNRALAL NPQDASELYF IKAECFKKMK RYREALENYI KAVEIRKRVF
YLIPLAILLY DMEEYDKALE IFDTLEALDL KYNDDLESIF LYKGKIMEKK GRFKEAIDYF
DKALEVNPAN AEIYDKKASS LYYLGRDTDD MDLIKESIIY YRKALEIDGE YLHSLNGIAV
SLEVLGNADE ALIYYDKALE VYPDFVLVHY NKANLLMNLS RNEEALYHYD KAIQIDRYCV
DAYIEKAELL CKMEKYADAL KVLDNILNIV EASDIRDRNE KICTLLKCKG EAFHIMGKFN
EAIECYDKAL AVDKDRADVL VKKGEAYNRL GMPQEAILMY EKALGVRNDY YIAYFLMGVT
YKHLDEYQLA LEAFDCYINA VPKVPEAYVE RAEVLQFMQR YEEAKEDCDQ ALVLRPQFGS
ACYRKSLILC ELGKYDEAIE ILEKLLDDEE FCDIAGYFKG VALKNLGRYE EALEYVDGYI
TKYPGYREPY LEKADILIAL EEYEKAMEAC NVLLDRDAED IGALVKKSGV FFRQDKFEEA
LKCIEDAMAL SLDHHALYYY KAEILRNMGK PEEAIEFFDK YIEKVPNHPN PYIGRAKSLY
VMQEYEKALE CCEKAISLDD KYIEGYYSKA HILLQMDKYE DVLELLDKIK EIDPEFPMFY
YDRAEVFKRM GNHEKALQEI DIYLEKFPDD GYAHEKRANI LFTLGRLDEA IEECDKAIEF
EPELLDAYYG KGYILYYTGR FKESLSYFDK VIELNSKSAY AYYSKGNALK YLGDFEGALE
NYNYAINLWH EFAECYSAIG HLYFLVGNYT NSMIFYDRAE SLKPDYIYPY IGKSQLYMTL
GDMESAIRYS DKALEISPDD AEVHNNKGKI LGYFGMFDEA VSSFLTAIEL NDSQAEYYYN
LGNAYLMINE FENAIESYNK AINLYPEYEA AYVGIGKAQM CLENIEEALK NFNKAIELNP
RSAEAYYSKS EALRILDEEE EAQECYEKAL ELGYNA
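
The protein sequence corresponds to the gene sequence translated with translation table 11, in which the initial TTG codon is read as methionine. The following is a minimal sketch of that translation in plain Python. The function name `translate_table11` and the constant names are hypothetical, the codon assignments are those of the standard code (which table 11 shares), and the set of alternative start codons is an assumption taken from the NCBI description of table 11.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: start codons accepted for table 11; any of them in the first
# position is translated as methionine.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_table11(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate_table11("TTGGGAATTT TCAATATATA TACACTGTTC"))
```

Applied to the first three 10-base blocks of the gene sequence, the sketch reproduces the first ten residues of the protein sequence. The function skips whitespace, so the sequence can be pasted in the 10-base block layout used in this record.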