Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3049 |
Symbol | |
ID | 4811121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3575203 |
End bp | 3576978 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108470 |
Product | TPR repeat-containing protein |
Protein accession | YP_001039438 |
Protein GI | 125975528 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000157665 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCA CATTAAAATC AATTATCATA ATTTTTATTA TCCTATTCTC CCTTTCTTTA CTGACTTCCA TATCCTATGC TGACTCAGCC ATGCAATATT TTAGCGAAGG AAACTCTTTA TTCGAAGCCG GAAAAATTGA GGAAGCAATA CAAAGTTACA ACAAAGCTAT TGAGCTCAAT CCCAACCTTG CCGAAATTCA CTATAATAAA GGGGTTGCTC TGTTCAATCT GAAAAAATAC AATGAAGCAA TTGAATCCTA CAACCGTTCA ATTGAATTAG CGCCCAACTT TAAGGAAGCT TATCTAAACA AATCAATATG TCTGCTGGTT GTCAGCAAAT TTGAAGAAGC TCTCGAAACC GTCAACAAAT TCATTGAGAT GTCCCCCAAC GAACCTAACG GATACACAGT AAAAGGTTCC ATCCTGATAA TGATTGAAAA GTATGAAGAA GCTCTGGAAG TATCCAATAA AGTAATTGCC ATGAATCCTA ATAACCAATC TGTGCTTTCC GTCGCATATT CCAATAAAGG TTATGCTTTA GTGTGGCTCA AAAAACCCAA AGAAGCCTTG GAAGCATGTA ACAAATCTCT TGAGCTTTCC GGCGACAATC TTGACGCACA CATAGCCATA AGCTTGGCAC ATTCTTCTTT AGGTAACTAT GAAGAAGCGG TTAATTGGTG TGACAAAGCC ATTAAAATCG ATCCCAATGC AGTTGAGCCA TACATTAATA AATCCAACTA TCTTGTAAAT TTAGGAAAAT CCAAAGAAGC GCTTGAATGC TGTAGCAAAG CATCAGAATT AAACGTTCCT CAAAATCCTG TTTATGAATC AATCATTCTC ACGAATAAAT CAGCGGCGCT AATTTTTGAA AATAATTACG AAGAAGCCCT TGCGGCAGCT GAAAAAGCCA TAGAGTTAGA TCCTAAAAAT GCTCTTGCGT ATGTCAACAA AGCCAATGCA CTAAACATGG CGGGTAGTTA TGATGAAGCT CTTTCATTCA GCGACAAGGC AATCGAAATT GATCCAGATT GTGGTGAAGC CTATGGCGCA AAAGGAAGTG CACTATTCTA TCTTGGCAGA TTTGATGAAT CAATAGAGAC ATGCAAAAAG GCCATTGAAT TAAGTCCTGA GAACATTATC CTTTGTGTTC AGGCATATAC CAATATAGGA AGTTCTTTGT CTGAAAAAGG AATGTACGAA GAAGCTCTGA AAAATCTTGA TAAAGCTCTT GAGTTGCCGT CAAAAAATGC TAAAGCCATC TCTATTGCCT ACTCAAACAA AGCCTATGCA CTGATTGGTC TTGAAAAATT TGAAGACGCT CTGGAATGCG CAAACAAGGC TATTGAAGCT GACCCCAGCA ATGTCATGGG ATATTCCAAC AAAAGCTCCG TTTTGATGAG ACTTTCCAGA TACAAGGAAG CTCTTGAATG CTGTGATGAG GCTATCAAGC TAAATATAGC CGATTATGCC GTCTATAATA ACAAAGGACT CGCTTTGGAA AGTCAGGGCA AACTTGGCAA AGCATTGGAA GCGTTCAACA AAAGTCTTGA ACTCAATCCT GACTATAAAA ATGCCCAAGA CAACATCCAA AGAGTCTCAA CAAAACTTAC TATAAGAAGA GTCTTACTCA TTTTTGGTAT ATTTGCATTT GTTACGGCAA TAACTATAGC CACCGTAATT TTCATTGTTT TAAGAAAAAG AAAAACCACA GTACAAAGCA TCAATCCGCC CATGCCCGAA CCGTTTCTCA ACTACGATGA ACTGAATCAA AAATAA
|
Protein sequence | MKTTLKSIII IFIILFSLSL LTSISYADSA MQYFSEGNSL FEAGKIEEAI QSYNKAIELN PNLAEIHYNK GVALFNLKKY NEAIESYNRS IELAPNFKEA YLNKSICLLV VSKFEEALET VNKFIEMSPN EPNGYTVKGS ILIMIEKYEE ALEVSNKVIA MNPNNQSVLS VAYSNKGYAL VWLKKPKEAL EACNKSLELS GDNLDAHIAI SLAHSSLGNY EEAVNWCDKA IKIDPNAVEP YINKSNYLVN LGKSKEALEC CSKASELNVP QNPVYESIIL TNKSAALIFE NNYEEALAAA EKAIELDPKN ALAYVNKANA LNMAGSYDEA LSFSDKAIEI DPDCGEAYGA KGSALFYLGR FDESIETCKK AIELSPENII LCVQAYTNIG SSLSEKGMYE EALKNLDKAL ELPSKNAKAI SIAYSNKAYA LIGLEKFEDA LECANKAIEA DPSNVMGYSN KSSVLMRLSR YKEALECCDE AIKLNIADYA VYNNKGLALE SQGKLGKALE AFNKSLELNP DYKNAQDNIQ RVSTKLTIRR VLLIFGIFAF VTAITIATVI FIVLRKRKTT VQSINPPMPE PFLNYDELNQ K
|
| |