Gene Information |
Locus tag | Cthe_0581 |
Symbol | |
ID | 4808256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 710639 |
End bp | 712405 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640105995 |
Product | fibronectin-binding A-like protein |
Protein accession | YP_001037010 |
Protein GI | 125973100 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
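The coordinate and length fields in the Gene Information table are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 710639
END_BP = 712405


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1767
print(protein_length(length_bp))  # 588
```

Both results match the Gene Length (1767 bp) and Protein Length (588 aa) fields of the record.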
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCTTTTG ACGGTATAGT AACCAAAAAC ATAGTCAGCG AACTGTCGGA CATCCTTACA GGCGCCCGAA TAGAAAAAAT TTATCAGCCC GAGCCGGATG AGATAATAAT AAACTTAAGA ACAAAAGGCC AAAACCTCAA ACTTTTATTA TCCGCCAATG CAAGCTATCC AAGGATTCAT CTTACCGACG TTACAAAGGA GAATCCTATA AACCCTCCTG TTTTTTGCAT GCTTTTAAGA AAGCATCTTT CAGGAGGAAA GATAACAAAA ATAGATTTTC ATGACTTTGA AAGAGTTGTT ACAATACATA TAGAAACCAT TGACGAATTG GGTGATTTGA CTTTTAAAAA ACTTGTGGTT GAAATAATGG GTAAACATAG CAACATTATA CTTGTAAACA ATGAAAATAA AATAATTGAT TCAATAAAGC ATGTAGACAG TGATATCAGC CGGGTAAGAG AAGTCATGCC GGCAAGACCC TATACCCTTC CTCCCGGCCA GGACAAAATC AATCCTCTGG AGCTTGATAT TGATTTGCTT TTCTCAAAAG CAAAGGAACA AGGGGATCCC CGTATTTCAA AGTTCTTTTT AAACAATATC AAAGGATTCA GTCCCCTTTT ATGTGAAGAA ATATGCCATC GTGCGGATGT TGACCCTCGA TTGCCGGTTT CAAGCCTTTC AGAGGATACT ATTCAAAACT TAAAGAAAGT ATTGAAAGAA ATAATTTCAA AAATTGAGAA TTCAGAATTT ACACCATGTA TAATATGGAA CGGTGATGAC AGGCAAAAAG CCGTAGATTT CCATTCTTTG GAAATAAAAC AATATAATAC GGTGGATTTT TATCCATCCA TCAGCAGAGT TTTGGATTTG TTTTATACGA TCAAAGATAC TTCCGAGAGA CTTGCACAGA AGAAGGCAGA TCTCGCGAAA ATTTTAAACA ACTGCATTGA CCGCTGCAAT AAGAAAATAT CCATCCACAT GGACACACTT AGAGAAGTTG CCGAAAGGGA AAAGTTCAAA CTCTATGGAG AGCTTATCAC TGCAAACATA TATTGTATAC CTAAAAATGC AAGCAAAGTT TCACTTTTGA ACTACTACAG CGAAAACGGC GAGTATGTTG AAGTACCTCT TGACGAAAAC CTCCTTCCTC AGGAGAACGC CCAGAGGTAT TTTAAAAAAT ACGCAAAGGC AAAAGCCGCT TACATTCATG CCACACAGCA ACTTGAAGAA GCCCGCGGCG AACTTTCATA TCTTGAAAGT GTGCTTCACA GCCTTGAAAA CAGCAATTCT TTCGAAGATA TTGACGATAT ACGGCAGGAA CTTGCCGAAC AAGGGTATTT GCCTTCAAAG AAAAAAAGGC CGGAAAAGAA AAATTCAAAA AACTTTACTC CTTATACTTA CAAGTCCACA GACGGTTTTT ATATCTATGT GGGAAAAAAC AATGTGCAAA ATGATTTTTT GACATTGAAA TTTGCATCTT CCAATGACAT CTGGCTTCAT ACGAAGAATA TTCCGGGCTC TCATGTCATA ATAAGAAAAG ATAGAGGAGA AATACCCGAC AGCACCCTGT TTCAGGCAGC CATGCTGGCG GCTTATCACA GCAAGGCAAA GAATTCTTCC CATGTGGAGG TTGATTACAC CAAGGTCAAA AATGTAAAAA AACCTAACGG TGCAAAACCC GGAATGGTAA TCTACGATAA TTATAAAACT ATTATTGTCA CACCTGATGA AAATGTAGTC AATAATTTAA GAATGGAAAA TAGATAA
|
Protein sequence | MPFDGIVTKN IVSELSDILT GARIEKIYQP EPDEIIINLR TKGQNLKLLL SANASYPRIH LTDVTKENPI NPPVFCMLLR KHLSGGKITK IDFHDFERVV TIHIETIDEL GDLTFKKLVV EIMGKHSNII LVNNENKIID SIKHVDSDIS RVREVMPARP YTLPPGQDKI NPLELDIDLL FSKAKEQGDP RISKFFLNNI KGFSPLLCEE ICHRADVDPR LPVSSLSEDT IQNLKKVLKE IISKIENSEF TPCIIWNGDD RQKAVDFHSL EIKQYNTVDF YPSISRVLDL FYTIKDTSER LAQKKADLAK ILNNCIDRCN KKISIHMDTL REVAEREKFK LYGELITANI YCIPKNASKV SLLNYYSENG EYVEVPLDEN LLPQENAQRY FKKYAKAKAA YIHATQQLEE ARGELSYLES VLHSLENSNS FEDIDDIRQE LAEQGYLPSK KKRPEKKNSK NFTPYTYKST DGFYIYVGKN NVQNDFLTLK FASSNDIWLH TKNIPGSHVI IRKDRGEIPD STLFQAAMLA AYHSKAKNSS HVEVDYTKVK NVKKPNGAKP GMVIYDNYKT IIVTPDENVV NNLRMENR
|
| |
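The gene sequence is shown in space-separated blocks of ten bases and begins with TTG, while the protein begins with M. That is consistent with translation table 11, in which TTG is an accepted start codon and is read as methionine at the first position. The sketch below strips the block spacing, computes a GC fraction, and translates a sequence under those rules. It assumes the displayed sequence is already in coding orientation (its first ten codons translate directly to the first ten residues, so no reverse complement is applied despite the "-" strand); the helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; the amino-acid
# assignments of translation table 11 are the same as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    bases = clean(seq)
    return (bases.count("G") + bases.count("C")) / len(bases)


def translate(seq: str) -> str:
    """Translate up to the first stop codon; a valid start codon becomes M."""
    bases = clean(seq)
    usable = len(bases) - len(bases) % 3
    residues = []
    for index in range(0, usable, 3):
        codon = bases[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
first_blocks = "TTGCCTTTTG ACGGTATAGT AACCAAAAAC"
print(translate(first_blocks))  # MPFDGIVTKN
```

Passing the full Gene sequence field to `gc_fraction` and `translate` should allow the GC content and Protein sequence fields to be checked in the same way.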