Gene Cthe_0581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0581 
Symbol 
ID4808256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp710639 
End bp712405 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content38% 
IMG OID640105995 
Productfibronectin-binding A-like protein 
Protein accessionYP_001037010 
Protein GI125973100 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCTTTTG ACGGTATAGT AACCAAAAAC ATAGTCAGCG AACTGTCGGA CATCCTTACA 
GGCGCCCGAA TAGAAAAAAT TTATCAGCCC GAGCCGGATG AGATAATAAT AAACTTAAGA
ACAAAAGGCC AAAACCTCAA ACTTTTATTA TCCGCCAATG CAAGCTATCC AAGGATTCAT
CTTACCGACG TTACAAAGGA GAATCCTATA AACCCTCCTG TTTTTTGCAT GCTTTTAAGA
AAGCATCTTT CAGGAGGAAA GATAACAAAA ATAGATTTTC ATGACTTTGA AAGAGTTGTT
ACAATACATA TAGAAACCAT TGACGAATTG GGTGATTTGA CTTTTAAAAA ACTTGTGGTT
GAAATAATGG GTAAACATAG CAACATTATA CTTGTAAACA ATGAAAATAA AATAATTGAT
TCAATAAAGC ATGTAGACAG TGATATCAGC CGGGTAAGAG AAGTCATGCC GGCAAGACCC
TATACCCTTC CTCCCGGCCA GGACAAAATC AATCCTCTGG AGCTTGATAT TGATTTGCTT
TTCTCAAAAG CAAAGGAACA AGGGGATCCC CGTATTTCAA AGTTCTTTTT AAACAATATC
AAAGGATTCA GTCCCCTTTT ATGTGAAGAA ATATGCCATC GTGCGGATGT TGACCCTCGA
TTGCCGGTTT CAAGCCTTTC AGAGGATACT ATTCAAAACT TAAAGAAAGT ATTGAAAGAA
ATAATTTCAA AAATTGAGAA TTCAGAATTT ACACCATGTA TAATATGGAA CGGTGATGAC
AGGCAAAAAG CCGTAGATTT CCATTCTTTG GAAATAAAAC AATATAATAC GGTGGATTTT
TATCCATCCA TCAGCAGAGT TTTGGATTTG TTTTATACGA TCAAAGATAC TTCCGAGAGA
CTTGCACAGA AGAAGGCAGA TCTCGCGAAA ATTTTAAACA ACTGCATTGA CCGCTGCAAT
AAGAAAATAT CCATCCACAT GGACACACTT AGAGAAGTTG CCGAAAGGGA AAAGTTCAAA
CTCTATGGAG AGCTTATCAC TGCAAACATA TATTGTATAC CTAAAAATGC AAGCAAAGTT
TCACTTTTGA ACTACTACAG CGAAAACGGC GAGTATGTTG AAGTACCTCT TGACGAAAAC
CTCCTTCCTC AGGAGAACGC CCAGAGGTAT TTTAAAAAAT ACGCAAAGGC AAAAGCCGCT
TACATTCATG CCACACAGCA ACTTGAAGAA GCCCGCGGCG AACTTTCATA TCTTGAAAGT
GTGCTTCACA GCCTTGAAAA CAGCAATTCT TTCGAAGATA TTGACGATAT ACGGCAGGAA
CTTGCCGAAC AAGGGTATTT GCCTTCAAAG AAAAAAAGGC CGGAAAAGAA AAATTCAAAA
AACTTTACTC CTTATACTTA CAAGTCCACA GACGGTTTTT ATATCTATGT GGGAAAAAAC
AATGTGCAAA ATGATTTTTT GACATTGAAA TTTGCATCTT CCAATGACAT CTGGCTTCAT
ACGAAGAATA TTCCGGGCTC TCATGTCATA ATAAGAAAAG ATAGAGGAGA AATACCCGAC
AGCACCCTGT TTCAGGCAGC CATGCTGGCG GCTTATCACA GCAAGGCAAA GAATTCTTCC
CATGTGGAGG TTGATTACAC CAAGGTCAAA AATGTAAAAA AACCTAACGG TGCAAAACCC
GGAATGGTAA TCTACGATAA TTATAAAACT ATTATTGTCA CACCTGATGA AAATGTAGTC
AATAATTTAA GAATGGAAAA TAGATAA
 
Protein sequence
MPFDGIVTKN IVSELSDILT GARIEKIYQP EPDEIIINLR TKGQNLKLLL SANASYPRIH 
LTDVTKENPI NPPVFCMLLR KHLSGGKITK IDFHDFERVV TIHIETIDEL GDLTFKKLVV
EIMGKHSNII LVNNENKIID SIKHVDSDIS RVREVMPARP YTLPPGQDKI NPLELDIDLL
FSKAKEQGDP RISKFFLNNI KGFSPLLCEE ICHRADVDPR LPVSSLSEDT IQNLKKVLKE
IISKIENSEF TPCIIWNGDD RQKAVDFHSL EIKQYNTVDF YPSISRVLDL FYTIKDTSER
LAQKKADLAK ILNNCIDRCN KKISIHMDTL REVAEREKFK LYGELITANI YCIPKNASKV
SLLNYYSENG EYVEVPLDEN LLPQENAQRY FKKYAKAKAA YIHATQQLEE ARGELSYLES
VLHSLENSNS FEDIDDIRQE LAEQGYLPSK KKRPEKKNSK NFTPYTYKST DGFYIYVGKN
NVQNDFLTLK FASSNDIWLH TKNIPGSHVI IRKDRGEIPD STLFQAAMLA AYHSKAKNSS
HVEVDYTKVK NVKKPNGAKP GMVIYDNYKT IIVTPDENVV NNLRMENR