Gene Information |
Locus tag | Cthe_0655 |
Symbol | |
ID | 4808185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 807995 |
End bp | 809140 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106070 |
Product | cysteine desulfurase family protein |
Protein accession | YP_001037083 |
Protein GI | 125973173 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01977] cysteine desulfurase family protein |
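
The length fields in this record agree with one another if the start and end positions are read as inclusive, 1-based coordinates and the protein length excludes the stop codon. A minimal sketch of that arithmetic follows; the coordinate convention is an assumption, and the function names are illustrative rather than part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(807995, 809140)
print(length, protein_length(length))  # 1146 381
```

Both results match the Gene Length (1146 bp) and Protein Length (381 aa) fields listed above.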
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.696219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTATC TTGACAATGC GGCAACTTCA TATCCGAAGC CTGACAGAGT TTATGATGAA ATGCTTTTAT GTATGAAGGA ATACTGTGCA AATCCCGGAA GGTCAGGGCA TGAACTGGCA ATAAAAACGG GAAGGGCGGT GTATGAAACC CGGGAAATTG TGTCAAGATT TTTCAATATC GAAAATCCTA TGAGGGTCGT ATTTACCAAA AATGCAACAG AAGCTTTGAA CCTTGCCATT AACGGAGTAT TGAAAGAAGG GGAACATGTA ATTACTACAA GCATGGAGCA CAACTCTGTT TTAAGACCTT TAAAAACGCT GGAGAGAAAC AATATTATAG AGCTTACCAT AGTATGGGGA AATTATTTCG GTGAAATTGA TGTGGCGGAT ATAGAAAGAA GTATAAAAAA GAACACAAAA ATGATTATAT GTTCCCTTTC CTCCAACGTT AATGGTATAA TAATGCCCGT AAAAGAAATA GGGAAAATAA CCAGGGAAAG GGGAATTTTA TTTCTTGTTG ACGCATCCCA AGGGGCAGGT TCTATAAAGC TGGATGTCCA GGAAATAAAT GCAGATTTGT TTGCCGTTCC GGGCCATAAG GATTTACTTG GTCCCCAGGG AGTAGGTGCA CTTTATGTGA ACGAAAATGT GGAAATAACG CCGATAATGC AGGGGGGTAC AGGCAGTCGT TCTGAAAGTT TTTATCAACC GGAAATATTC CCCGATATGC TGGAAAGCGG AACCTTGAAT GCTCCAGGCA TAGTCGGTTT GGGATTTGGG ATTAAATTTA TAGAAAGTTT CGGAGTTGAC AATATCAGAA TTTACAAGCA TATGCTTATA AAAAGATTAT ATGAAGGCAT TGAAAATTTA AACGGAATAA AGCTTTACAG CCTGAAAGAC ATGGATAAAA ACTCCGGAAT TATTTCCTTC AATTTTATAG GTGTGGATTC TACAAAAGTG AGTTTTATGC TTGACAGAAT ATACGGCATT GCTTCCAGGT CCGGCCTTCA CTGTGCGCCT TTGGCCCATG AAACCATTGG GACAAAAGCA ACCGGTACAG TGAGGCTAAG TGTGGGATGT TTTAATACCA TTGAGGAAAT AGATACTACA ATTGAAGCAT TAAGAGAAAT TTCCCAGGGA TTATAA
|
Protein sequence | MIYLDNAATS YPKPDRVYDE MLLCMKEYCA NPGRSGHELA IKTGRAVYET REIVSRFFNI ENPMRVVFTK NATEALNLAI NGVLKEGEHV ITTSMEHNSV LRPLKTLERN NIIELTIVWG NYFGEIDVAD IERSIKKNTK MIICSLSSNV NGIIMPVKEI GKITRERGIL FLVDASQGAG SIKLDVQEIN ADLFAVPGHK DLLGPQGVGA LYVNENVEIT PIMQGGTGSR SESFYQPEIF PDMLESGTLN APGIVGLGFG IKFIESFGVD NIRIYKHMLI KRLYEGIENL NGIKLYSLKD MDKNSGIISF NFIGVDSTKV SFMLDRIYGI ASRSGLHCAP LAHETIGTKA TGTVRLSVGC FNTIEEIDTT IEALREISQG L
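
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch, using only the Python standard library, shows how such a string could be cleaned, how a GC fraction could be computed, and how the reading frame could be translated. It assumes the standard codon assignments, which translation table 11 shares with table 1 for every codon (the two tables differ in permitted start codons, and this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGATTTATC TTGACAATGC GGCAACTTCA"
print(translate(prefix))  # MIYLDNAATS
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` is one way to cross-check the GC content and Protein sequence fields.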