Gene Cthe_0655 details


Gene Information

Locus tag: Cthe_0655
Symbol:
ID: 4808185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 807995
End bp: 809140
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 39%
IMG OID: 640106070
Product: cysteine desulfurase family protein
Protein accession: YP_001037083
Protein GI: 125973173
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01977] cysteine desulfurase family protein
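
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is not part of the database record; it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 807995
END_BP = 809140


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1146
print(protein_length(length_bp))  # 381
```

Under these assumptions both results agree with the Gene Length (1146 bp) and Protein Length (381 aa) fields.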


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.696219
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTATC TTGACAATGC GGCAACTTCA TATCCGAAGC CTGACAGAGT TTATGATGAA 
ATGCTTTTAT GTATGAAGGA ATACTGTGCA AATCCCGGAA GGTCAGGGCA TGAACTGGCA
ATAAAAACGG GAAGGGCGGT GTATGAAACC CGGGAAATTG TGTCAAGATT TTTCAATATC
GAAAATCCTA TGAGGGTCGT ATTTACCAAA AATGCAACAG AAGCTTTGAA CCTTGCCATT
AACGGAGTAT TGAAAGAAGG GGAACATGTA ATTACTACAA GCATGGAGCA CAACTCTGTT
TTAAGACCTT TAAAAACGCT GGAGAGAAAC AATATTATAG AGCTTACCAT AGTATGGGGA
AATTATTTCG GTGAAATTGA TGTGGCGGAT ATAGAAAGAA GTATAAAAAA GAACACAAAA
ATGATTATAT GTTCCCTTTC CTCCAACGTT AATGGTATAA TAATGCCCGT AAAAGAAATA
GGGAAAATAA CCAGGGAAAG GGGAATTTTA TTTCTTGTTG ACGCATCCCA AGGGGCAGGT
TCTATAAAGC TGGATGTCCA GGAAATAAAT GCAGATTTGT TTGCCGTTCC GGGCCATAAG
GATTTACTTG GTCCCCAGGG AGTAGGTGCA CTTTATGTGA ACGAAAATGT GGAAATAACG
CCGATAATGC AGGGGGGTAC AGGCAGTCGT TCTGAAAGTT TTTATCAACC GGAAATATTC
CCCGATATGC TGGAAAGCGG AACCTTGAAT GCTCCAGGCA TAGTCGGTTT GGGATTTGGG
ATTAAATTTA TAGAAAGTTT CGGAGTTGAC AATATCAGAA TTTACAAGCA TATGCTTATA
AAAAGATTAT ATGAAGGCAT TGAAAATTTA AACGGAATAA AGCTTTACAG CCTGAAAGAC
ATGGATAAAA ACTCCGGAAT TATTTCCTTC AATTTTATAG GTGTGGATTC TACAAAAGTG
AGTTTTATGC TTGACAGAAT ATACGGCATT GCTTCCAGGT CCGGCCTTCA CTGTGCGCCT
TTGGCCCATG AAACCATTGG GACAAAAGCA ACCGGTACAG TGAGGCTAAG TGTGGGATGT
TTTAATACCA TTGAGGAAAT AGATACTACA ATTGAAGCAT TAAGAGAAAT TTCCCAGGGA
TTATAA
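
The gene sequence above is laid out in blocks of 10 bases, so the spaces and line breaks need to be removed before any computation. The following is a minimal sketch of how a GC fraction such as the one reported in the GC content field could be computed. The helper names are hypothetical, and the database may use a different method or rounding rule.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


# First 30 bases of the gene sequence, kept in the page's 10-base block layout.
example = "ATGATTTATC TTGACAATGC GGCAACTTCA"
print(len(clean_sequence(example)))   # 30
print(f"{gc_content(example):.0%}")   # 37%
```

Passing the full gene sequence to `gc_content` in place of `example` gives the fraction for the whole gene.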
 
Protein sequence
MIYLDNAATS YPKPDRVYDE MLLCMKEYCA NPGRSGHELA IKTGRAVYET REIVSRFFNI 
ENPMRVVFTK NATEALNLAI NGVLKEGEHV ITTSMEHNSV LRPLKTLERN NIIELTIVWG
NYFGEIDVAD IERSIKKNTK MIICSLSSNV NGIIMPVKEI GKITRERGIL FLVDASQGAG
SIKLDVQEIN ADLFAVPGHK DLLGPQGVGA LYVNENVEIT PIMQGGTGSR SESFYQPEIF
PDMLESGTLN APGIVGLGFG IKFIESFGVD NIRIYKHMLI KRLYEGIENL NGIKLYSLKD
MDKNSGIISF NFIGVDSTKV SFMLDRIYGI ASRSGLHCAP LAHETIGTKA TGTVRLSVGC
FNTIEEIDTT IEALREISQG L
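
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a hedged illustration, the sketch below builds the codon table from the standard amino acid assignments, which table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The `translate` function is a hypothetical helper, not the database's own tool, and it does not convert alternative start codons to M.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGATTTATC TTGACAATGC GGCAACTTCA"))  # MIYLDNAATS
```

The output matches the first ten residues of the protein sequence in this record.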