Gene Cthe_1559 details


Gene Information

Locus tag: Cthe_1559
Symbol:
ID: 4810066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1887838
End bp: 1889001
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 46%
IMG OID: 640106977
Product: cystathionine gamma-lyase
Protein accession: YP_001037978
Protein GI: 125974068
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID:
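
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length; neither convention is stated in the record, but both agree with the reported values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Cthe_1559.
length = gene_length_bp(1887838, 1889001)
residues = protein_length_aa(length)
print(length, residues)  # 1164 387
```

Both results match the Gene Length and Protein Length fields of the record.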


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCACATT ACAAACATAT TGAATCGGCA GTCATCCATG GTGGCATTTA TGGAGATTTC 
CATACCGGTT CGGTAAATAC CCCTATTTAT CAAACCTCCA CCTTTGAACA GGACGGTTTG
GGCAAGCCCC GCTCAAATTG GGAATATTCC CGAACGGGAA ATCCCACTCG GGCAGCTTTG
GAGGCTTTGA TTGCAGAGTT GGAGGGTGGG TCCCGGGGAT TTGCATTTTC TTCCGGTATG
GCAGCCATTG ATGCAGTTCT GCATCTCTTC CAATCCGGAG ACAGCGTCAT TATTTCCGAC
AATGTATATG GAGGGACTTT TCGAATTCTG GATAAAATCT TCAAGCAGTA TGGCTTAAAC
TATAAAATTG TGGACACCAC TGATTTGGCA GCACTCGAAA GTGCATTTAC TTCGGATGTT
AAAGCTTTGT TGCTTGAATC CCCGGCCAAT CCGCTGCTCA AAGTTACGGA TATCGCGGCG
GCAGCTGAGA TAGCAAGATC CAAAGGAGCG CTGACTGTAG TGGATAACAC CTTTATGACC
CCTTATCTTC AACGGCCTTT AGAGCTTGGA GCGGATATCG TCGTGCATTC GGCAACCAAA
TATCTTGGCG GACATAGCGA TGTCATTGCA GGACTTGTCA TCGTTAAAGA CGGTGAACTG
GCAGAAAAGC TGCATTTCAT ACAAAATGCG GTGGGTGCCG TTGCCGGGCC GTTTGATTCT
TTCCTGCTCA TTCGAAGTAT CAAGACGTTG GCAGTGCGCA TGGAAGCCCA TGTGGCCAAC
GCAGAAAAAC TAGCAGAGGC TTTAAAAAGT AATCCGGCAG TTAAAAACGT CTATTATCCC
GGCTTAAAAT CCGCTCAAGG ATATGAGATT CAAAAGAGAC AGGCAAAAAA CGGCGGAGCC
ATGATTTCCT TTGAGTTACA TAACAATTAT GACATCAACA GGTTTTTTGA AGGTTTGGAG
TTGATTGCCC TTGCGGAAAG CTTGGGCGGT GTTGAAAGTC TTGTCTGCCA TCCTTCAAGC
ATGACCCATG CATCTGTTCC AAAGGAAATA CGCGAAAAGA TCGGCATCAC GGATACATTG
ATCCGCTTGT CGGTAGGTAT TGAAAATTAT GATGATTTAA AAAACGATTT ATTTTCTGCT
ATAAAAGGAG CGCGAGTACT ATGA
 
Protein sequence
MAHYKHIESA VIHGGIYGDF HTGSVNTPIY QTSTFEQDGL GKPRSNWEYS RTGNPTRAAL 
EALIAELEGG SRGFAFSSGM AAIDAVLHLF QSGDSVIISD NVYGGTFRIL DKIFKQYGLN
YKIVDTTDLA ALESAFTSDV KALLLESPAN PLLKVTDIAA AAEIARSKGA LTVVDNTFMT
PYLQRPLELG ADIVVHSATK YLGGHSDVIA GLVIVKDGEL AEKLHFIQNA VGAVAGPFDS
FLLIRSIKTL AVRMEAHVAN AEKLAEALKS NPAVKNVYYP GLKSAQGYEI QKRQAKNGGA
MISFELHNNY DINRFFEGLE LIALAESLGG VESLVCHPSS MTHASVPKEI REKIGITDTL
IRLSVGIENY DDLKNDLFSA IKGARVL