Gene Cthe_2799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2799 
Symbol 
ID4810116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3298648 
End bp3299808 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content44% 
IMG OID640108219 
Productcystathionine gamma-synthase 
Protein accessionYP_001039191 
Protein GI125975281 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000029873 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAG TTGGGAATGT GTCAAACTAT AGTATAAGTA CAAAAGTGGT ACATGGTTCA 
AAGTGTTATG ACCCGCATAC CGGGGCGGTA AGTTTCCCCA TATATCAAAG TGCTACTTTC
AGACATCCGG CGCTCTATCA GACAACGGGT TATGATTATT CACGCTTGCA GAATCCGACA
AGGGAAGAAC TTGAAAACAC CATTGCAAAT ATCGAAAACG GGAAGTTTGG ATTTGCCTTT
TCCAGCGGCA TGGCGGCAGT ATCCACCATA CTGTCTCTTT TTTCACCCAA AGACCATATC
ATTGTTTCCG ATGACCTTTA TGGTGGTACT TACAGACTGT TTGAGGAAAT ATACAAAAAA
TACGGTTTGG AATTTTCCTA TGTCAACACA AGCAGGATTC AGGACATAGA AGAAGCTGTG
AAAGAGAACA CAAAGGCGTT TTTTATTGAG ACCCCCACAA ACCCGATGAT GAAGGTGGCC
GATTTAAAGA CGATATCGCG GTTTGCAAAA GACAGGAAAA TACTTTTGAT TGTGGACAAT
ACTTTTCTTA CACCGTATTT TCAGAGGCCC TTGGAGCTGG GGGCGGATAT TGTGGTTCAC
AGCGGAACGA AATATCTCGG GGGACATAAC GATACTTTGG CGGGTCTTGT TGTAGTTAAT
GATGAAGAGC TTGCCGAAAG GATAAAACTT ATTCAAAAAT CGGAAGGGGC CGTACTGTCT
CCTTTTGACA GCTGGCTGAT TTTAAGAGGT ATAAAGACGC TGGGGGTACG CCTTGAAAAG
CAGCAGGAAA ATGCCATGAA AATTGCAAAA TGGCTTTGTA CCCATAAAAA TGTCACAAAG
GTCAACTATG TGGGATTGCC CGACCATGAA GGCTATGAAA TTTCGAAATC CCAGGCTTCC
GGTTTTGGAG CCATGATTTC CTTTAACGTA AAAGACGTTC AGACTGTGGA AAAGGTTTTA
AGCAAGGTGC AGCTTGTAAT GTTTGCTGAG AGCCTCGGCG GTGTGGAAAG CTTGATTACC
TATCCTGCCG TTCAGACCCA TGCTGCCATA CCGGAAGAAA TGAGAAATAG AATCGGGGTT
ACCGATACGC TTTTAAGGCT TTCGGTGGGA ATTGAGGATG CAGACGATAT AATTGCCGAC
CTTGAGCAGG CCTTGGAATA G
 
Protein sequence
MMKVGNVSNY SISTKVVHGS KCYDPHTGAV SFPIYQSATF RHPALYQTTG YDYSRLQNPT 
REELENTIAN IENGKFGFAF SSGMAAVSTI LSLFSPKDHI IVSDDLYGGT YRLFEEIYKK
YGLEFSYVNT SRIQDIEEAV KENTKAFFIE TPTNPMMKVA DLKTISRFAK DRKILLIVDN
TFLTPYFQRP LELGADIVVH SGTKYLGGHN DTLAGLVVVN DEELAERIKL IQKSEGAVLS
PFDSWLILRG IKTLGVRLEK QQENAMKIAK WLCTHKNVTK VNYVGLPDHE GYEISKSQAS
GFGAMISFNV KDVQTVEKVL SKVQLVMFAE SLGGVESLIT YPAVQTHAAI PEEMRNRIGV
TDTLLRLSVG IEDADDIIAD LEQALE