Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2799 |
Symbol | |
ID | 4810116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3298648 |
End bp | 3299808 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108219 |
Product | cystathionine gamma-synthase |
Protein accession | YP_001039191 |
Protein GI | 125975281 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000029873 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAG TTGGGAATGT GTCAAACTAT AGTATAAGTA CAAAAGTGGT ACATGGTTCA AAGTGTTATG ACCCGCATAC CGGGGCGGTA AGTTTCCCCA TATATCAAAG TGCTACTTTC AGACATCCGG CGCTCTATCA GACAACGGGT TATGATTATT CACGCTTGCA GAATCCGACA AGGGAAGAAC TTGAAAACAC CATTGCAAAT ATCGAAAACG GGAAGTTTGG ATTTGCCTTT TCCAGCGGCA TGGCGGCAGT ATCCACCATA CTGTCTCTTT TTTCACCCAA AGACCATATC ATTGTTTCCG ATGACCTTTA TGGTGGTACT TACAGACTGT TTGAGGAAAT ATACAAAAAA TACGGTTTGG AATTTTCCTA TGTCAACACA AGCAGGATTC AGGACATAGA AGAAGCTGTG AAAGAGAACA CAAAGGCGTT TTTTATTGAG ACCCCCACAA ACCCGATGAT GAAGGTGGCC GATTTAAAGA CGATATCGCG GTTTGCAAAA GACAGGAAAA TACTTTTGAT TGTGGACAAT ACTTTTCTTA CACCGTATTT TCAGAGGCCC TTGGAGCTGG GGGCGGATAT TGTGGTTCAC AGCGGAACGA AATATCTCGG GGGACATAAC GATACTTTGG CGGGTCTTGT TGTAGTTAAT GATGAAGAGC TTGCCGAAAG GATAAAACTT ATTCAAAAAT CGGAAGGGGC CGTACTGTCT CCTTTTGACA GCTGGCTGAT TTTAAGAGGT ATAAAGACGC TGGGGGTACG CCTTGAAAAG CAGCAGGAAA ATGCCATGAA AATTGCAAAA TGGCTTTGTA CCCATAAAAA TGTCACAAAG GTCAACTATG TGGGATTGCC CGACCATGAA GGCTATGAAA TTTCGAAATC CCAGGCTTCC GGTTTTGGAG CCATGATTTC CTTTAACGTA AAAGACGTTC AGACTGTGGA AAAGGTTTTA AGCAAGGTGC AGCTTGTAAT GTTTGCTGAG AGCCTCGGCG GTGTGGAAAG CTTGATTACC TATCCTGCCG TTCAGACCCA TGCTGCCATA CCGGAAGAAA TGAGAAATAG AATCGGGGTT ACCGATACGC TTTTAAGGCT TTCGGTGGGA ATTGAGGATG CAGACGATAT AATTGCCGAC CTTGAGCAGG CCTTGGAATA G
|
Protein sequence | MMKVGNVSNY SISTKVVHGS KCYDPHTGAV SFPIYQSATF RHPALYQTTG YDYSRLQNPT REELENTIAN IENGKFGFAF SSGMAAVSTI LSLFSPKDHI IVSDDLYGGT YRLFEEIYKK YGLEFSYVNT SRIQDIEEAV KENTKAFFIE TPTNPMMKVA DLKTISRFAK DRKILLIVDN TFLTPYFQRP LELGADIVVH SGTKYLGGHN DTLAGLVVVN DEELAERIKL IQKSEGAVLS PFDSWLILRG IKTLGVRLEK QQENAMKIAK WLCTHKNVTK VNYVGLPDHE GYEISKSQAS GFGAMISFNV KDVQTVEKVL SKVQLVMFAE SLGGVESLIT YPAVQTHAAI PEEMRNRIGV TDTLLRLSVG IEDADDIIAD LEQALE
|
| |