Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1840 |
Symbol | |
ID | 4809386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2184775 |
End bp | 2185710 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107254 |
Product | cysteine synthase |
Protein accession | YP_001038254 |
Protein GI | 125974344 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000292185 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAA TAGCTAAGAA TCTGACGGAA CTCATAGGAA ATACCCCGCT TTTGGAGTTG AGCAATTATA ACAGAGCAAA CAATTTGGAA GCTGTCCTGA TAGCAAAGCT CGAATACTTC AATCCTGCAT CCAGTGTAAA GGACAGGATT GGTTATGCAA TGATAAAGGA CGCAGAAGAA AAAGGAATAA TAAACAAAGA TACGGTTATT ATAGAGCCCA CAAGCGGAAA TACAGGTATT GCCCTGGCTT TTGTGGCAGC TGCAAGAGGA TACAGGGTTA TACTTACAAT GCCAGAGACC ATGAGTATTG AAAGAAGGAA TCTTTTAAAG GCTTTGGGTG CCGAGTTGGT GCTGACACCG GGAGCCGACG GAATGGGAGG AGCGATCAGA AAGGCTGAGG AGCTTGCCCG TGAAATACCC AACTCCTTTA TCCCTCAACA GTTCTCCAAT CCTGCAAATC CGGAGATTCA CAGAAGGACC ACGGCAGAGG AAATCTGGAG AGACACCGAC GGACAGGTGG ATATATTTGT GGCGGGAGTT GGAACAGGAG GAACAATTTC CGGTGTCGGT GAAGTGTTAA AGCAGCGCAA GCCGGATGTA AAGATTGTTG CGGTGGAGCC TTTTGATTCA CCGGTTCTGT CCGGAGGAAC CAAAGGTCCT CACAAGATAC AGGGAATAGG TGCCGGTTTT GTGCCGGATA ATTTCAACCG CGCAGTGGTG GATGAAATAT TCAAGGTTAA AAATGAAGAG GCCTTTGAAA CATCCAGAAA GCTTGCAAGA ACGGAAGGTC TTTTGGTGGG AATATCCTCG GGAGCTGCAG CTTTTGCAGC CACACAGATT GCAAAAAGGC CTGAAAACAA AGGAAAGAAC ATTGTGGTTC TGCTTCCCGA TACAGGAGAG AGATATTTGT CCACGGCATT ATTCCAGGAT GCATAG
|
Protein sequence | MAKIAKNLTE LIGNTPLLEL SNYNRANNLE AVLIAKLEYF NPASSVKDRI GYAMIKDAEE KGIINKDTVI IEPTSGNTGI ALAFVAAARG YRVILTMPET MSIERRNLLK ALGAELVLTP GADGMGGAIR KAEELAREIP NSFIPQQFSN PANPEIHRRT TAEEIWRDTD GQVDIFVAGV GTGGTISGVG EVLKQRKPDV KIVAVEPFDS PVLSGGTKGP HKIQGIGAGF VPDNFNRAVV DEIFKVKNEE AFETSRKLAR TEGLLVGISS GAAAFAATQI AKRPENKGKN IVVLLPDTGE RYLSTALFQD A
|
| |