Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2065 |
Symbol | |
ID | 4810663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2456920 |
End bp | 2458326 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107472 |
Product | cysteinyl-tRNA synthetase |
Protein accession | YP_001038465 |
Protein GI | 125974555 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0215] Cysteinyl-tRNA synthetase |
TIGRFAM ID | [TIGR00435] cysteinyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00052706 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACTTT ATAATACTTT GACCAGAAAA AAGGAAGAGT TTCACCCTAT TGATGAAAAA GAAGTGAAGA TGTATTCATG CGGCCCCACG GTTTACAACT ATTTTCACAT AGGAAACGCC CGTCCCTTCA TCATATTTGA CACTTTGAGA AGATACCTTG AATACAAGGG CTACAACGTC AAGTTTGTGC AAAACTTTAC CGACATAGAC GACAAAATGA TAAAAAGGGC AAATGAGGAA GGAATAACGG TAAAGGAACT GGCAGACAAG TTTATAAACG AGTATTTTGT GGATGCAAAA GGTCTTGGAA TAAAAGAAGC CACTGTGCAT CCGAGGGCCA CGGAAAACAT TGACGCAATA ATTGAGATGA TTAAAAAGCT CGAGGAAAAA GGTTTTGCCT ACAATGTGGA CGGGGATGTG TATTTCAGCG CCAGGAAATT TACAGAATAC GGGAAGCTTT CTCACCAGTC ATTGGAGGAT TTGGAGCTTG GCGCAAGAAT TGACGTGGAT GAAAGGAAAA AAGACCCCAT GGATTTTGCG TTGTGGAAGG CTCAAAAGCC GGGAGAGCCT GCATGGGACA GCCCGTGGGG AAAAGGAAGG CCGGGCTGGC ATATTGAATG CTCTGCCATG GCAAACAAGT ATCTTGGAGA GACAATAGAC ATCCACAGCG GAGGACAGGA TTTGGTGTTT CCCCACCACG AGAATGAGAT TGCCCAGAGT GAAGCCGCAA ACGGAAAGCC TTTTGCACGA TTTTGGCTTC ACAACGGTTT TATAAACGTG GACGGCGAAA AAATGGCAAA GTCCAAGGGT AATTTCTTTA CGGTAAGGGA TATTGCAAAA ACTTTTGACT ATGAAGTAAT AAGGTTTTTT ATGCTTTCCG CCCATTACAG AAGCCCCATA AATTTCAGCG CTGAGCTTTT GGAACAGGCT AAAAACGGGC TTGAGAGGAT ATACAACTGC CTTGACAATC TTGAGTATTT AAAAGAGCAT GCACAGGCTG AGAAAATAAC GGACAGTGAG AGGGAGCTTC AAAACAGACT CCTTGGGATT AAGGCAAAAT TCATTGAAGC CATGGACGAT GACATCAATA CGGCGGATGC CATTGCAGCC ATTTTTGATA TTGTAAAGGA GGTTAACACC AATATAAATG CAACCTCCAA TTCCTCAAAG GAAATTATTG ATTTTTCCCT TTCACTGATA AAGGAATTGG GAGGAGTTTT GGGAATTGCC CAAAAGAGCA GGCAGAAAGT TCTCGACAAA GAGATTGAGG AACTTATTGA AAGAAGGCAG AAGGCGAGAA AGGAAAAAGA TTGGAAGACG GCTGATGAAA TCCGAGACAA GCTCAAAGAA ATGGGAATAA TACTTGAGGA CACTCCGCAG GGAGTGAAAT GGACGATACA ACGGTAA
|
Protein sequence | MRLYNTLTRK KEEFHPIDEK EVKMYSCGPT VYNYFHIGNA RPFIIFDTLR RYLEYKGYNV KFVQNFTDID DKMIKRANEE GITVKELADK FINEYFVDAK GLGIKEATVH PRATENIDAI IEMIKKLEEK GFAYNVDGDV YFSARKFTEY GKLSHQSLED LELGARIDVD ERKKDPMDFA LWKAQKPGEP AWDSPWGKGR PGWHIECSAM ANKYLGETID IHSGGQDLVF PHHENEIAQS EAANGKPFAR FWLHNGFINV DGEKMAKSKG NFFTVRDIAK TFDYEVIRFF MLSAHYRSPI NFSAELLEQA KNGLERIYNC LDNLEYLKEH AQAEKITDSE RELQNRLLGI KAKFIEAMDD DINTADAIAA IFDIVKEVNT NINATSNSSK EIIDFSLSLI KELGGVLGIA QKSRQKVLDK EIEELIERRQ KARKEKDWKT ADEIRDKLKE MGIILEDTPQ GVKWTIQR
|
| |