Gene Information |
Locus tag | Cthe_1543 |
Symbol | |
ID | 4810050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1870769 |
End bp | 1872514 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640106962 |
Product | aspartyl-tRNA synthetase |
Protein accession | YP_001037963 |
Protein GI | 125974053 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0173] Aspartyl-tRNA synthetase |
TIGRFAM ID | [TIGR00459] aspartyl-tRNA synthetase, bacterial type |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00178145 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGAGA TTTATCGTAC CAAGACAATG AATCAAATCA CTGAAGAAGA TATTGGCAGC GTCCTTAAAA TTGCCGGTTG GGTGGAGAAC ATCCGGGACC ACGGCGGCGT GTCCTTTATT GACCTTAGGG ACATGTATGG TGTCATGCAG GTGGTATTCA GAAATATCGA GTTTCCAAAA GGTATCAGAA AAGAAGAGTG CTTATCCATT GAAGGTGTGA TTGAAAAACG GGATGAAGAA ACCTACAACC CCAAGATTCC CACAGGTACC ATTGAGCTTG TGGCGCAAAA AGTGACGGTT TTGGGCCGGG TCTACAGGGA TCTGCCCTTT GAAGTTGCCA CTTCCAAGGA AATTCGTGAG GAAGTGCGCC TAAAGTACCG CTACCTTGAT CTGAGAAACC GGAAGGTAAA GGATAATATC CTGTTCAGAT CCCAGGTCAT CAGCTTTTTA AGGCAGAAGA TGACCGAGAT GGGCTTTGTC GAAATCCAGA CGCCGATTTT GTCGGCATCC TCTCCCGAAG GAGCACGGGA TTATATTGTT CCATCCAGAA AACATAAAGG AAAATTCTAT GCACTGCCGC AGGCCCCACA GATATATAAG CAGCTTTTGA TGGTCTCGGG CTTTGACAAG TATTTTCAGA TCGCCCCATG CTTCCGTGAC GAGGATGCGC GGGCCGATCG TTCTCCGGGC GAGTTTTATC AGCTCGACTT TGAAATGAGC TTTGCCACCC AGGAAGACGT TTTCCGCGTG GGTGAGGAGG TGCTGACGGC AACCTTTAAA AAGTTCGCAC CCGAAGGGTA TGAAATCACC GAGGCGCCGT TCCCAGTTTT CAGCTACAAG CAGGCCATGC TGGAATTCGG CACCGACAAG CCGGACCTTA GAAACCCCCT GCGCATCATC GATGTGACCG ATTTCTTCCA GAGATGCGCC TTCAAGCCAT TCCACAACAA GACGGTCAGA GCCATCAAGG TTCATGCCGA TATGTCCAAG GGCTTCCATG AAAAACTGCT GGAATATGCA CTGTCCATCG GCATGGGCGG GCTTGGCTAC CTTGAGGTTC AGGAGGATAT GACCTATAAA GGCCCCATCG ACAAATACAT CCCCCAGGAA CTAAAAGGGG AGCTGGCCCA AATTTCCGGC CTTATTCCCG GCGATGTCAT ATTCTTCATT GCCGACAAGG AAAAGCTTGC CTGCAAATAC GCAGGACTAA TCCGCAATGA GCTGGGCAAG CGCCTTGACA TCTGCGAGAA GAATGCTTAC CGCTTCTGTT ATGTCAATGA TTTTCCAATG TATGAAATGG ATGAAGAGAC CGGTAAAATT GAATTTACCC ATAACCCCTT CTCCATGCCG CAGGGTGGGC TGGAGGCGCT TAATACCAAG GGTCCTCTGG ATATTCTGGC TTACCAGTAT GATATTGTCT GCAACGGTGT TGAACTTTCC TCGGGTGCTG TCCGGAACCA TGACCTGGAT ATCATGGTAA AGGCCTTTGA GATTGCGGGT TACGATGAGG AGACTTTGAA AAAGAAGTTC GGAGCGCTCT ATAATGCATT TCAGTACGGA GCGCCGCCTC ACGCTGGCAT GGCTCCGGGG ATTGAACGGA TGATTATGCT TCTTAGGAAT GAGGAAAACA TCCGCGAGGT TGTGGCTTTC CCCATGAACA GCAACGCACA GGATCTGTTG TGCGGCGCAC CCGGCGAAGT TACCGAACAG CAACTGCGTG AAGTGCATAT TAAAATCAGG AATTAA
|
Protein sequence | MAEIYRTKTM NQITEEDIGS VLKIAGWVEN IRDHGGVSFI DLRDMYGVMQ VVFRNIEFPK GIRKEECLSI EGVIEKRDEE TYNPKIPTGT IELVAQKVTV LGRVYRDLPF EVATSKEIRE EVRLKYRYLD LRNRKVKDNI LFRSQVISFL RQKMTEMGFV EIQTPILSAS SPEGARDYIV PSRKHKGKFY ALPQAPQIYK QLLMVSGFDK YFQIAPCFRD EDARADRSPG EFYQLDFEMS FATQEDVFRV GEEVLTATFK KFAPEGYEIT EAPFPVFSYK QAMLEFGTDK PDLRNPLRII DVTDFFQRCA FKPFHNKTVR AIKVHADMSK GFHEKLLEYA LSIGMGGLGY LEVQEDMTYK GPIDKYIPQE LKGELAQISG LIPGDVIFFI ADKEKLACKY AGLIRNELGK RLDICEKNAY RFCYVNDFPM YEMDEETGKI EFTHNPFSMP QGGLEALNTK GPLDILAYQY DIVCNGVELS SGAVRNHDLD IMVKAFEIAG YDEETLKKKF GALYNAFQYG APPHAGMAPG IERMIMLLRN EENIREVVAF PMNSNAQDLL CGAPGEVTEQ QLREVHIKIR N
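
The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon, which is consistent with the gene sequence above ending in TAA. The helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final codon
    is a stop codon that is not counted in the protein length.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1870769, End bp 1872514.
length_bp = gene_length(1870769, 1872514)
print(length_bp)                  # 1746, matching "Gene Length"
print(protein_length(length_bp))  # 581, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length (1746 bp) and Protein Length (581 aa) fields of the record.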