Gene Cthe_1543 details


Gene Information

Locus tag: Cthe_1543
Symbol:
ID: 4810050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1870769
End bp: 1872514
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 49%
IMG OID: 640106962
Product: aspartyl-tRNA synthetase
Protein accession: YP_001037963
Protein GI: 125974053
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0173] Aspartyl-tRNA synthetase
TIGRFAM ID: [TIGR00459] aspartyl-tRNA synthetase, bacterial type
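
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above; function names are illustrative.
START_BP = 1870769
END_BP = 1872514


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1746 581
```

Under those assumptions the result agrees with the listed gene length (1746 bp) and protein length (581 aa).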


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00178145
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGAGA TTTATCGTAC CAAGACAATG AATCAAATCA CTGAAGAAGA TATTGGCAGC 
GTCCTTAAAA TTGCCGGTTG GGTGGAGAAC ATCCGGGACC ACGGCGGCGT GTCCTTTATT
GACCTTAGGG ACATGTATGG TGTCATGCAG GTGGTATTCA GAAATATCGA GTTTCCAAAA
GGTATCAGAA AAGAAGAGTG CTTATCCATT GAAGGTGTGA TTGAAAAACG GGATGAAGAA
ACCTACAACC CCAAGATTCC CACAGGTACC ATTGAGCTTG TGGCGCAAAA AGTGACGGTT
TTGGGCCGGG TCTACAGGGA TCTGCCCTTT GAAGTTGCCA CTTCCAAGGA AATTCGTGAG
GAAGTGCGCC TAAAGTACCG CTACCTTGAT CTGAGAAACC GGAAGGTAAA GGATAATATC
CTGTTCAGAT CCCAGGTCAT CAGCTTTTTA AGGCAGAAGA TGACCGAGAT GGGCTTTGTC
GAAATCCAGA CGCCGATTTT GTCGGCATCC TCTCCCGAAG GAGCACGGGA TTATATTGTT
CCATCCAGAA AACATAAAGG AAAATTCTAT GCACTGCCGC AGGCCCCACA GATATATAAG
CAGCTTTTGA TGGTCTCGGG CTTTGACAAG TATTTTCAGA TCGCCCCATG CTTCCGTGAC
GAGGATGCGC GGGCCGATCG TTCTCCGGGC GAGTTTTATC AGCTCGACTT TGAAATGAGC
TTTGCCACCC AGGAAGACGT TTTCCGCGTG GGTGAGGAGG TGCTGACGGC AACCTTTAAA
AAGTTCGCAC CCGAAGGGTA TGAAATCACC GAGGCGCCGT TCCCAGTTTT CAGCTACAAG
CAGGCCATGC TGGAATTCGG CACCGACAAG CCGGACCTTA GAAACCCCCT GCGCATCATC
GATGTGACCG ATTTCTTCCA GAGATGCGCC TTCAAGCCAT TCCACAACAA GACGGTCAGA
GCCATCAAGG TTCATGCCGA TATGTCCAAG GGCTTCCATG AAAAACTGCT GGAATATGCA
CTGTCCATCG GCATGGGCGG GCTTGGCTAC CTTGAGGTTC AGGAGGATAT GACCTATAAA
GGCCCCATCG ACAAATACAT CCCCCAGGAA CTAAAAGGGG AGCTGGCCCA AATTTCCGGC
CTTATTCCCG GCGATGTCAT ATTCTTCATT GCCGACAAGG AAAAGCTTGC CTGCAAATAC
GCAGGACTAA TCCGCAATGA GCTGGGCAAG CGCCTTGACA TCTGCGAGAA GAATGCTTAC
CGCTTCTGTT ATGTCAATGA TTTTCCAATG TATGAAATGG ATGAAGAGAC CGGTAAAATT
GAATTTACCC ATAACCCCTT CTCCATGCCG CAGGGTGGGC TGGAGGCGCT TAATACCAAG
GGTCCTCTGG ATATTCTGGC TTACCAGTAT GATATTGTCT GCAACGGTGT TGAACTTTCC
TCGGGTGCTG TCCGGAACCA TGACCTGGAT ATCATGGTAA AGGCCTTTGA GATTGCGGGT
TACGATGAGG AGACTTTGAA AAAGAAGTTC GGAGCGCTCT ATAATGCATT TCAGTACGGA
GCGCCGCCTC ACGCTGGCAT GGCTCCGGGG ATTGAACGGA TGATTATGCT TCTTAGGAAT
GAGGAAAACA TCCGCGAGGT TGTGGCTTTC CCCATGAACA GCAACGCACA GGATCTGTTG
TGCGGCGCAC CCGGCGAAGT TACCGAACAG CAACTGCGTG AAGTGCATAT TAAAATCAGG
AATTAA
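
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how a GC percentage like the one in the record could be recomputed; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the database may compute or round its figure differently.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above: six blocks of ten bases.
first_line = "ATGGCGGAGA TTTATCGTAC CAAGACAATG AATCAAATCA CTGAAGAAGA TATTGGCAGC"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content in the record.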
 
Protein sequence
MAEIYRTKTM NQITEEDIGS VLKIAGWVEN IRDHGGVSFI DLRDMYGVMQ VVFRNIEFPK 
GIRKEECLSI EGVIEKRDEE TYNPKIPTGT IELVAQKVTV LGRVYRDLPF EVATSKEIRE
EVRLKYRYLD LRNRKVKDNI LFRSQVISFL RQKMTEMGFV EIQTPILSAS SPEGARDYIV
PSRKHKGKFY ALPQAPQIYK QLLMVSGFDK YFQIAPCFRD EDARADRSPG EFYQLDFEMS
FATQEDVFRV GEEVLTATFK KFAPEGYEIT EAPFPVFSYK QAMLEFGTDK PDLRNPLRII
DVTDFFQRCA FKPFHNKTVR AIKVHADMSK GFHEKLLEYA LSIGMGGLGY LEVQEDMTYK
GPIDKYIPQE LKGELAQISG LIPGDVIFFI ADKEKLACKY AGLIRNELGK RLDICEKNAY
RFCYVNDFPM YEMDEETGKI EFTHNPFSMP QGGLEALNTK GPLDILAYQY DIVCNGVELS
SGAVRNHDLD IMVKAFEIAG YDEETLKKKF GALYNAFQYG APPHAGMAPG IERMIMLLRN
EENIREVVAF PMNSNAQDLL CGAPGEVTEQ QLREVHIKIR N
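
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a rough sketch, the code below builds the codon table from the conventional TCAG ordering and translates until the first stop codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ in permitted start codons); the function name `translate` is hypothetical and the code does no validation of ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGCGGAGA TTTATCGTAC CAAGACAATG"))  # MAEIYRTKTM
```

The output matches the first ten residues of the protein sequence above.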