Gene Cthe_1331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1331 
SymbolaspS 
ID4809471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1619327 
End bp1621114 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content45% 
IMG OID640106755 
Productaspartyl-tRNA synthetase 
Protein accessionYP_001037756 
Protein GI125973846 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0173] Aspartyl-tRNA synthetase 
TIGRFAM ID[TIGR00459] aspartyl-tRNA synthetase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.175841 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGAAT CTATTTATGG ATTGAAAAGG ACGCATATGT GCGCCGAACT TACCGTGAAT 
GATGTCGGAA AGACCGTTAC GGTTATGGGA TGGTGCCACA AAAGCAGGAA TCTTGGTGGG
TTAATTTTTG TTACATTAAG AGACAGAACC GGTATAATCC AGGTAGTGTT TGACAACACG
GTAAATTCCG AACTTTTTGC AAAAGCTGAG GGAATCAGAG GCGAATACGT GCTTGCCGTG
GTGGGAGAAG TGGTGAAAAG AAGCCCCGAG GCAATAAATC CCAAACTTCC CACAGGAGAA
ATTGAGATTA TTGCAAAGGA ATTGAGAATT CTCAGTACTG CCGAGTCACC ACCCATATAT
ATTGAAGAGG ATTCCGATGT AAACGAGGCC ACAAGGCTTA AGTACAGATA TCTTGATTTG
AGAAGACCTG ACATGCAGAG AAACCTGATG TTAAGGCACA GGGTGGCAAA GATTGCCAGA
GACTATTTTG ACGAGCACGG TTTTATTGAA ATAGAGACTC CGATGCTTAC CAAGAGCACT
CCCGAGGGGG CAAGAGACTA TCTTGTGCCA AGCAGGGTAC ATCCGGGGAA ATTTTTTGCG
CTGCCCCAGT CACCCCAGCT TTTCAAGCAG CTTTTGATGG TTGCAGGTTT TGACCGGTAT
ATGCAGATTG TAAAATGCTT CAGGGATGAG GACCTTAGAG CCGACAGGCA GCCTGAGTTT
ACACAGATAG ATTTGGAAAT GTCATTTGTA AATGTTGAGG ATGTGCTTAC CATAAATGAA
GGTTTTATAA AAAGGGTATT CAAGGAGGCT ATTAATGTCG ACCTTGAGAT ACCTTTCATA
AGGATGCCGT ATAAAGAGGC CATGGAGAGA TTTGGGACCG ACAAACCGGA TATAAGGTTT
GGATTTGAAC TGGTTAACCT GTCAGACCTT GTGGAAAACT GTGGCTTTAA GGTATTTTCC
GATGCCGTCA AAAACGGAGG AAGCGTTCGG GCGATAAATG CCAAAGGATG TGGAAATAAA
TTCAGCAGAA AGGAAATAGA TGCCCTTGGT GAATTCGTAA AAACCTATGG TGCAAAGGGA
ATGGCCTGGA TAGTTGTGGG AGAAAACGAG CATAAATCCC CGATTACCAA ATTCTTTACC
GAGGACGAAA TCAAGGCCGT TTTGACAAGA ATGCGGGCAG AACCCGGAGA CCTCATATGC
TTTATTGCCG ACAAAAATGA GGTTGTGTTC GATTCACTGG GACAGCTGAG AGTGGAAATA
GCAAGAAAGC TGGGATTGCT TGACAACAAG GAATTTAAAT TCCTGTGGGT GACCGAGTTC
CCGCTCCTTG AATATGACGA GGAGGAAAAA CGCTATGTGG CAAAACACCA TCCTTTTACG
TCTCCGATGG ATGAGGATGT TGAATTGCTG GATACCGACC CGCTGAAAGT TAGGGCAAAA
GCTTATGACA TCGTGCTAAA CGGTACGGAA ATCGGAGGAG GAAGCATCAG AATTCACAGT
CAGGAGCTTC AGTCGAAAAT GTTCAAACTT CTTGGCTTTA GTGAGAAAGA TGCCTGGGAC
AGGTTCGGAT TCCTTCTTGA GGCTTTCAAA TACGGAACGC CTCCCCACGG CGGAATGGCA
TTCGGACTCG ACAGATTGGT AATGCTTATG GCCGGAAGAA ACAGCATCAG GGATGTTATT
GCATTCCCCA AAGTACAGAA TTCATCATGT CTTATGACAA ATGCGCCGGA TGAGGTTGAG
CCAAAACAGC TTGAGGAGCT TAAAATAAGG GTGGATTTGC AAAATTGA
 
Protein sequence
MGESIYGLKR THMCAELTVN DVGKTVTVMG WCHKSRNLGG LIFVTLRDRT GIIQVVFDNT 
VNSELFAKAE GIRGEYVLAV VGEVVKRSPE AINPKLPTGE IEIIAKELRI LSTAESPPIY
IEEDSDVNEA TRLKYRYLDL RRPDMQRNLM LRHRVAKIAR DYFDEHGFIE IETPMLTKST
PEGARDYLVP SRVHPGKFFA LPQSPQLFKQ LLMVAGFDRY MQIVKCFRDE DLRADRQPEF
TQIDLEMSFV NVEDVLTINE GFIKRVFKEA INVDLEIPFI RMPYKEAMER FGTDKPDIRF
GFELVNLSDL VENCGFKVFS DAVKNGGSVR AINAKGCGNK FSRKEIDALG EFVKTYGAKG
MAWIVVGENE HKSPITKFFT EDEIKAVLTR MRAEPGDLIC FIADKNEVVF DSLGQLRVEI
ARKLGLLDNK EFKFLWVTEF PLLEYDEEEK RYVAKHHPFT SPMDEDVELL DTDPLKVRAK
AYDIVLNGTE IGGGSIRIHS QELQSKMFKL LGFSEKDAWD RFGFLLEAFK YGTPPHGGMA
FGLDRLVMLM AGRNSIRDVI AFPKVQNSSC LMTNAPDEVE PKQLEELKIR VDLQN