Gene Cthe_2947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2947 
Symbol 
ID4810835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3463959 
End bp3465677 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content45% 
IMG OID640108370 
Productprolyl-tRNA synthetase 
Protein accessionYP_001039338 
Protein GI125975428 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000162722 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTTT CCAATATGTT TTTTCAAACA CTGAGGGAAG TTCCGGCGGA AGCTGAAATA 
GCAAGTCATC AGCTTATGCT GAGAGCCGGA CTTATGAGAA AGCTGGCATC GGGAATTTAT
TCCTTCTTAC CTTTGGGTTA CAGGGTTTTT AGAAAGATTG AGCAGATTGT AAGGGAAGAG
ATGGACAGGG CTGGCGCCCA GGAATTGATA ATGTCGGCGC TTCTTCCCGC CGAATCGTAC
CAGGCATCGG GACGATGGGA AGTATTCGGG GCGGAAATGT TCAGGCTCAA AGACAGAAAC
GGAAGGGATT TTTGTCTTGG ACCAACCCAT GAAGAAATAT TTACCGAAAC GGTAAAAAGT
GTTACAAGGT CGTACAGGTC TCTTCCCCTT ATTCTCTACC AGATTCAGAC AAAGTACAGG
GATGAGAGAA GGCCAAGATT TGGTGTTATG AGATCGAGAG AGTTCGTGAT GAAGGACGCA
TACAGTTTTG ACAGGGACGA GGCGGGCCTT GATATATCCT ACAAGAAGAT GTACGATGCA
TACTGCAGGA TATTTGACCG TTTGGGACTG GACTACATCA TTGTGGATGC GGATACCGGA
GCAATGGGAG GTTCAGACTC ACAGGAGTTT ATGGTGAAAT CGGCAGTAGG TGAATCACGC
ATTGCATATT GTGAAGCCTG CGGTTATGCG GCAAATGATG AAAAAGCCGA GTGTGTACCT
GAAAAATGCT GCGATGACAA AGAATGCTGT GGGGAACTTG GACTGGAAAA AGTTGCAACT
CCGGACGTGC GGACCATTGA GGAGCTTATG CAGTTCTTCG GCTGCTCTGC AAAGGAATTT
GCAAAGACCC TTATATATAA AGCGGATGAT AAAGTCGTTG CGGCCATGGT AAGAGGAGAC
AGAGAGCTGA ATGAGACAAA GCTTCAGAAT CTCCTGGGCT GCATAGAGCT TGAAATGGCG
GATGCTGAAA CGGTGGAGAA GGTGACAGGT GCGGCTGTAG GCTTTGCAGG TCCCATAGGC
CTTGATATTG ATATTGTGGT TGACCTTGAA GTTGCAGAAA TGAAGAACTT TGTGGTGGGA
GCAAATGAGA CGGGTTTCCA CTACAAGAAT GTCAATATAA ACAGGGATTT TAAACCCAAA
TACGTGAAAG ACATAAGGAC TATCAAAGAA GGGGATGCAT GCCCCAAATG CGGAGCTCCT
GTAAAGGTTG AATTCGGAAT TGAAGTTGGG CACATATTCA AGCTTGGAAC CAAGTATTCG
GAAGCTTTAG ACTGCATATA TCTTGATGAA ACCGGCAAAG AAAGACCTAT GATTATGGGA
TGCTACGGTA TAGGAATAAA CAGGAGCATG GCCGCCGTAA TTGAACAGAA CAACGACGAA
AACGGAATAA TCTGGCCTAT ATCCATTGCA CCATATCATG TAATTGTAAT ACCGGTAAAT
ACCACCGACA GTGTTCAGAT GGAGCTGGCC GAAAAGATAT ATACCCAGCT GGGAGAAATG
GGCATTGAGG TACTGCTGGA TGACAGGGAC GAACGGCCGG GAGTCAAGTT CAAGGATGCC
GACCTTATTG GTATTCCGAT AAGGATAACT GTAGGAAAAA GAGCAGGAGA AGGCATTGTT
GAATATAAGC TGAGGCGTGA AAAGGATTTT GCTGCAATTC CTTATGAGGA AGCAATTGCA
AAAGCTAAAA AGGAAGTGGC CGAAGGCCTT AAAAAATAA
 
Protein sequence
MRVSNMFFQT LREVPAEAEI ASHQLMLRAG LMRKLASGIY SFLPLGYRVF RKIEQIVREE 
MDRAGAQELI MSALLPAESY QASGRWEVFG AEMFRLKDRN GRDFCLGPTH EEIFTETVKS
VTRSYRSLPL ILYQIQTKYR DERRPRFGVM RSREFVMKDA YSFDRDEAGL DISYKKMYDA
YCRIFDRLGL DYIIVDADTG AMGGSDSQEF MVKSAVGESR IAYCEACGYA ANDEKAECVP
EKCCDDKECC GELGLEKVAT PDVRTIEELM QFFGCSAKEF AKTLIYKADD KVVAAMVRGD
RELNETKLQN LLGCIELEMA DAETVEKVTG AAVGFAGPIG LDIDIVVDLE VAEMKNFVVG
ANETGFHYKN VNINRDFKPK YVKDIRTIKE GDACPKCGAP VKVEFGIEVG HIFKLGTKYS
EALDCIYLDE TGKERPMIMG CYGIGINRSM AAVIEQNNDE NGIIWPISIA PYHVIVIPVN
TTDSVQMELA EKIYTQLGEM GIEVLLDDRD ERPGVKFKDA DLIGIPIRIT VGKRAGEGIV
EYKLRREKDF AAIPYEEAIA KAKKEVAEGL KK