Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2947 |
Symbol | |
ID | 4810835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3463959 |
End bp | 3465677 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108370 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_001039338 |
Protein GI | 125975428 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000162722 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGTTT CCAATATGTT TTTTCAAACA CTGAGGGAAG TTCCGGCGGA AGCTGAAATA GCAAGTCATC AGCTTATGCT GAGAGCCGGA CTTATGAGAA AGCTGGCATC GGGAATTTAT TCCTTCTTAC CTTTGGGTTA CAGGGTTTTT AGAAAGATTG AGCAGATTGT AAGGGAAGAG ATGGACAGGG CTGGCGCCCA GGAATTGATA ATGTCGGCGC TTCTTCCCGC CGAATCGTAC CAGGCATCGG GACGATGGGA AGTATTCGGG GCGGAAATGT TCAGGCTCAA AGACAGAAAC GGAAGGGATT TTTGTCTTGG ACCAACCCAT GAAGAAATAT TTACCGAAAC GGTAAAAAGT GTTACAAGGT CGTACAGGTC TCTTCCCCTT ATTCTCTACC AGATTCAGAC AAAGTACAGG GATGAGAGAA GGCCAAGATT TGGTGTTATG AGATCGAGAG AGTTCGTGAT GAAGGACGCA TACAGTTTTG ACAGGGACGA GGCGGGCCTT GATATATCCT ACAAGAAGAT GTACGATGCA TACTGCAGGA TATTTGACCG TTTGGGACTG GACTACATCA TTGTGGATGC GGATACCGGA GCAATGGGAG GTTCAGACTC ACAGGAGTTT ATGGTGAAAT CGGCAGTAGG TGAATCACGC ATTGCATATT GTGAAGCCTG CGGTTATGCG GCAAATGATG AAAAAGCCGA GTGTGTACCT GAAAAATGCT GCGATGACAA AGAATGCTGT GGGGAACTTG GACTGGAAAA AGTTGCAACT CCGGACGTGC GGACCATTGA GGAGCTTATG CAGTTCTTCG GCTGCTCTGC AAAGGAATTT GCAAAGACCC TTATATATAA AGCGGATGAT AAAGTCGTTG CGGCCATGGT AAGAGGAGAC AGAGAGCTGA ATGAGACAAA GCTTCAGAAT CTCCTGGGCT GCATAGAGCT TGAAATGGCG GATGCTGAAA CGGTGGAGAA GGTGACAGGT GCGGCTGTAG GCTTTGCAGG TCCCATAGGC CTTGATATTG ATATTGTGGT TGACCTTGAA GTTGCAGAAA TGAAGAACTT TGTGGTGGGA GCAAATGAGA CGGGTTTCCA CTACAAGAAT GTCAATATAA ACAGGGATTT TAAACCCAAA TACGTGAAAG ACATAAGGAC TATCAAAGAA GGGGATGCAT GCCCCAAATG CGGAGCTCCT GTAAAGGTTG AATTCGGAAT TGAAGTTGGG CACATATTCA AGCTTGGAAC CAAGTATTCG GAAGCTTTAG ACTGCATATA TCTTGATGAA ACCGGCAAAG AAAGACCTAT GATTATGGGA TGCTACGGTA TAGGAATAAA CAGGAGCATG GCCGCCGTAA TTGAACAGAA CAACGACGAA AACGGAATAA TCTGGCCTAT ATCCATTGCA CCATATCATG TAATTGTAAT ACCGGTAAAT ACCACCGACA GTGTTCAGAT GGAGCTGGCC GAAAAGATAT ATACCCAGCT GGGAGAAATG GGCATTGAGG TACTGCTGGA TGACAGGGAC GAACGGCCGG GAGTCAAGTT CAAGGATGCC GACCTTATTG GTATTCCGAT AAGGATAACT GTAGGAAAAA GAGCAGGAGA AGGCATTGTT GAATATAAGC TGAGGCGTGA AAAGGATTTT GCTGCAATTC CTTATGAGGA AGCAATTGCA AAAGCTAAAA AGGAAGTGGC CGAAGGCCTT AAAAAATAA
|
Protein sequence | MRVSNMFFQT LREVPAEAEI ASHQLMLRAG LMRKLASGIY SFLPLGYRVF RKIEQIVREE MDRAGAQELI MSALLPAESY QASGRWEVFG AEMFRLKDRN GRDFCLGPTH EEIFTETVKS VTRSYRSLPL ILYQIQTKYR DERRPRFGVM RSREFVMKDA YSFDRDEAGL DISYKKMYDA YCRIFDRLGL DYIIVDADTG AMGGSDSQEF MVKSAVGESR IAYCEACGYA ANDEKAECVP EKCCDDKECC GELGLEKVAT PDVRTIEELM QFFGCSAKEF AKTLIYKADD KVVAAMVRGD RELNETKLQN LLGCIELEMA DAETVEKVTG AAVGFAGPIG LDIDIVVDLE VAEMKNFVVG ANETGFHYKN VNINRDFKPK YVKDIRTIKE GDACPKCGAP VKVEFGIEVG HIFKLGTKYS EALDCIYLDE TGKERPMIMG CYGIGINRSM AAVIEQNNDE NGIIWPISIA PYHVIVIPVN TTDSVQMELA EKIYTQLGEM GIEVLLDDRD ERPGVKFKDA DLIGIPIRIT VGKRAGEGIV EYKLRREKDF AAIPYEEAIA KAKKEVAEGL KK
|
| |