Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0817 |
Symbol | |
ID | 4601988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 770405 |
End bp | 771853 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773594 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_920221 |
Protein GI | 119719726 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00408] prolyl-tRNA synthetase, family I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGC AGGGGATCAC TGTCAGCAAG TCCGAGGACT TCTCGGAGTG GTACTCGCAG GTACTGAGTA AGGCGGGGCT TGTCGATCTA CGCTACAACG TCCAAGGCTT CGTTGTCCAC AAGCCCTGGC TTATGCGCAT CATAAAGGCT ATTTACCGCT TCTTCGAGGA GGAGCTGGAG AAGACCGGGC ACGAGCCTGT CCTCTTCCCC CTCGTGATAC CGGAGGAGAA CTTCGAGAAG GAGAAGGAGC ACGTCGAGGG TTTCAAGCCG GAGGTGTTCT GGGTTACCCA AGCGGGCGAC GAGAAGCTTG AACGGAGGCT GGCGCTTAGG CCGACAAGCG AGACCGCTTT CTACTACATG TACTCCTACT GGATACAGAG CTGGCGCGAC CTACCCCTCA AACTCTATCA GAGCGTTAGC GTGTACCGCA ACGAGAAGAA TACGAGGCCG TTGATACGGG GGAGGGAGTT CCTCTGGATA GAGGCCCACG ACGCGTTCGC AACGCACGAG GAGGCGCTTA ACCAGATACG CGAGGACATG GAGAACTCGA GGAAGGTTAT CTGGGAGAAG CTCGGGATCC CGTTTTTGTT CCTGAGGAGG CCTCCCTGGG ACAAGTTCAG CGGTGCTGAA GACACGTACG CCGCCGACAC TATAATGCCG GATGGAAGGG TGCTTCAGAT ATCCTCGACG CATGACCTCG GCCAGAGGTT CGCGAAGGCT TTCAACGTTA CCTTCCTGGA CAAGGACGGC AAGAGGAAGT ACGTTTGGCA GACGTGCTAC GGCCCCGGCA TCTGGAGGAT AACCGCCGCA CTCATAGCGA TACACGGCGA CGACAAGGGG CTCGTACTCC CGATGAACGT GGCCCCGATA CAGGTCGTCA TCGTCCCCAT ATACTACAAG GAGTCCGACA AGGAGAGGGT ACTCGAGAAA TGCAGGAAGC TGGAGGCGAT GATAAGGGAG GCTGGCTACA GGGTCTACCT GGACGCCAGG GAGGAGTACA CGCCGGGCTG GAAGTTCAAC GACTGGGAGT TGAAGGGGGT TCCCGTGCGC CTAGAGGTGG GGGTACGGGA GGTCGAAACA GGGACGGTCA CGGTGTTTAG GAGGGATTTG AGGGTGAAGG AGAAGGTCGC GGACAGCGAG CTGATAAGCC ACATCCGCAA GCTCGAGAAC GACATTCTCG AGGAGCTGAA GAGGAGGGCG AAGGAGTTCT TCGAAAGCAG GATAGTCACG GCGACGCGCA GAGAGGAAGT CGAGGAGGCG TTACGCTCGG GAAAGATGGT CAAAATGCCT TTCTGTGGAC GGGAGGAGTG CGCAGACGAC TTGAAGGAAG CTACCGACGG CGGGAAGGTC AGGGGTACCG AGATAGACTT CAAGGAGGGA GACTACGGGC GTTGCGCCTG GTGCGGAGCC CCCGCGCGCC TAATAGTATA CGTCGCTAAG TCGTACTAG
|
Protein sequence | MSEQGITVSK SEDFSEWYSQ VLSKAGLVDL RYNVQGFVVH KPWLMRIIKA IYRFFEEELE KTGHEPVLFP LVIPEENFEK EKEHVEGFKP EVFWVTQAGD EKLERRLALR PTSETAFYYM YSYWIQSWRD LPLKLYQSVS VYRNEKNTRP LIRGREFLWI EAHDAFATHE EALNQIREDM ENSRKVIWEK LGIPFLFLRR PPWDKFSGAE DTYAADTIMP DGRVLQISST HDLGQRFAKA FNVTFLDKDG KRKYVWQTCY GPGIWRITAA LIAIHGDDKG LVLPMNVAPI QVVIVPIYYK ESDKERVLEK CRKLEAMIRE AGYRVYLDAR EEYTPGWKFN DWELKGVPVR LEVGVREVET GTVTVFRRDL RVKEKVADSE LISHIRKLEN DILEELKRRA KEFFESRIVT ATRREEVEEA LRSGKMVKMP FCGREECADD LKEATDGGKV RGTEIDFKEG DYGRCAWCGA PARLIVYVAK SY
|
| |