Gene Information |
Locus tag | Tpen_0659 |
Symbol | |
ID | 4601617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 609834 |
End bp | 611129 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639773432 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_920064 |
Protein GI | 119719569 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
|
|
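The Start bp, End bp, Gene Length and Protein Length rows above are tied together by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the Start bp / End bp rows of this record.
print(cds_lengths(609834, 611129))  # (1296, 431)
```

The result agrees with the Gene Length (1296 bp) and Protein Length (431 aa) rows.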
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGAAGA ACCTCGTACT TCAACCTCCC CGGGGAACGA GGGACTGGTT GCCAGAAGAA GCTTACGCGA AGCGGATAGT GTCCGAGAAA ATAAGGGAGG TCTTCGAGAG CTATGGATAT GGAGAGGTGA TAACCCCTGC CTTCGAGTAC CTGGACTTGT TGAAGGCTAA AGCCGGGGAG GAAGTCGTCG AGCAGATATA CGCCTTCAAG GACAAGGCAG GCAGAGAGCT GGGACTAAGG TTCGAGATGA CCACGCCTAT CGCCAGGATA GTGGCCTCGC GGCTTGACCT GGCTAAGCCG CTACGTTTCT ACTACGTGCA ACCCGTATGG AGGTACGAGG AACCCCAGAG GGGAAGGTGG CGCGAGTTCT GGCAAGCTGG TATCGAGCTC TTTGGGATCT CTGAGCCGGA AGGCGACGCC GAGGTTGTCG CCGTAACGTT CGACGCTCTA AAGGCTGTAG GGCTCAAGGA CTTCGACATA CGGGTTAACG ATAGAAGGGT TGTCGAGGAT CTCGTGCTGG GAGCAGGGAT CCCCGGGGAT CTCTTGCCGA GCGCCTTAAG AGTGCTTGAC AAGATGGACA AGTTCGGGGA GGAGTACGTG GTATCCGAGC TGGCGAAGCT CGGGCTAAGG GAGGATGCCG CTACGTCTCT TCTCGAGAAG CTGAAGAGCG GTAGCCTCGA CATTGATACC TCGACGCAGC CCGGTAGGGA GGGGTTGAGG AGGCTAGCCC TCGTGGTGGA TACGCTCAAG AACTGTTACG GAATAAACGT TACAGTTGAC TACGCAATCG TAAGGGGACT CGGGTACTAC ACGGGTTTTG TGTTCGAGGT AAAAGCAGGC TCTTCGGAGG GGCTGGGGAG CATAGCCGGG GGCGGGAGAT ACGACGATCT CGTGAGCGTA GTGGGAGGCC CAAAGATTCC AGCCTCCGGT ATGGCTATAG GCGTAGAGAG GCTACTCGAA GCCTTATCAA TGCAGGGAGC ATTAAAGCTC GACTACAGGG AGGTGGACGT CTGCGTCATA CCTGTCAAGA AAACCCCGGA AATACTCTCG GAGGCCGTAG CGGTTGCGAG AGAGCTACGC GTAGCCGGTA TGAAGGTTGT CTTGGAGGTC TCTGAGAGGA GCCTATCCAA GCTTCTAGAA GCAGCCTCCA AGAGAGGAGC ACGCTTCGCG ATAATACTGG GAGAGAGGGA ACTAAAGGAG GGCGTAGTTA CTGTGCGTGA CTTATACTTG TGGAAGGAGG AGAAAGTTGC GCGTCCGCAC TTGTACGAGT ATATAAGAGC AGGCTCCTCG ACTTAG
|
Protein sequence | MSKNLVLQPP RGTRDWLPEE AYAKRIVSEK IREVFESYGY GEVITPAFEY LDLLKAKAGE EVVEQIYAFK DKAGRELGLR FEMTTPIARI VASRLDLAKP LRFYYVQPVW RYEEPQRGRW REFWQAGIEL FGISEPEGDA EVVAVTFDAL KAVGLKDFDI RVNDRRVVED LVLGAGIPGD LLPSALRVLD KMDKFGEEYV VSELAKLGLR EDAATSLLEK LKSGSLDIDT STQPGREGLR RLALVVDTLK NCYGINVTVD YAIVRGLGYY TGFVFEVKAG SSEGLGSIAG GGRYDDLVSV VGGPKIPASG MAIGVERLLE ALSMQGALKL DYREVDVCVI PVKKTPEILS EAVAVARELR VAGMKVVLEV SERSLSKLLE AASKRGARFA IILGERELKE GVVTVRDLYL WKEEKVARPH LYEYIRAGSS T
|
| |
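The gene and protein sequences above can be compared with a small translation sketch. It rests on two assumptions: translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, and the initiator codon (GTG here) is read as methionine. The names `CODON_TABLE` and `translate_cds` are hypothetical helpers, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order,
# paired with the matching amino-acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna):
    """Translate a CDS, reading the first codon as Met (assumption: it is
    a valid initiator codon) and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTCGAAGA ACCTCGTACT TCAACCTCCC"))  # MSKNLVLQPP
```

The output matches the first ten residues of the protein sequence listed above.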