Gene Tpen_0659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0659 
Symbol 
ID4601617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp609834 
End bp611129 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content56% 
IMG OID639773432 
Producthistidyl-tRNA synthetase 
Protein accessionYP_920064 
Protein GI119719569 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGAAGA ACCTCGTACT TCAACCTCCC CGGGGAACGA GGGACTGGTT GCCAGAAGAA 
GCTTACGCGA AGCGGATAGT GTCCGAGAAA ATAAGGGAGG TCTTCGAGAG CTATGGATAT
GGAGAGGTGA TAACCCCTGC CTTCGAGTAC CTGGACTTGT TGAAGGCTAA AGCCGGGGAG
GAAGTCGTCG AGCAGATATA CGCCTTCAAG GACAAGGCAG GCAGAGAGCT GGGACTAAGG
TTCGAGATGA CCACGCCTAT CGCCAGGATA GTGGCCTCGC GGCTTGACCT GGCTAAGCCG
CTACGTTTCT ACTACGTGCA ACCCGTATGG AGGTACGAGG AACCCCAGAG GGGAAGGTGG
CGCGAGTTCT GGCAAGCTGG TATCGAGCTC TTTGGGATCT CTGAGCCGGA AGGCGACGCC
GAGGTTGTCG CCGTAACGTT CGACGCTCTA AAGGCTGTAG GGCTCAAGGA CTTCGACATA
CGGGTTAACG ATAGAAGGGT TGTCGAGGAT CTCGTGCTGG GAGCAGGGAT CCCCGGGGAT
CTCTTGCCGA GCGCCTTAAG AGTGCTTGAC AAGATGGACA AGTTCGGGGA GGAGTACGTG
GTATCCGAGC TGGCGAAGCT CGGGCTAAGG GAGGATGCCG CTACGTCTCT TCTCGAGAAG
CTGAAGAGCG GTAGCCTCGA CATTGATACC TCGACGCAGC CCGGTAGGGA GGGGTTGAGG
AGGCTAGCCC TCGTGGTGGA TACGCTCAAG AACTGTTACG GAATAAACGT TACAGTTGAC
TACGCAATCG TAAGGGGACT CGGGTACTAC ACGGGTTTTG TGTTCGAGGT AAAAGCAGGC
TCTTCGGAGG GGCTGGGGAG CATAGCCGGG GGCGGGAGAT ACGACGATCT CGTGAGCGTA
GTGGGAGGCC CAAAGATTCC AGCCTCCGGT ATGGCTATAG GCGTAGAGAG GCTACTCGAA
GCCTTATCAA TGCAGGGAGC ATTAAAGCTC GACTACAGGG AGGTGGACGT CTGCGTCATA
CCTGTCAAGA AAACCCCGGA AATACTCTCG GAGGCCGTAG CGGTTGCGAG AGAGCTACGC
GTAGCCGGTA TGAAGGTTGT CTTGGAGGTC TCTGAGAGGA GCCTATCCAA GCTTCTAGAA
GCAGCCTCCA AGAGAGGAGC ACGCTTCGCG ATAATACTGG GAGAGAGGGA ACTAAAGGAG
GGCGTAGTTA CTGTGCGTGA CTTATACTTG TGGAAGGAGG AGAAAGTTGC GCGTCCGCAC
TTGTACGAGT ATATAAGAGC AGGCTCCTCG ACTTAG
 
Protein sequence
MSKNLVLQPP RGTRDWLPEE AYAKRIVSEK IREVFESYGY GEVITPAFEY LDLLKAKAGE 
EVVEQIYAFK DKAGRELGLR FEMTTPIARI VASRLDLAKP LRFYYVQPVW RYEEPQRGRW
REFWQAGIEL FGISEPEGDA EVVAVTFDAL KAVGLKDFDI RVNDRRVVED LVLGAGIPGD
LLPSALRVLD KMDKFGEEYV VSELAKLGLR EDAATSLLEK LKSGSLDIDT STQPGREGLR
RLALVVDTLK NCYGINVTVD YAIVRGLGYY TGFVFEVKAG SSEGLGSIAG GGRYDDLVSV
VGGPKIPASG MAIGVERLLE ALSMQGALKL DYREVDVCVI PVKKTPEILS EAVAVARELR
VAGMKVVLEV SERSLSKLLE AASKRGARFA IILGERELKE GVVTVRDLYL WKEEKVARPH
LYEYIRAGSS T