Gene Tpen_1659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1659 
Symbol 
ID4601736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1605657 
End bp1606823 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content59% 
IMG OID639774432 
Producttyrosyl-tRNA synthetase 
Protein accessionYP_921057 
Protein GI119720562 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0162] Tyrosyl-tRNA synthetase 
TIGRFAM ID[TIGR00234] tyrosyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.661585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGTGTT CTACGCTCTC GGGGGTGTCG GCTTTGGATC CAGAGAAGGT TGTTGAAGCC 
GCTTTGCGGG AGCCAACAGA GGAGTTGCTC ACGCCGGACA GGCTGAAGAG CTTGGTCGAA
AACGGAGTGC CACTGGTGCA CTATATAGGC TTCGAGATCT CCGGGCTCGT CCACCTCGGT
ACCGGGCTTA TCTCCATGCA GAAAGTGGCG GACCTCCAGA GCGTCGGCGT GAAGACTAAG
GTGTTCCTGG CGGACTACCA CAGCTGGATC AACAGGAAGC TTGGAGGGGA CCTCGACGTG
ATAAGGAGGG TTGCCGGCGG CTACTTCAAG GAGGCCTTGA GGGTTTCGCT GAGGATAGTG
GGGGGAGACC CCGACAAAAC GGAGTTCATC CTCGGGTCGG AGCTCTACGA GAAGCTTGGA
CTAGACTACT TCACGAACGT GCTGAGGGTT TCCATGGAGA CGACACTGTC GCGCGTCAGG
AGGTCTGTAA CCATCCTCGG GAGGTCGGAG TCGGAGGCGC TTAGCTTCGC ACAGCTACTC
TACGTGCCCA TGCAGGTGGC GGACATCTTC AGCATGGGCG TCAACATACC GCACGGCGGC
ATGGATCAGA GGAAAGCCCA CGTTATAGCG ATCGAGGTCG GAGAAAAGCT GTTCGGGTAC
AAGCCCGTAG CCCTCCATCA CCACCTCCTC CCGGGGCTCC AGCTCGACGC CAGCGACTGG
AAGAAGCTCG TAGAGGCGAA GAACTCCGGG GACAAGGAGC TCTTCCGCGA GACCCTCGTG
AACATTAAGA TGTCGAAGTC GAAGCCCGAG ACCGCGCTAT TCATACACGA CTCCGAGGAG
GAAATCAGGA GCAAGATAGG CAGGGCATTC TGCCCCGCGG GCGAGGTCGA GATGAACCCC
CTGCTCGAGA TAGCTAGGTA CATAGTCTTC AGGAACAGGA AGGAGCCCTT CGAGGTTGTA
AACAAGAAGA CAGGGGAGAG GAGGGCTTTC AACACGTACG GCGAGCTCGA GGAAGCCTTC
CGGGAGAGGC TCGTGCACCC GGCGGACCTA AAGGAGGCTT TATCGCGCGA GCTGGCCGAG
ATACTGGCGC CCGCGAGGAA GCACTTCACC GAGGGGCCCG GCAGGAGCTT CCTGGAGGAG
ATAAAGGAGC TGAGGATAAC GAGGTAG
 
Protein sequence
MLCSTLSGVS ALDPEKVVEA ALREPTEELL TPDRLKSLVE NGVPLVHYIG FEISGLVHLG 
TGLISMQKVA DLQSVGVKTK VFLADYHSWI NRKLGGDLDV IRRVAGGYFK EALRVSLRIV
GGDPDKTEFI LGSELYEKLG LDYFTNVLRV SMETTLSRVR RSVTILGRSE SEALSFAQLL
YVPMQVADIF SMGVNIPHGG MDQRKAHVIA IEVGEKLFGY KPVALHHHLL PGLQLDASDW
KKLVEAKNSG DKELFRETLV NIKMSKSKPE TALFIHDSEE EIRSKIGRAF CPAGEVEMNP
LLEIARYIVF RNRKEPFEVV NKKTGERRAF NTYGELEEAF RERLVHPADL KEALSRELAE
ILAPARKHFT EGPGRSFLEE IKELRITR