Gene Tpen_0325 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0325 
Symbol 
ID4600976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp290946 
End bp293375 
Gene Length2430 bp 
Protein Length809 aa 
Translation table11 
GC content54% 
IMG OID639773085 
Productvalyl-tRNA synthetase 
Protein accessionYP_919737 
Protein GI119719242 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGTAG AGTTTCGCTT ACCCAAGGAA TACAACTTCA AGGCGGTGGA GGGTAAGTGG 
CAGAGATTCT GGGAGGAGAA GGGCATATAC AGGTTTGACC GGAAGGACAG GAGTAGGCCT
GTCTACGTCA TAGATACGCC TCCCCCGTAT CCCAGCGGAG ACCTCCACGT GGGGAACGCG
CTGAACTTTT CCTACATAGA CTTCGTGGCA AGGTACAAGA GGATGAGAGG GTACAACGTG
CTCTTCCCGC AGGGCTGGGA TTGCCACGGG CTTCCAACGG AGGTTCGCGT GGAGAAGGCT
GTCGGCAAGA GGAAGAGTGA AATGGATCCC AACGAGTTCC TGAGGCTTTG CCGCGAGTAC
ACGTTGAAGT GGATAGAGAG TATGAAAGCC GCCCTTAAAG GCTTAGGCTT AAGCATCGAC
TGGTCTACGG AGTACAAAAC TATGGACCCG GATTACTGGA GGAGAACACA ACTGAGCTTC
GTCCTAATGT ACAACAAGGG GTTGATCTAC AGGGGGGAGC ACCCGGTCAT ATGGTGTCCA
CGTTGTGAAA CGGCAATAGC CGAAGCTGAA GTGGAGTACG AAGAAAGGGA TCGGCCGTTG
TACTACTTCA AGTTTGGCGT TGAAGGAACA GGAGAAGAAC TCGTAGTAGC ATCCACGAGG
CCCGAGTTGT TGGCTTCTTG CGTGGCCGTC GCCGTGAACC CATCCGATGA GAGGTACAAG
CACCTCGTGG GAAAAAACGC TGTAGTGCCT ATCTACGGTA GAAAGGTGCC CATAATAGCC
GACGAAGCCG TGGATAAGGA CTTCGGCACA GGCGCAGTCA TGGTGTGCAC GTACGGAGAT
AAGACGGACG TGAAGTGGCA GAAAAAGTAC AACCTACCGG TGATCATCTC GATAACCGAG
CAGGGAACAA TGAACGACAA CGCCGGACCC CTTAAGGGCT TAAAGGTGGA GGATGCCAGA
AAGAAGATTG TCGAAATGTT GAAGGAAAAC GGTTTGCTCG TGAAGGTTGA GAGCATAAGA
TCCACTGTCG GGACCTGTTG GCGGTGCCAC ACCCCCGTAG AGATAATACC CAAGAAGCAG
TGGTTCGTAC GCTCGACTGC GCTCAACGAA AAAGTTCTCG AAGAAGGGAG GAAGGTGAAC
TGGGTTCCAA GCTACATGTA TAAAAGGCTC GAAAACTGGG TTCTGAGCCT GGACTGGGAC
TGGGTAATAT CGAGACAAAG GCTCTTCGCG ACCCCCATCC CCGTGTATTA CTGCAAGGAC
TGTGGAGCCG AGCTAGTAGT TCCTCCAGAA AAGCTACCGA TAGACCCTAG GTTCGACCCG
CCACCATTCG AGAAGTGTCC AAAGTGCGGT TCAAAGAACA TAGTGCCGGA GAGAGACGTT
ATGGACACGT GGATGGATTC GAGCATAACT GCCGCGGTGC ACGCCGGCTG GCCTGACAAC
TTCGACGAAA GGCTTTTCCC CGCCGACCTT CAGCCAAACG GCTACGACAT TATAAGAACG
TGGGACTACT ACCTGATATT GCGAGGCGTA GCTCTCTTCG GACGTTCACA GTTCAAAACA
GCCCTAATCA ACGGGATGGT GAGAGGCACA GATGGGAGGA TGATGCATAA GAGCTACGGG
AACTATGTTG CCGTGCAGGA GGTTCTCGAG AAGTACGGTG CAGACAGCTT CAGGCTTTGG
GTTGCCCTCG CAGCCGCTAC AGGGCAAGAC GTCAGGTTCT CCTGGGACGG CGTCGACTAC
GCGCACAGGT TCCTAGTTAA GGTGTGGAAT CTCGCGAGGC TTGCAAGCCC GTTTATAGAG
GACGTGAGAG AAGTCCCCCT CGGCAACCTC TCGCATGCTG ACCACTGGAT ACTGCGCGAG
CTCGCTTCAA CGGTGACTCG CGTAACGCAA GCCCTTGAAA ACTACAACTT TCAGGAAGCC
TCTCAGGCTC TCGTAGACTT CACTTGGCAC AAGCTCGCCG ACCACTACGT CGAGGCTGTG
AAGCACAGAC TTAGCAGGAG CGACGAAGCC GCGAAGTACA CGCTGTACGT AGTACTCATT
AAGACGTTGC AGATGCTCTC GGTCTTCGCG CCCCACATCT CCGAGGAGAT CTACGTCGAC
GTTTTGAAGA AGGCTGGAGG CTGGGAGAGC ATAACGGTGT CCCCATGGCC GGAGCCGCCG
GCCTACGACG AGGAGAAGGC CAGGGTAGGA GACATACTGA TAGCGGTTCT GGCGGAGGGT
AGGCGCGCCA AGCACGACGC TAGGATACCG CTTAACAAGG AAGTAAGCGC GGTGTACTTG
TACTCGGAGA AGTACTCAGA GGAGCTGAAA GCAGTTGTGG ACGACGTAGC GGGCACCCTG
CGCGCCAAGA AGGTAGAGGT CGTAAGGGGA GAGCCCGCGG GCAGAAAGGT ACCCGAGTAC
CCGGAGATTT CGATACAGAT TTCGCCGTAA
 
Protein sequence
MTVEFRLPKE YNFKAVEGKW QRFWEEKGIY RFDRKDRSRP VYVIDTPPPY PSGDLHVGNA 
LNFSYIDFVA RYKRMRGYNV LFPQGWDCHG LPTEVRVEKA VGKRKSEMDP NEFLRLCREY
TLKWIESMKA ALKGLGLSID WSTEYKTMDP DYWRRTQLSF VLMYNKGLIY RGEHPVIWCP
RCETAIAEAE VEYEERDRPL YYFKFGVEGT GEELVVASTR PELLASCVAV AVNPSDERYK
HLVGKNAVVP IYGRKVPIIA DEAVDKDFGT GAVMVCTYGD KTDVKWQKKY NLPVIISITE
QGTMNDNAGP LKGLKVEDAR KKIVEMLKEN GLLVKVESIR STVGTCWRCH TPVEIIPKKQ
WFVRSTALNE KVLEEGRKVN WVPSYMYKRL ENWVLSLDWD WVISRQRLFA TPIPVYYCKD
CGAELVVPPE KLPIDPRFDP PPFEKCPKCG SKNIVPERDV MDTWMDSSIT AAVHAGWPDN
FDERLFPADL QPNGYDIIRT WDYYLILRGV ALFGRSQFKT ALINGMVRGT DGRMMHKSYG
NYVAVQEVLE KYGADSFRLW VALAAATGQD VRFSWDGVDY AHRFLVKVWN LARLASPFIE
DVREVPLGNL SHADHWILRE LASTVTRVTQ ALENYNFQEA SQALVDFTWH KLADHYVEAV
KHRLSRSDEA AKYTLYVVLI KTLQMLSVFA PHISEEIYVD VLKKAGGWES ITVSPWPEPP
AYDEEKARVG DILIAVLAEG RRAKHDARIP LNKEVSAVYL YSEKYSEELK AVVDDVAGTL
RAKKVEVVRG EPAGRKVPEY PEISIQISP