Gene Tpen_0408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0408 
Symbol 
ID4601502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp372541 
End bp373899 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content56% 
IMG OID639773173 
Productseryl-tRNA synthetase 
Protein accessionYP_919820 
Protein GI119719325 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0172] Seryl-tRNA synthetase 
TIGRFAM ID[TIGR00414] seryl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGTCCA TGCTCTACCT GCTCAGAAAC AACCCGGAGA AGCTCATAGA AAACATGAAA 
GCGCGCTTCA TGGATCCCTC CCTAGTTGAA AACGCGATAA AACTCGACGT CGAGTGGAGG
GCGAAGAAAA AGGAGTACGA CGAGCTAAAG CACAGGCTGA ACGAAGTATC CGCCAAGGTG
AGAAGCTCTA CTGGGCAGGA AAGAGAGAAG CTCATCCTCG AAGCCAGGGA GCTCAGCTCC
AAGGTAAGCC TGCTCGAGAA GGAGCTCGCA GAGCTGGAGG CTAAGCGCGA GCTAGCTCTA
AGGCGGCTAC CGAACGTCAT ACACGAGAGC GTGCCGATAG GCCCCGACGA GACATTTAAC
AAGCCCGTGC GGTACTGGGG TAGACCCAAG GTTTTCCGCG AGCACCTGGA GGGCTTCCTC
AAGGAGACGA AGGAGAAGGG CTTCGTGGTA GACTACGAAG TCCTGGACTG GAAGCCGCTC
GGGCACGCGG AGTTCCTCGA GAGTAAAGGT CTAACCAATA CCTTGAAGGC TGCGGAAGTA
GCCGGTGCCC GGTTCTACTA CATGTTCAAC GACTTGGTCT GGCTCGCCAT AGCCTTGGAG
CTCTACGCCC TAGAATACCT GGCAGGCAAA GGCTTCACCA TACTCCTGCC ACCCCTCATG
CTGAGACGCG AGATCCTGGA GGGCGTAGTA AGCTTCGACG ACTTCAAGGA CATGATATAC
AAGATCGACG GAGAGGACCT CTACCTAATA GGAACCGCCG AGCACCCCAT AGCAGCGCTA
CACGCCGGCG AGGTTTTAGC CGAGAAGGAG TTACCGCTCC TCTACGCCGG TGTAAGCGAG
TCTTTCAGGA AGGAGGCGGG CGCTCACGGA AAAGACACTA AGGGAATTTT CCGTGTCCAC
CACTTTGAGA AGGTGGAGCA GTTCGTCTTC TCCCACCCGG ACGAGTCCTG GGAGTGGCAC
GAGAAGCTCA TCAGGAACGC CGAAGAGCTC TGGCAGGGTC TCGAGATACC GTACAGAATC
GTCAACATAG CGTCCGGCGA CCTGGGCGTC GTAGCAGCTA AGAAGTACGA CCTAGAGGCT
TGGATGCCTG CCCAGGGGAA ATACCGGGAA ATGGTCTCCT GTAGCAACTG CACGGATTGG
CAAAGCTACA GGCTTAACAT AAGGTACGCC GAGGTTAGGG GCGGCCCCTC CAAGGGCTAC
GTACACACGC TCAACAGCAC AGCTCTAGCG ATTCAAAGAA CCATCACGGC GATAGTCGAG
AACCACCAGA CCCCCGACGG CTACGTAAAA GTGCCTAAAG CCCTCCACAA GTACCTCGAA
CCCTTCGAGA ACCACTTCAG GGTCCTGGTG CTCAAGTAA
 
Protein sequence
MWSMLYLLRN NPEKLIENMK ARFMDPSLVE NAIKLDVEWR AKKKEYDELK HRLNEVSAKV 
RSSTGQEREK LILEARELSS KVSLLEKELA ELEAKRELAL RRLPNVIHES VPIGPDETFN
KPVRYWGRPK VFREHLEGFL KETKEKGFVV DYEVLDWKPL GHAEFLESKG LTNTLKAAEV
AGARFYYMFN DLVWLAIALE LYALEYLAGK GFTILLPPLM LRREILEGVV SFDDFKDMIY
KIDGEDLYLI GTAEHPIAAL HAGEVLAEKE LPLLYAGVSE SFRKEAGAHG KDTKGIFRVH
HFEKVEQFVF SHPDESWEWH EKLIRNAEEL WQGLEIPYRI VNIASGDLGV VAAKKYDLEA
WMPAQGKYRE MVSCSNCTDW QSYRLNIRYA EVRGGPSKGY VHTLNSTALA IQRTITAIVE
NHQTPDGYVK VPKALHKYLE PFENHFRVLV LK