Gene Tpen_0093 details

Gene Information

Locus tag: Tpen_0093
Symbol:
ID: 4601385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 73057
End bp: 74619
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 56%
IMG OID: 639772847
Product: putative thymidine phosphorylase
Protein accession: YP_919506
Protein GI: 119719011
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02645] putative thymidine phosphorylase
            [TIGR03327] AMP phosphorylase
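
The start, end, gene length and protein length reported above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(73057, 74619)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed gene length of 1563 bp and protein length of 520 aa.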


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.543018
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCCCCG TCTTCGAGGT GGTCTGGATG AAGCTCAAAA CGAGGATCCT TCCGTTCGAG 
TCTGCCCACT ACACAGTCGT GCTCGATCAG AGCGTGGCGA AGAAACTCGA CGTGAGGCCT
AGCGACAGAG TACTTGTACG TTTCAACGGG AAAACCGTGG TAGCGATAGC TAACATAGCG
AAAGAGTTTT CCCACGAACA CGTGGGAGTC TACGTAAACA TAGCGAAAGC GCTGGGGATA
TCGGACGGAG ACGAGGTCGA AGTAGAGGCC ACAAGTCCGC CGGCATCCCT GCAGGCAATA
AGGAAGAAAC TACAGGGCTT GAGCCTCGAA TCCGACGAGA TATACCAGGT AGTAAAGGAC
ATAGTGGATG GAAAGCTGAG CGAGCTCGAG CTCGCAGCCT TCGTGACCGC GGTACATTTC
CAGGGGATGA CCCCGTCTGA GATATACTCC TTTACGCTCT CAATGGTCGA GACGGGGCAG
AGGTTAAGGC TTAAAAGGAA GCCTATACTC GACAAGCACA GCCTTGGCGG TGTTCCCGGG
GATAAGACGA GCCTCCTCGT AGTACCGATA ATAGCTTCCC TGGGCTTCAC AATTCCTAAG
ACCTCCTCGA GGGCCATAAC CTCCGCCGCC GGGACGGCCG ATAGAATGGA GGTGCTGGCG
CCGGTCAACC TGTCGATAGA TGAAATCGAG AGAATAGTCG AGAAGACGAA TGCGTGCCTC
GTCTGGGGAG GAGCCCTGAA CCTAGCCCCT GCTGACGACA TCATAATTAG GGTCGAGTAC
CCCCTCGGGA TAGACCCCTT CTACATCCCG TCCATCCTCG CAAAGAAGCT TGCAGTAGGG
TCTACGCACG TCGTTTTAGA CGTGCCCACA GGTAGGGGTA CGAAGGTGAA GACGCTAGAA
GAGGCGAAGA GAATCTCTCA AAGTTTCTTC GAAATAGCCA GGATGTTCGG CATGAACCTG
CAAGCAGTAG CGACGTACGC GGAGGAGCCC ATTGGGCACG CGATAGGTCC AGCTCTCGAA
GCTCGTGAAG CTTTAATCGC GTTGCGAGAG CTACGGCCGG GGGACCTCGT CGACAAAGCG
GCGAGCCTGG CGGGCACTCT CCTGGAAATG GTGGGGGTGG AGAACGGTTA CGAAACGGCT
ATGGAAGCTC TGAGAACGGG GAAGGCCGAG AAGAAGCTCC GGGAGATAAT CGAGGCCCAG
GGCGGAGACC CAGACGTTAC CCCCGAAGAG ATACCGCTGG GAGACAAGAC GTACACACTG
TACTCGGAGG AGGACGGCTT CGTATACTAC ATCGACAACT CGTTGCTGGC GAACATAGGT
AAGATTGCCG GGGCACCGAT AGACAAGGGC GCCGGGGTAT ACATCCACGT CAAGCTGGGC
GAGAAGGTCA GGAAGGGAGA CCCCCTGCTT ACAGTCTACT CTTCGAGCTC GGCGAAGCTT
CAAGCGGTGG AAAGAATCCT CGAAGACTCT AAGCCGGTGC TTGTAGGGCG GACTGCCGGC
AGGAGAATGC TCTTAGAGAG GATTCAGTAC CAGCCTCCGA GACAGCTGGT ATTGGAGAGA
TGA
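
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any statistic such as GC content can be recomputed. Here is a minimal sketch of that step, assuming plain A/C/G/T input; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a block-formatted DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above (60 bases), used only as sample input.
first_row = "ATGGTCCCCG TCTTCGAGGT GGTCTGGATG AAGCTCAAAA CGAGGATCCT TCCGTTCGAG"
print(len(clean_sequence(first_row)), gc_fraction(first_row))
```

Passing the full gene sequence instead of `first_row` gives a value that can be compared with the GC content reported in the Gene Information section.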
 
Protein sequence
MVPVFEVVWM KLKTRILPFE SAHYTVVLDQ SVAKKLDVRP SDRVLVRFNG KTVVAIANIA 
KEFSHEHVGV YVNIAKALGI SDGDEVEVEA TSPPASLQAI RKKLQGLSLE SDEIYQVVKD
IVDGKLSELE LAAFVTAVHF QGMTPSEIYS FTLSMVETGQ RLRLKRKPIL DKHSLGGVPG
DKTSLLVVPI IASLGFTIPK TSSRAITSAA GTADRMEVLA PVNLSIDEIE RIVEKTNACL
VWGGALNLAP ADDIIIRVEY PLGIDPFYIP SILAKKLAVG STHVVLDVPT GRGTKVKTLE
EAKRISQSFF EIARMFGMNL QAVATYAEEP IGHAIGPALE AREALIALRE LRPGDLVDKA
ASLAGTLLEM VGVENGYETA MEALRTGKAE KKLREIIEAQ GGDPDVTPEE IPLGDKTYTL
YSEEDGFVYY IDNSLLANIG KIAGAPIDKG AGVYIHVKLG EKVRKGDPLL TVYSSSSAKL
QAVERILEDS KPVLVGRTAG RRMLLERIQY QPPRQLVLER
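
The protein sequence can be related back to the gene sequence by codon-wise translation. Below is a minimal sketch, assuming the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). The `translate` helper is hypothetical, and the sample input is the first row of the gene sequence.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence (60 bases, 20 codons).
first_row = "ATGGTCCCCG TCTTCGAGGT GGTCTGGATG AAGCTCAAAA CGAGGATCCT TCCGTTCGAG"
print(translate(first_row))  # MVPVFEVVWMKLKTRILPFE
```

The 20 residues obtained from the first row match the first two blocks of the protein sequence listed above.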