Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0093 |
Symbol | |
ID | 4601385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 73057 |
End bp | 74619 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639772847 |
Product | putative thymidine phosphorylase |
Protein accession | YP_919506 |
Protein GI | 119719011 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase [TIGR03327] AMP phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.543018 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCCCCG TCTTCGAGGT GGTCTGGATG AAGCTCAAAA CGAGGATCCT TCCGTTCGAG TCTGCCCACT ACACAGTCGT GCTCGATCAG AGCGTGGCGA AGAAACTCGA CGTGAGGCCT AGCGACAGAG TACTTGTACG TTTCAACGGG AAAACCGTGG TAGCGATAGC TAACATAGCG AAAGAGTTTT CCCACGAACA CGTGGGAGTC TACGTAAACA TAGCGAAAGC GCTGGGGATA TCGGACGGAG ACGAGGTCGA AGTAGAGGCC ACAAGTCCGC CGGCATCCCT GCAGGCAATA AGGAAGAAAC TACAGGGCTT GAGCCTCGAA TCCGACGAGA TATACCAGGT AGTAAAGGAC ATAGTGGATG GAAAGCTGAG CGAGCTCGAG CTCGCAGCCT TCGTGACCGC GGTACATTTC CAGGGGATGA CCCCGTCTGA GATATACTCC TTTACGCTCT CAATGGTCGA GACGGGGCAG AGGTTAAGGC TTAAAAGGAA GCCTATACTC GACAAGCACA GCCTTGGCGG TGTTCCCGGG GATAAGACGA GCCTCCTCGT AGTACCGATA ATAGCTTCCC TGGGCTTCAC AATTCCTAAG ACCTCCTCGA GGGCCATAAC CTCCGCCGCC GGGACGGCCG ATAGAATGGA GGTGCTGGCG CCGGTCAACC TGTCGATAGA TGAAATCGAG AGAATAGTCG AGAAGACGAA TGCGTGCCTC GTCTGGGGAG GAGCCCTGAA CCTAGCCCCT GCTGACGACA TCATAATTAG GGTCGAGTAC CCCCTCGGGA TAGACCCCTT CTACATCCCG TCCATCCTCG CAAAGAAGCT TGCAGTAGGG TCTACGCACG TCGTTTTAGA CGTGCCCACA GGTAGGGGTA CGAAGGTGAA GACGCTAGAA GAGGCGAAGA GAATCTCTCA AAGTTTCTTC GAAATAGCCA GGATGTTCGG CATGAACCTG CAAGCAGTAG CGACGTACGC GGAGGAGCCC ATTGGGCACG CGATAGGTCC AGCTCTCGAA GCTCGTGAAG CTTTAATCGC GTTGCGAGAG CTACGGCCGG GGGACCTCGT CGACAAAGCG GCGAGCCTGG CGGGCACTCT CCTGGAAATG GTGGGGGTGG AGAACGGTTA CGAAACGGCT ATGGAAGCTC TGAGAACGGG GAAGGCCGAG AAGAAGCTCC GGGAGATAAT CGAGGCCCAG GGCGGAGACC CAGACGTTAC CCCCGAAGAG ATACCGCTGG GAGACAAGAC GTACACACTG TACTCGGAGG AGGACGGCTT CGTATACTAC ATCGACAACT CGTTGCTGGC GAACATAGGT AAGATTGCCG GGGCACCGAT AGACAAGGGC GCCGGGGTAT ACATCCACGT CAAGCTGGGC GAGAAGGTCA GGAAGGGAGA CCCCCTGCTT ACAGTCTACT CTTCGAGCTC GGCGAAGCTT CAAGCGGTGG AAAGAATCCT CGAAGACTCT AAGCCGGTGC TTGTAGGGCG GACTGCCGGC AGGAGAATGC TCTTAGAGAG GATTCAGTAC CAGCCTCCGA GACAGCTGGT ATTGGAGAGA TGA
|
Protein sequence | MVPVFEVVWM KLKTRILPFE SAHYTVVLDQ SVAKKLDVRP SDRVLVRFNG KTVVAIANIA KEFSHEHVGV YVNIAKALGI SDGDEVEVEA TSPPASLQAI RKKLQGLSLE SDEIYQVVKD IVDGKLSELE LAAFVTAVHF QGMTPSEIYS FTLSMVETGQ RLRLKRKPIL DKHSLGGVPG DKTSLLVVPI IASLGFTIPK TSSRAITSAA GTADRMEVLA PVNLSIDEIE RIVEKTNACL VWGGALNLAP ADDIIIRVEY PLGIDPFYIP SILAKKLAVG STHVVLDVPT GRGTKVKTLE EAKRISQSFF EIARMFGMNL QAVATYAEEP IGHAIGPALE AREALIALRE LRPGDLVDKA ASLAGTLLEM VGVENGYETA MEALRTGKAE KKLREIIEAQ GGDPDVTPEE IPLGDKTYTL YSEEDGFVYY IDNSLLANIG KIAGAPIDKG AGVYIHVKLG EKVRKGDPLL TVYSSSSAKL QAVERILEDS KPVLVGRTAG RRMLLERIQY QPPRQLVLER
|
| |