Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_06970 |
Symbol | |
ID | 7312932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 756186 |
End bp | 757490 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643611128 |
Product | thymidine phosphorylase |
Protein accession | YP_002508449 |
Protein GI | 220931541 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGCAT ATGATATCAT TTATAAAAAA AGGGAAGGTT TTAAATTATC AAAAGAGGAA ATAGATTTTT TAATTCAGGA ATATACCCGT GGTCAAATAC CAGACTATCA AATGTCAGCC TGGGCTATGG CTGTTTTCTT CAAAGGTATG GATTCTGAAG AAACTTCACA CCTGACAATG GCTATGGCTA AATCCGGGGA TATTATTGAT TTGAGTGAAA TTCGTGGAAT AAAAGTAGAT AAACATAGTA GTGGTGGTGT TGGTGATACG ACTACTCTGG TTCTGGCGCC GCTGGTTGCT GCTGCCGGAA TTCCTGTTGC CAAGATGTCC GGGCGGGGTC TGGGTCATAC CGGAGGTACT ATTGATAAAC TGGAATCTAT TCCTGGATTT AAAACAGAGC TTGATCGTCG AGATTTTATA AATATCGTTA ATTCTACTGG TGTTGCTGTG GCCGGTCAAA CCGGTAATCT GACTCCTGCT GACAAAAAGC TATACAGTTT AAGGGATGTA ACAGCAACAG TTGATTCTAT ACCCCTGATA GCCAGCAGTA TAATGAGTAA GAAGATTGCC GGAGGGGCCG ATGGTATTGT CCTTGATGTT AAAACAGGCC GTGGTGCCTT TATGGAAAAC CTGGAAGATG CCAGGAAACT GGCCCGGGCT ATGGTTGAAA TAGGGAGACA GGTCCAGAGA AAAACTATAG CAGTGATAAC AGATATGAAT CAGCCTCTGG GATATGCCGT AGGTAATGCC CTTGAAGTGA AAGAGGCTAT TGACACCCTT GGGGGACATG GGCCTGAGGA TTTAGAGGAA TTATGCCTGA CCCTGGGGGC TAATATGCTT GTAATTGGTG AAAAGGCCAC TGATTTTGAA GAAGGGTATA ATAAATTAAA GGACCTGATT GAGACCGGTA AAGCCCTTGA AAAGTTTAAA GAGTTTATAA AGGCTCAAAA AGGAAATCCT GATGTAGTTG ATAATAAAGA ACTATTACCC CGGGCCAATA ATATAATAGC TGTTAAAGCC AATAATGATG GCTATGTCCA GCAGATAGAT GCCAGAGAGA TTGGACTAAC TGTTATGTCT TTAGGTGGAG GACGGGAGAA AAAAGGTGAC CGGATCGATC CTGCTGTTGG TATTGTTCTG AAGAAAAAAA TGGGTGATAA GGTGAATAAA GATGAACTAC TTGCAGAAAT ACATATTAAT GATACTACAA ACAGTGAAGA AGTAAAAGAA AGAGTTCAAA AAGCTATAAT TATAGGCCAG GAAAAGAATA AAAGAAACAA GTTAATTTAT GAGATAATCG AATAA
|
Protein sequence | MRAYDIIYKK REGFKLSKEE IDFLIQEYTR GQIPDYQMSA WAMAVFFKGM DSEETSHLTM AMAKSGDIID LSEIRGIKVD KHSSGGVGDT TTLVLAPLVA AAGIPVAKMS GRGLGHTGGT IDKLESIPGF KTELDRRDFI NIVNSTGVAV AGQTGNLTPA DKKLYSLRDV TATVDSIPLI ASSIMSKKIA GGADGIVLDV KTGRGAFMEN LEDARKLARA MVEIGRQVQR KTIAVITDMN QPLGYAVGNA LEVKEAIDTL GGHGPEDLEE LCLTLGANML VIGEKATDFE EGYNKLKDLI ETGKALEKFK EFIKAQKGNP DVVDNKELLP RANNIIAVKA NNDGYVQQID AREIGLTVMS LGGGREKKGD RIDPAVGIVL KKKMGDKVNK DELLAEIHIN DTTNSEEVKE RVQKAIIIGQ EKNKRNKLIY EIIE
|
| |