Gene Hore_06970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_06970 
Symbol 
ID7312932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp756186 
End bp757490 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content40% 
IMG OID643611128 
Productthymidine phosphorylase 
Protein accessionYP_002508449 
Protein GI220931541 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02643] thymidine phosphorylase
[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGCAT ATGATATCAT TTATAAAAAA AGGGAAGGTT TTAAATTATC AAAAGAGGAA 
ATAGATTTTT TAATTCAGGA ATATACCCGT GGTCAAATAC CAGACTATCA AATGTCAGCC
TGGGCTATGG CTGTTTTCTT CAAAGGTATG GATTCTGAAG AAACTTCACA CCTGACAATG
GCTATGGCTA AATCCGGGGA TATTATTGAT TTGAGTGAAA TTCGTGGAAT AAAAGTAGAT
AAACATAGTA GTGGTGGTGT TGGTGATACG ACTACTCTGG TTCTGGCGCC GCTGGTTGCT
GCTGCCGGAA TTCCTGTTGC CAAGATGTCC GGGCGGGGTC TGGGTCATAC CGGAGGTACT
ATTGATAAAC TGGAATCTAT TCCTGGATTT AAAACAGAGC TTGATCGTCG AGATTTTATA
AATATCGTTA ATTCTACTGG TGTTGCTGTG GCCGGTCAAA CCGGTAATCT GACTCCTGCT
GACAAAAAGC TATACAGTTT AAGGGATGTA ACAGCAACAG TTGATTCTAT ACCCCTGATA
GCCAGCAGTA TAATGAGTAA GAAGATTGCC GGAGGGGCCG ATGGTATTGT CCTTGATGTT
AAAACAGGCC GTGGTGCCTT TATGGAAAAC CTGGAAGATG CCAGGAAACT GGCCCGGGCT
ATGGTTGAAA TAGGGAGACA GGTCCAGAGA AAAACTATAG CAGTGATAAC AGATATGAAT
CAGCCTCTGG GATATGCCGT AGGTAATGCC CTTGAAGTGA AAGAGGCTAT TGACACCCTT
GGGGGACATG GGCCTGAGGA TTTAGAGGAA TTATGCCTGA CCCTGGGGGC TAATATGCTT
GTAATTGGTG AAAAGGCCAC TGATTTTGAA GAAGGGTATA ATAAATTAAA GGACCTGATT
GAGACCGGTA AAGCCCTTGA AAAGTTTAAA GAGTTTATAA AGGCTCAAAA AGGAAATCCT
GATGTAGTTG ATAATAAAGA ACTATTACCC CGGGCCAATA ATATAATAGC TGTTAAAGCC
AATAATGATG GCTATGTCCA GCAGATAGAT GCCAGAGAGA TTGGACTAAC TGTTATGTCT
TTAGGTGGAG GACGGGAGAA AAAAGGTGAC CGGATCGATC CTGCTGTTGG TATTGTTCTG
AAGAAAAAAA TGGGTGATAA GGTGAATAAA GATGAACTAC TTGCAGAAAT ACATATTAAT
GATACTACAA ACAGTGAAGA AGTAAAAGAA AGAGTTCAAA AAGCTATAAT TATAGGCCAG
GAAAAGAATA AAAGAAACAA GTTAATTTAT GAGATAATCG AATAA
 
Protein sequence
MRAYDIIYKK REGFKLSKEE IDFLIQEYTR GQIPDYQMSA WAMAVFFKGM DSEETSHLTM 
AMAKSGDIID LSEIRGIKVD KHSSGGVGDT TTLVLAPLVA AAGIPVAKMS GRGLGHTGGT
IDKLESIPGF KTELDRRDFI NIVNSTGVAV AGQTGNLTPA DKKLYSLRDV TATVDSIPLI
ASSIMSKKIA GGADGIVLDV KTGRGAFMEN LEDARKLARA MVEIGRQVQR KTIAVITDMN
QPLGYAVGNA LEVKEAIDTL GGHGPEDLEE LCLTLGANML VIGEKATDFE EGYNKLKDLI
ETGKALEKFK EFIKAQKGNP DVVDNKELLP RANNIIAVKA NNDGYVQQID AREIGLTVMS
LGGGREKKGD RIDPAVGIVL KKKMGDKVNK DELLAEIHIN DTTNSEEVKE RVQKAIIIGQ
EKNKRNKLIY EIIE