Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1665 |
Symbol | |
ID | 6317074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1741302 |
End bp | 1742636 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642644041 |
Product | thymidine phosphorylase |
Protein accession | YP_001917827 |
Protein GI | 188586282 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.477565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.134735 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGCTT ATGAAGTCAT TGTAAAAAAA CGTGAAGGTG GAGAATTATC TTCTAGTGAA ATCGACTTTT TAGTACAAGG ATATACTAGG GGTGAAATAC CTGACTATCA GATGTCATCC TTTTTAATGG CAGCTTTTTT ACAGGGATTG AATAGCCAAG AAACTGCCCA ATTAACTAAA TCTATGGTAC ACTCAGGTGA AGTTTTAGAT CTTAGCAGGA TTTCCGGTAT CAAGGTTGAC AAACACAGTA CTGGTGGAGT AGGGGATAAA ACTACACTTG CACTTGCTCC TCTGGTAGCA TCAGCAGATC TCAACGTGGC AAAGATGTCT GGTAGAGGTC TAGGTCATTC TGGTGGAACC ATTGACAAAT TAGAAGCATT TTTGGGTTTT ACCCCTGAAC TGTCAATGGA AAACTTTATT GAACAGGTAC AAAAGCATAA CCTAGCTATT GTGGGGCAAA CAAAACAGCT GGCTCCAGCT GATGGCAAAA TTTATTCGTT AAGAGATGTA ACTGCTACTG TAGATTCAAT TCCGTTAATA GCAAGCTCAA TAATGAGTAA AAAACTTGCA GCTGGTACCA ATATGATTGT ATTGGATGTA AAAGTAGGCA AAGGGGCTTT TATGGAGAAT CTTGAAGATG CAACTGCCCT CGGACATGAG ATGGTTAATA TCGGCAAAAA TTTAGGAAGA AAAACGGTGG CAGTGATTAG TGATATGAAC CAGCCTTTGG GAAGAAAGGT TGGAAATTCC TTAGAAGTAC AAGAAGCCAT CGCTACATTG AAGGGCAATG GACCAGAAGA TTTTAAAGAA TTATGTCTCA ATTTAGGAGC CATCTTATTG AATATGGCCG AAAAAGTTAC CACAGTTACA GAAGGCAAAA AGTTATTATC AAATAAAATA AACAGTGGAG AGGCTTTAGC TAAGCTCGAG CAATTAGTAA AGGCTCAGAA TGGAGATACA TCAGGTATAC ATAATACAGA AAATCTGCCT CAAGCCAAGC ACTATAAAAT ATTAACAGCA GATAAATCTG GATTTATTAC AAATTTAGAT GCTAAAAAAG TAGGACTAGC CAGTGTAAAT TTAGGAGCTG GCAGGGCAAC CAAAGAAGAT AAAATTGACT TATCCGTTGG GATAGAATTA AATAAAAAAC TAGGTGATGA AGTTAGTACA GGTGATGAAT TAGCAAAAAT ATGGTATAAC GATGAAGATA AATTATTACA AGCCGCACCT ATTCTAGAAG ATGCTTTTGA TATTTCAGAA AGTGCTTCAG GAAAGTCACT TATTTATGGC ATGATAACTG AAAATACAAA TCCAGGTGAA TTAGATAGCA TTTAA
|
Protein sequence | MRAYEVIVKK REGGELSSSE IDFLVQGYTR GEIPDYQMSS FLMAAFLQGL NSQETAQLTK SMVHSGEVLD LSRISGIKVD KHSTGGVGDK TTLALAPLVA SADLNVAKMS GRGLGHSGGT IDKLEAFLGF TPELSMENFI EQVQKHNLAI VGQTKQLAPA DGKIYSLRDV TATVDSIPLI ASSIMSKKLA AGTNMIVLDV KVGKGAFMEN LEDATALGHE MVNIGKNLGR KTVAVISDMN QPLGRKVGNS LEVQEAIATL KGNGPEDFKE LCLNLGAILL NMAEKVTTVT EGKKLLSNKI NSGEALAKLE QLVKAQNGDT SGIHNTENLP QAKHYKILTA DKSGFITNLD AKKVGLASVN LGAGRATKED KIDLSVGIEL NKKLGDEVST GDELAKIWYN DEDKLLQAAP ILEDAFDISE SASGKSLIYG MITENTNPGE LDSI
|
| |