Gene Nther_1665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1665 
Symbol 
ID6317074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1741302 
End bp1742636 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content37% 
IMG OID642644041 
Productthymidine phosphorylase 
Protein accessionYP_001917827 
Protein GI188586282 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.477565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.134735 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGCTT ATGAAGTCAT TGTAAAAAAA CGTGAAGGTG GAGAATTATC TTCTAGTGAA 
ATCGACTTTT TAGTACAAGG ATATACTAGG GGTGAAATAC CTGACTATCA GATGTCATCC
TTTTTAATGG CAGCTTTTTT ACAGGGATTG AATAGCCAAG AAACTGCCCA ATTAACTAAA
TCTATGGTAC ACTCAGGTGA AGTTTTAGAT CTTAGCAGGA TTTCCGGTAT CAAGGTTGAC
AAACACAGTA CTGGTGGAGT AGGGGATAAA ACTACACTTG CACTTGCTCC TCTGGTAGCA
TCAGCAGATC TCAACGTGGC AAAGATGTCT GGTAGAGGTC TAGGTCATTC TGGTGGAACC
ATTGACAAAT TAGAAGCATT TTTGGGTTTT ACCCCTGAAC TGTCAATGGA AAACTTTATT
GAACAGGTAC AAAAGCATAA CCTAGCTATT GTGGGGCAAA CAAAACAGCT GGCTCCAGCT
GATGGCAAAA TTTATTCGTT AAGAGATGTA ACTGCTACTG TAGATTCAAT TCCGTTAATA
GCAAGCTCAA TAATGAGTAA AAAACTTGCA GCTGGTACCA ATATGATTGT ATTGGATGTA
AAAGTAGGCA AAGGGGCTTT TATGGAGAAT CTTGAAGATG CAACTGCCCT CGGACATGAG
ATGGTTAATA TCGGCAAAAA TTTAGGAAGA AAAACGGTGG CAGTGATTAG TGATATGAAC
CAGCCTTTGG GAAGAAAGGT TGGAAATTCC TTAGAAGTAC AAGAAGCCAT CGCTACATTG
AAGGGCAATG GACCAGAAGA TTTTAAAGAA TTATGTCTCA ATTTAGGAGC CATCTTATTG
AATATGGCCG AAAAAGTTAC CACAGTTACA GAAGGCAAAA AGTTATTATC AAATAAAATA
AACAGTGGAG AGGCTTTAGC TAAGCTCGAG CAATTAGTAA AGGCTCAGAA TGGAGATACA
TCAGGTATAC ATAATACAGA AAATCTGCCT CAAGCCAAGC ACTATAAAAT ATTAACAGCA
GATAAATCTG GATTTATTAC AAATTTAGAT GCTAAAAAAG TAGGACTAGC CAGTGTAAAT
TTAGGAGCTG GCAGGGCAAC CAAAGAAGAT AAAATTGACT TATCCGTTGG GATAGAATTA
AATAAAAAAC TAGGTGATGA AGTTAGTACA GGTGATGAAT TAGCAAAAAT ATGGTATAAC
GATGAAGATA AATTATTACA AGCCGCACCT ATTCTAGAAG ATGCTTTTGA TATTTCAGAA
AGTGCTTCAG GAAAGTCACT TATTTATGGC ATGATAACTG AAAATACAAA TCCAGGTGAA
TTAGATAGCA TTTAA
 
Protein sequence
MRAYEVIVKK REGGELSSSE IDFLVQGYTR GEIPDYQMSS FLMAAFLQGL NSQETAQLTK 
SMVHSGEVLD LSRISGIKVD KHSTGGVGDK TTLALAPLVA SADLNVAKMS GRGLGHSGGT
IDKLEAFLGF TPELSMENFI EQVQKHNLAI VGQTKQLAPA DGKIYSLRDV TATVDSIPLI
ASSIMSKKLA AGTNMIVLDV KVGKGAFMEN LEDATALGHE MVNIGKNLGR KTVAVISDMN
QPLGRKVGNS LEVQEAIATL KGNGPEDFKE LCLNLGAILL NMAEKVTTVT EGKKLLSNKI
NSGEALAKLE QLVKAQNGDT SGIHNTENLP QAKHYKILTA DKSGFITNLD AKKVGLASVN
LGAGRATKED KIDLSVGIEL NKKLGDEVST GDELAKIWYN DEDKLLQAAP ILEDAFDISE
SASGKSLIYG MITENTNPGE LDSI