Gene Nther_1666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1666 
Symbol 
ID6315313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1742711 
End bp1743526 
Gene Length816 bp 
Protein Length271 aa 
Translation table11 
GC content39% 
IMG OID642644042 
Productinosine guanosine and xanthosine phosphorylase family 
Protein accessionYP_001917828 
Protein GI188586283 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0005] Purine nucleoside phosphorylase 
TIGRFAM ID[TIGR01697] inosine guanosine and xanthosine phosphorylase family
[TIGR01700] purine nucleoside phosphorylase I, inosine and guanosine-specific 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.321595 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0366177 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTC ATAAAATAAC AGAAGCTAAG GAGTATATAT TGAATCAGTG TGATGAAGTT 
CCATCAATTG GTTTGATTTT AGGGTCAGGT TTGGGAAGTT TAGCTGACGA AATTGAAAAT
GTTAACTATA TACCATACCA AGATATCCCT CATTTTCCCG AATCTACAGT TAAAGGGCAT
AAGGGGAGGT TAGCAATAGG TAAACTTGAG AGCAAAACGG TCATGGCTAT GCAGGGTAGA
TTTCATTACT ATGAAGGATA TGATTTGAAA GAAGTGACAT TTCCAGTTCG TGTAATGCAA
AATATGGGAA TAGACAAACT GATAGTAACA AACGCAGCAG GTGGAATTGA TTTGAATTTT
TCAGTGGGAG GTTTGATGTT AATTACCGAT CATATTAACT TTTTTGGCGC AAATCCCTTG
AGAGGAAAAA ATGATGATCG GCTAGGAACC CGCTTCCCAG ATATGACCTA TGCCTATGAC
CCTGAATTAC GAGAAGTAGC TAAACAAGCC GCCAATGAAC TGGACATAGG ACTTTATGAA
GGCGTATATT TAGGAGTACC TGGTCCCTCT TATGAAACTC CTGCTGAGAT CAAAATGGCT
CGAACCATAG GAGCAAGTGC AGTGGGTATG TCTACAGTGC CAGAAGTGAT AGCCGCCAAT
CACGGTAATA TGAATGTATT AGGGATTTCA TGTATAACGA ATATGGCTGC AGGAATTCAA
GATAAACGTT TAACTCATAA AGAAGTTGAA GAAGTTACAA CTAGAGTTCG GGGCGAATTT
CAGAGTTTAG TAAGAAAAGT GTTAGCATCA ATGTAA
 
Protein sequence
MDFHKITEAK EYILNQCDEV PSIGLILGSG LGSLADEIEN VNYIPYQDIP HFPESTVKGH 
KGRLAIGKLE SKTVMAMQGR FHYYEGYDLK EVTFPVRVMQ NMGIDKLIVT NAAGGIDLNF
SVGGLMLITD HINFFGANPL RGKNDDRLGT RFPDMTYAYD PELREVAKQA ANELDIGLYE
GVYLGVPGPS YETPAEIKMA RTIGASAVGM STVPEVIAAN HGNMNVLGIS CITNMAAGIQ
DKRLTHKEVE EVTTRVRGEF QSLVRKVLAS M