Gene Information |
Locus tag | Cthe_1201 |
Symbol | |
ID | 4810154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1431771 |
End bp | 1432598 |
Gene Length | 828 bp |
Protein Length | 275 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106624 |
Product | purine nucleoside phosphorylase I, inosine and guanosine-specific |
Protein accession | YP_001037626 |
Protein GI | 125973716 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0005] Purine nucleoside phosphorylase |
TIGRFAM ID | [TIGR01697] inosine guanosine and xanthosine phosphorylase family; [TIGR01700] purine nucleoside phosphorylase I, inosine and guanosine-specific |
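The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1431771
END_BP = 1432598
GENE_LENGTH_BP = 828
PROTEIN_LENGTH_AA = 275


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 828
    print(expected_protein_length(GENE_LENGTH_BP))  # 275
```

Under these assumptions both results agree with the Gene Length and Protein Length rows of the record.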
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.202694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAATA TATACGAAAA GGCACGGGAA ACGGCTTCAT TTATCAAAAG GATAATAAAA GAAACGCCTG AGATTGCCAT TGTCCTTGGT TCGGGTTTGG GACCTTTGGC GGATGAGATT GAAAACAAAG TTGAGATTGA TTATAAAGAC GTACCCAACT TTCCGGTGAC TACCGTTGAA GGGCATGCCG GCAAATTTGT ATACGGAATT TTGGAAAACC GGCGCGTAAT TGCCATGAAA GGGCGTTTTC ATCACTATGA AGGATATGAT GTATCGCAGA TTGTTTTTCC CGTCAGGGTC TTTAAAATGC TGGGAATAAA CAATCTTATT GTCACAAATG CTTCCGGTGG GATAAACAGA AGTTTCAGGC CGGGAGATCT TATGATTATT AAAGACCACA TAAGCTTCTT TGCTCCGTCT CCTTTAAGAG GCAAAAACAT AGATGAGTTC GGATTAAGGT TTCCGGATAT GTGCAAGGCA TACAATCCGA AGCTTGTTGA AATTTGTAAA AAAGCAGCTT CAGATGTGGG AGTGGATGTC AAAGAAGGAG TCTATGCTTT TACCCAGGGG CCCATGTATG AGACACCTGC CGAGATAAGG GCGCTTGGAA TACTTGGAGC CGATGCTGTC GGTATGTCCA CGGTTCCGGA GGTTATTGCG GCAAGACATG CTAATATGAA TATTCTGGGA ATTTCGTGCA TAACCAATAT GGCGGCGGGA ATTTTGGATC AGCCTTTGAC TCATGAAGAG GTTATGAAAA CGGCAAAAGA AGCCGAAAAT AAATTTGTCC GTTTGGTTAA AAGAGTAATA TCCGTCTGGG AGGTATAA
|
Protein sequence | MDNIYEKARE TASFIKRIIK ETPEIAIVLG SGLGPLADEI ENKVEIDYKD VPNFPVTTVE GHAGKFVYGI LENRRVIAMK GRFHHYEGYD VSQIVFPVRV FKMLGINNLI VTNASGGINR SFRPGDLMII KDHISFFAPS PLRGKNIDEF GLRFPDMCKA YNPKLVEICK KAASDVGVDV KEGVYAFTQG PMYETPAEIR ALGILGADAV GMSTVPEVIA ARHANMNILG ISCITNMAAG ILDQPLTHEE VMKTAKEAEN KFVRLVKRVI SVWEV
|
| |
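The sequences above are printed in space-separated blocks of ten, so they need light cleaning before any computation. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be translated and its GC fraction computed. It assumes the listed gene sequence is already the coding strand (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, dropping one terminal stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    gene_prefix = "ATGGACAATA TATACGAAAA GGCACGGGAA"
    print(translate(gene_prefix))  # MDNIYEKARE
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be compared in the same way.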