Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1919 |
Symbol | purP |
ID | 6165059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 1691576 |
End bp | 1692610 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641669082 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase |
Protein accession | YP_001795280 |
Protein GI | 171186361 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.750635 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAATTA GTGGCGCCAA TCCGACGCAC GTGTCGGCCG CGTTGAAGAG ATACGACGTG GAGAAGCTGG CGGTCGCCAC GGTGGCGTCG CACACCGCGT TGCAGATCTT GAGGGGGGCG AAGAGGTTCG GCTTCAGGAC CATCGCCGTG GCCGGCAGGG CAGACGCCGC CGAGTTCTAC AGGCAGTTCG GCTTCATAGA CGAGGTGTGG ACAGCCGACT TTAGAAACTT CGTGAAGACG GCGGAGAAGC TCGTGGAGGC CAACGCGGTG CTCGTGCCCC ACGGCTCCTA CGTGGAATAC GTAGGCTGGA GGCAGGCGCT TGAGGCGCCG GTCCCCACCC TCGGCTGCAG AGAGCTGATA AGGTGGGAGG CGGATCAGTA CAAGAAGATG GAGCTCCTCC AGAGAGCCGG CGTCCCCACG CCGAGGGTCT ACAAAACGCC GGAGGAGGTG GATAGGCCCG TCATAGTTAA GCTCTTCGGC GCCAAGGGGG GCAGGGGGTA CTTCCTCGCC AGAGATAGAG AGGAGCTGAG GAGGAGGCTG GCCGGCCTAG GCGAGTACAT CATCCAGGAG TACGTATTCG GCGTGCCGGC CTACTACCAC TTCTTCTCCT CCCCCGTATA CGGCAGGGTG GAGGTCTTCG GCGCAGACAT CAGGTACGAA TCTAACGTAG ACGGGAGGAC CTTCGGCTGG GTGGAGCCCA CCTTCGTCGT CGTTGGCAAC CTCCCCCTGG TCCTCAGGGA GTCTCTGCTC CCAACTATAT GGAAATACGG CGTCCAGTTC GCCAAAGCCG TCGAGGAGGC GGTCGGCTGC AGGCTGGCGG GGCCCTACTG CCTGGAGTCC ATAATAAGGG ACGACATGTC CATCTCGGTC TTCGAGTTCT CAGGGCGTAT CGTGGCTGGG ACCAACATAT ACATGGGCTA CGGCTCGCCC TACTCGGTCC TCTACTTCGA CAGGCCTATG GACATGGGGG AGAGGATAGC CCACGAGATA AGGGAGGCCG CCCGGAGGGG CCGCCTAGAG GACCTCTTCA CGTAG
|
Protein sequence | MLISGANPTH VSAALKRYDV EKLAVATVAS HTALQILRGA KRFGFRTIAV AGRADAAEFY RQFGFIDEVW TADFRNFVKT AEKLVEANAV LVPHGSYVEY VGWRQALEAP VPTLGCRELI RWEADQYKKM ELLQRAGVPT PRVYKTPEEV DRPVIVKLFG AKGGRGYFLA RDREELRRRL AGLGEYIIQE YVFGVPAYYH FFSSPVYGRV EVFGADIRYE SNVDGRTFGW VEPTFVVVGN LPLVLRESLL PTIWKYGVQF AKAVEEAVGC RLAGPYCLES IIRDDMSISV FEFSGRIVAG TNIYMGYGSP YSVLYFDRPM DMGERIAHEI REAARRGRLE DLFT
|
| |