Gene Information |
Locus tag | Pars_0839 |
Symbol | |
ID | 5054442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 744674 |
End bp | 746023 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640468400 |
Product | phosphomethylpyrimidine kinase |
Protein accession | YP_001153077 |
Protein GI | 145591075 |
COG category | [H] Coenzyme transport and metabolism; [S] Function unknown |
COG ID | [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase; [COG1992] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00097] phosphomethylpyrimidine kinase |
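The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene span that includes the terminal stop codon; neither convention is stated in the record, although both are consistent with its values. The helper name `feature_lengths` is hypothetical and used only for illustration.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the span ends with
    a stop codon, which is not counted in the protein length.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    gene_len = end_bp - start_bp + 1      # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_len = gene_len // 3 - 1       # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed in the record above.
gene_len, protein_len = feature_lengths(744674, 746023)
print(gene_len, protein_len)  # 1350 449
```

Both results agree with the Gene Length and Protein Length fields listed in the table.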
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.218839 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCGGG TGGCCATAAC GATAGCTGGA CTTGACTCCG GCGGCGGCGC CGGGATACAC GCCGATATAA AGACCTTCGC CGCCATGGGC GTCCACGGGA CCACGGCTCT TACCTGCGTT ACGGCCCAAA ACACCTACGA GGTCAGGGAG GCCCAGTGCC TTGCGCCGTC TCTCGTGAGG TCTCAGATAA TGGCGGTTTG GGACGACATG GGGATAGACG CGGGGAAAAC CGGCATGCTG GGCACAAGGG AGATAATAGA AGAGGTGGCC TCCACCGTGT CTAAGCTGGG GTTCCCCCTC GTCGTTGACC CCGTAATGGT GGCAAAGTCC GGCGCGCCCT TAATCTCAGA CGACGCCGTG GACGTGCTTA GGCGGAGGCT CCTCCCAGTG GCCAAAGTGG TTACCCCCAA CAGGCCGGAG GCGGAGAGGC TCACAGGCAT GAAAATCGCC TCCGAGAAGG ACGCGGAGAG GGCTGCCGAG TATATAAACA AGGAGTACGG GACAGAAGTC GTGGTGGTTA AGGGGGGCCA CCTTGAAGGC GCCGAGGCTG TCGACGTCGT GTACTACAAG GGGTCCTTCC ACAAGTTCTC CACGCCCCGC CTGGAGTCCC GCGCCACCCA CGGGACAGGC TGTGCCTACT CGGCGGCCAT AGCCGCGGCG CTGGCCAAGG GCCTCGACCC CCTGGAGGCC ATCAAGACGG CGAAGAGGTT TATCTACACG GCGATTAAAT ACGGCGTGTC CAGGGGCAAG GGGCACTGGC CTGTGAACCC CACGGCGTGG GTGGAGATCC CGGCGGAGAG GTGGAGGGCC GCGCAAGAGC TCAACGCGGC GCTGGACTTA ATACGGAGAA ATGCCGCGGT CTTCGCCAAG GCGATACCCG AAGTCCAGTC TAATATAGGG TATGTGATCG ACCCCCGCTA CGCCGAGGGG CCGGGCGACA TCGTTGCCGT CCCGGGGCGG ATCGTAAACT ACATGGGCGA GGCGAGGCCA TCGGGCCCGC CTACCTTCGG TGCCAGTAGC CACACGGCTA GGAAGATATT GGCCTTCGTG AAAAAAGGCG CTGAGGTGAG GGCGGCCATG AACATAAGAT ACTCCCCCCA CTTGGTAGAG AAGGCCAAAT CCCTGGGCTT CAGGGTGGCT GTGGTGGATC GCCGCAAGGA GCCGGAGGAG GTTAAGCAAG TCGAGGGCGG CTCCATGGCC TGGGTGGTAG GGGAGGCCCT ATCCCAAACG GGCGGAGCGC CGCCAGACAT CATAGCAGAC CTCGGCGACT GGGGGAAGGA GCCCCAGATC ACGATCCTCG GCAGAAGTCC CGTTGAGGTG GTCGAGAAGG CCCTGCGCCT ACTTACCTAG
|
Protein sequence | MWRVAITIAG LDSGGGAGIH ADIKTFAAMG VHGTTALTCV TAQNTYEVRE AQCLAPSLVR SQIMAVWDDM GIDAGKTGML GTREIIEEVA STVSKLGFPL VVDPVMVAKS GAPLISDDAV DVLRRRLLPV AKVVTPNRPE AERLTGMKIA SEKDAERAAE YINKEYGTEV VVVKGGHLEG AEAVDVVYYK GSFHKFSTPR LESRATHGTG CAYSAAIAAA LAKGLDPLEA IKTAKRFIYT AIKYGVSRGK GHWPVNPTAW VEIPAERWRA AQELNAALDL IRRNAAVFAK AIPEVQSNIG YVIDPRYAEG PGDIVAVPGR IVNYMGEARP SGPPTFGASS HTARKILAFV KKGAEVRAAM NIRYSPHLVE KAKSLGFRVA VVDRRKEPEE VKQVEGGSMA WVVGEALSQT GGAPPDIIAD LGDWGKEPQI TILGRSPVEV VEKALRLLT
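The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, using only the Python standard library, of how one might strip the spacing, compute a GC fraction, and translate the nucleotide sequence. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons). The function names `clean`, `gc_fraction`, and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGTGGCGGG TGGCCATAAC GATAGCTGGA"
print(translate(prefix))  # MWRVAITIAG
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields of this record.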
|
| |