Gene Pars_0839 details

Gene Information

Locus tag: Pars_0839
Symbol:
ID: 5054442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 744674
End bp: 746023
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 63%
IMG OID: 640468400
Product: phosphomethylpyrimidine kinase
Protein accession: YP_001153077
Protein GI: 145591075
COG category: [H] Coenzyme transport and metabolism; [S] Function unknown
COG ID: [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase; [COG1992] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00097] phosphomethylpyrimidine kinase
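
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The constant and function names are illustrative and are not part of the record.

```python
# Values copied from the gene record.
START_BP = 744674
END_BP = 746023
GENE_LENGTH_BP = 1350
PROTEIN_LENGTH_AA = 449


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon
    (assumption: the stop codon is counted in the gene length)."""
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```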


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.218839
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGCGGG TGGCCATAAC GATAGCTGGA CTTGACTCCG GCGGCGGCGC CGGGATACAC 
GCCGATATAA AGACCTTCGC CGCCATGGGC GTCCACGGGA CCACGGCTCT TACCTGCGTT
ACGGCCCAAA ACACCTACGA GGTCAGGGAG GCCCAGTGCC TTGCGCCGTC TCTCGTGAGG
TCTCAGATAA TGGCGGTTTG GGACGACATG GGGATAGACG CGGGGAAAAC CGGCATGCTG
GGCACAAGGG AGATAATAGA AGAGGTGGCC TCCACCGTGT CTAAGCTGGG GTTCCCCCTC
GTCGTTGACC CCGTAATGGT GGCAAAGTCC GGCGCGCCCT TAATCTCAGA CGACGCCGTG
GACGTGCTTA GGCGGAGGCT CCTCCCAGTG GCCAAAGTGG TTACCCCCAA CAGGCCGGAG
GCGGAGAGGC TCACAGGCAT GAAAATCGCC TCCGAGAAGG ACGCGGAGAG GGCTGCCGAG
TATATAAACA AGGAGTACGG GACAGAAGTC GTGGTGGTTA AGGGGGGCCA CCTTGAAGGC
GCCGAGGCTG TCGACGTCGT GTACTACAAG GGGTCCTTCC ACAAGTTCTC CACGCCCCGC
CTGGAGTCCC GCGCCACCCA CGGGACAGGC TGTGCCTACT CGGCGGCCAT AGCCGCGGCG
CTGGCCAAGG GCCTCGACCC CCTGGAGGCC ATCAAGACGG CGAAGAGGTT TATCTACACG
GCGATTAAAT ACGGCGTGTC CAGGGGCAAG GGGCACTGGC CTGTGAACCC CACGGCGTGG
GTGGAGATCC CGGCGGAGAG GTGGAGGGCC GCGCAAGAGC TCAACGCGGC GCTGGACTTA
ATACGGAGAA ATGCCGCGGT CTTCGCCAAG GCGATACCCG AAGTCCAGTC TAATATAGGG
TATGTGATCG ACCCCCGCTA CGCCGAGGGG CCGGGCGACA TCGTTGCCGT CCCGGGGCGG
ATCGTAAACT ACATGGGCGA GGCGAGGCCA TCGGGCCCGC CTACCTTCGG TGCCAGTAGC
CACACGGCTA GGAAGATATT GGCCTTCGTG AAAAAAGGCG CTGAGGTGAG GGCGGCCATG
AACATAAGAT ACTCCCCCCA CTTGGTAGAG AAGGCCAAAT CCCTGGGCTT CAGGGTGGCT
GTGGTGGATC GCCGCAAGGA GCCGGAGGAG GTTAAGCAAG TCGAGGGCGG CTCCATGGCC
TGGGTGGTAG GGGAGGCCCT ATCCCAAACG GGCGGAGCGC CGCCAGACAT CATAGCAGAC
CTCGGCGACT GGGGGAAGGA GCCCCAGATC ACGATCCTCG GCAGAAGTCC CGTTGAGGTG
GTCGAGAAGG CCCTGCGCCT ACTTACCTAG
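
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation such as GC content. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs on only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the 10-base blocks."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, kept in their displayed layout.
first_blocks = "ATGTGGCGGG TGGCCATAAC"
print(len(clean_sequence(first_blocks)))
print(gc_percent(first_blocks))
```

Passing the full gene sequence to `gc_percent` should give a value that can be compared with the GC content reported in the record.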
 
Protein sequence
MWRVAITIAG LDSGGGAGIH ADIKTFAAMG VHGTTALTCV TAQNTYEVRE AQCLAPSLVR 
SQIMAVWDDM GIDAGKTGML GTREIIEEVA STVSKLGFPL VVDPVMVAKS GAPLISDDAV
DVLRRRLLPV AKVVTPNRPE AERLTGMKIA SEKDAERAAE YINKEYGTEV VVVKGGHLEG
AEAVDVVYYK GSFHKFSTPR LESRATHGTG CAYSAAIAAA LAKGLDPLEA IKTAKRFIYT
AIKYGVSRGK GHWPVNPTAW VEIPAERWRA AQELNAALDL IRRNAAVFAK AIPEVQSNIG
YVIDPRYAEG PGDIVAVPGR IVNYMGEARP SGPPTFGASS HTARKILAFV KKGAEVRAAM
NIRYSPHLVE KAKSLGFRVA VVDRRKEPEE VKQVEGGSMA WVVGEALSQT GGAPPDIIAD
LGDWGKEPQI TILGRSPVEV VEKALRLLT
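
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the reading frame starts at the first base. Translation table 11 assigns the same amino acids to codons as the standard code, so the standard 64-codon table is used here. Alternative start codons are not handled, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGTGGCGGG TGGCCATAAC GATAGCTGGA"))  # MWRVAITIAG
```

The output matches the first block of the protein sequence. Translating the last three blocks of the gene (`GTCGAGAAGG CCCTGCGCCT ACTTACCTAG`) in the same way gives `VEKALRLLT`, the end of the protein, with the final `TAG` read as the stop codon.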