Gene Pars_1166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1166 
Symbol 
ID5054883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1052138 
End bp1054063 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content60% 
IMG OID640468716 
Productmetallophosphoesterase 
Protein accessionYP_001153389 
Protein GI145591387 
COG category[R] General function prediction only 
COG ID[COG1409] Predicted phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0143203 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC TTTTTTCACT CTCCATCCTC CTACTTGTCG CTCTCCTCTC CGCCGCCTCA 
ATCGTCGACC CGAGGTGGGG CACGCCGGCG TACGTCACGC CGGGAGGCGC CTTTAACATA
ACCCTTGATG CAGCCGTAGC AATTAAGAGC GTCGCCCTCG CCGCGCCGGG GCTGGAGAAG
CCCATCGAGC TGAACTACGC TACAAGCGGC AACGTAATCA CGGCGAAGGT GCCTCCCGGT
ACGCCGCTGG GGGTGTACGA CTTGGTCATA AACGAGGGCG AGCTCTACGA GCCTAAGGCT
GTGTGGGTTG GCAACGTCAC GGGGCCGCTT AGGATAATCC AGCTCACCGA CATACACGTG
GGGGTTGAGC TGGACATGGC TTCTCTCTAC CGCCTCATAC ACGGCGCGCT TGTAGCCAGC
GGCGGGCCGT ACGACGTGGT GTTCTTCACC GGAGACCTCG CAGACGTGGG GGGCCAGGTG
TGGCACTACA CGCTGTTCAC TAAATACGCC GCTACTATAC CAAAACCCGT CTTCGCAATA
CCGGGAAACC ACGACCACGC CGGCGACGAC CCCTTGACGA ACTACCAGAA GTTCGTCGGG
AGGCCCGTGT GGTACAGGGT GATAGGCCCA TACCTAATAA TAGGGCTCGA CAGCGGCTTC
GACGGCTACC TTAGAGACGA CCAGGTTAAG ATGTACGAGG AGGTGCTGAA GAGGTACCCC
GACAAGGTTA AGATTGTCCT CATCCACCAC CCGCCCTTCT ACTTGCAAGA CAGTTACGTC
GTTGAGACGT ACAGGGGGCC CCAAGACGTG GATCGGCTGA ACAAGGACCC CACTGGGAGG
AGGAGCTACT ACATCGTATA CACCAGCTAC CTCCAGAACA GGCCCGCTTT CGAGAAGTTC
CTTGACCTCA CCATTAGGTA CAAGGTGGCT CTGGTAATGG CCGGCCACGT CCACAGCGGC
AACTCCACCA TTGTGATAAA CGGGACCTAC TTCGTCACCA CGAGGACGCT GGGCGGCTCC
ATTGACACCT CCCACGGCTT TAGGACATAC GTGGTGTACC CAGACGGCCG CGTCGAGGTT
GACAAGGAGT TGTGGACTTT CAAAAACTTC ACAGTCCTGA CCTGGGGCAC TAAGGCGGCT
CAGATATACG CCGAGAGCTC TCTACTGCCG CCGACTATCA CCATCGACCT CCCGGGGGAG
TTCGCCGGAC TGAAGGTGTT CAACGGCACT GCCGAGCTGG TAAAGGCGGA GAGGCACCCC
CTGGGCAAAT ACACCAGGTA CACCCTGAAG ATTGCCGGGA AGAGGCTGTG GCTCGCGTTG
GGCGACTACC AGCCCTCCCC CACAGTGGAG GTGGTTAGGA TACTCCCCAG GTCTCCCACT
CCCGGCGACG TGGTCACAGT GACCATTAGG GCAACCGACC CCAACGTAGG CGTGCCCTTC
ATCACCGTAA ACGGCCAGAG GATACTCGCG TCGTACCTAG AAGAGCAACC GGTCTACATC
TACAAGTTTA AGTACACCGC CCCCACCACG TTGCAAGCAG CGGCGCCTAG TGGCAAGCCC
CTAAGCCTCC AGATAGGCCA GCCCACAACT ACCACGCCGC CCACACCAAC CCCACCGCCT
ACAACCACCA CAACACCCCA AACAACGACT GTGACGACGA CGCCTTCAGC CACCCCGACT
GCCACCCCCA CCACGCAGAC CGCCACATCC CCCACGGCGA CGCCCAGCGC CACGGCACCC
CCCGCGACTA CTCAGCCAAC TCCGGCGCCG ACAGCTGCCG CTACCGCGAC ACCTGCGCCT
TCCTACCAGC CCTCGGCCTT CCCAGTAGAG GCAGTTATTT TGTTGGTAAT CGCCGTCGCC
GGCGCCGCGG TTATCGCAGT GACAGCGAGG AAAAAGCCGA CGTCTGAAGA GGCCAAGACT
AGATAG
 
Protein sequence
MKKLFSLSIL LLVALLSAAS IVDPRWGTPA YVTPGGAFNI TLDAAVAIKS VALAAPGLEK 
PIELNYATSG NVITAKVPPG TPLGVYDLVI NEGELYEPKA VWVGNVTGPL RIIQLTDIHV
GVELDMASLY RLIHGALVAS GGPYDVVFFT GDLADVGGQV WHYTLFTKYA ATIPKPVFAI
PGNHDHAGDD PLTNYQKFVG RPVWYRVIGP YLIIGLDSGF DGYLRDDQVK MYEEVLKRYP
DKVKIVLIHH PPFYLQDSYV VETYRGPQDV DRLNKDPTGR RSYYIVYTSY LQNRPAFEKF
LDLTIRYKVA LVMAGHVHSG NSTIVINGTY FVTTRTLGGS IDTSHGFRTY VVYPDGRVEV
DKELWTFKNF TVLTWGTKAA QIYAESSLLP PTITIDLPGE FAGLKVFNGT AELVKAERHP
LGKYTRYTLK IAGKRLWLAL GDYQPSPTVE VVRILPRSPT PGDVVTVTIR ATDPNVGVPF
ITVNGQRILA SYLEEQPVYI YKFKYTAPTT LQAAAPSGKP LSLQIGQPTT TTPPTPTPPP
TTTTTPQTTT VTTTPSATPT ATPTTQTATS PTATPSATAP PATTQPTPAP TAAATATPAP
SYQPSAFPVE AVILLVIAVA GAAVIAVTAR KKPTSEEAKT R