Gene Pars_0442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0442 
Symbol 
ID5055607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp385316 
End bp386635 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content57% 
IMG OID640468007 
Productphosphate ABC transporter, periplasmic phosphate-binding protein 
Protein accessionYP_001152692 
Protein GI145590690 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR00975] phosphate ABC transporter, phosphate-binding protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTCA AACTAATAGT CCCAGCTGTC GCGGCGGTTG TGGTTATAGC AATTGCACTG 
GCGCTTCTGG CATCTTCCCC CGGCGCGGGG GTGCCTGGCA CGACTCAAAC AACGCCTAGC
CCCAGCAAAA CTACGCCGCC TGGCTCTGGG ACAACTGGAG GGCAGAACAC ACAGCCAGCC
TCTGCTGTGG GGCAGGGCAC TCAACAGACG CCTAGCCCTA CTCAAGCTAC GCCAAGCACT
ACCCAGGCCA CGACCCAGCC GCCTCGTCTA AGCGGGACCG TTACTGGTGG CGGCTCAACA
TTTATCAACC CCCAGATGAT TGCGTGGTCT AGGAGATTCT ACGAATTGAC GGGAGCCCAG
GTCAACTACC AGTCCATAGG CTCAGGTGCG GGCGCCGCCC AGTTCTTGGC TAAGAAGCTG
GAGTTCGCCG CGTCTGACGT CCCAATGCCA AGGGACAAGT ACGAGCAGTT TAGGGGCCGG
TTTTTGCAAT TCCCCGTAGT TATAGGGTCT ATTGTCTTGG TGTACAATAT TCCGGAGGTG
GCATATGAGA AGACTGGGAA GTACCTGAAC CTAACGTCTG AGGTAATCTC GCTGATCTAC
ATGGGCGAGA TAAGGCAGTG GTGCGACGAG AGGATCCAGA AGCTGAACCC AGGTCTGAGG
CTCCCATGCA AGGACATAGT GGCTGTACAC AGGAGCGACG GCTCTGGCAC CACTGCGGCG
TTTACCTTGT ACCTGGCTGT GGCTTATCCG CCCTGGAACC AGACCGTGGG CTGGGGCTAT
ACGGTGAAGT GGCCGGCTGA CGAGAAGGCT GAGGGAACAG GCGCAAAGGG CAACGAGGGC
GTCGCCCAGA CGGTTCTCCA GACGCCCTAC TCCATCGGCT ACGTTGAGTA CGCCTATTGG
TCGCAGAACA GAGACAAGTA CGACAAGGTC GGCGGCGTTG CCTATCTGAA AAACGACAAC
GATGGAAAGT TCTACTTCCC CGCCGCCGAG TCCGTATCAG CCGGGGCCGA TGCAGGTTTA
AGACGCTACG TTGCGAAATA CGGCACCCTG CCGTCTCCAG ACGCCGACTG GAACCAAGTG
TCCATCGAAT TCACCAACCC CCCCGCCGGC TACCCGATAC TGGCCTTCGT GTATGTCTTC
TTGTGGAAGG ACTACTCAGC TGAGGGCTAC GGCTACGCCG CAACCAAGGC CGCGTTGTTG
AGAGAGTTCT TCAAGTGGGT TTTAACAATT GGGCAGACCC AGTTGGTGGA GGGCTACATA
CCGCTACCTG AGTCTGTCGC CCAGTTAGGG CTCCATGCAT TACAGCAAGT AAAGCCATAA
 
Protein sequence
MNLKLIVPAV AAVVVIAIAL ALLASSPGAG VPGTTQTTPS PSKTTPPGSG TTGGQNTQPA 
SAVGQGTQQT PSPTQATPST TQATTQPPRL SGTVTGGGST FINPQMIAWS RRFYELTGAQ
VNYQSIGSGA GAAQFLAKKL EFAASDVPMP RDKYEQFRGR FLQFPVVIGS IVLVYNIPEV
AYEKTGKYLN LTSEVISLIY MGEIRQWCDE RIQKLNPGLR LPCKDIVAVH RSDGSGTTAA
FTLYLAVAYP PWNQTVGWGY TVKWPADEKA EGTGAKGNEG VAQTVLQTPY SIGYVEYAYW
SQNRDKYDKV GGVAYLKNDN DGKFYFPAAE SVSAGADAGL RRYVAKYGTL PSPDADWNQV
SIEFTNPPAG YPILAFVYVF LWKDYSAEGY GYAATKAALL REFFKWVLTI GQTQLVEGYI
PLPESVAQLG LHALQQVKP